Emergence of Locomotion Behaviours in Rich Environments

Nicolas Heess (Google DeepMind (United Kingdom)), David Silver (Google DeepMind (United Kingdom)), Greg Wayne, Jay Lemmon, S. M. Ali Eslami, Martin Riedmiller (California Institute of Technology), Josh Merel, Tom Erez, Ziyu Wang, Dhruva TB, Sriram Srinivasan (Amrita Vishwa Vidyapeetham), Yuval Tassa
arXiv (Cornell University)
July 7, 2017
Cited by 669
