Emergence of Locomotion Behaviours in Rich Environments
Nicolas Heess(Google DeepMind (United Kingdom)), David Silver(Google DeepMind (United Kingdom)), Greg Wayne, Jay Lemmon, S. M. Ali Eslami, Martin Riedmiller(California Institute of Technology), Josh Merel, Tom Erez, Ziyu Wang, Dhruva Tb, Sriram Srinivasan(Amrita Vishwa Vidyapeetham), Yuval Tassa
Cited by 669
Related Papers
Human-level control through deep reinforcement learning
|Nature|2015|29.9k
Playing Atari with Deep Reinforcement Learning
|arXiv (Cornell University)|2013|5.1k
A direct adaptive method for faster backpropagation learning: the RPROP algorithm
|IEEE International Conference on Neural Networks|2002|3.9k
Striving for Simplicity: The All Convolutional Net
|arXiv (Cornell University)|2014|2.6k
Deterministic policy gradient algorithms
|HAL (Le Centre pour la Communication Scientifique Directe)|2014|1.7k