Deterministic policy gradient algorithms

David Silver (Google (United Kingdom)), Martin Riedmiller (Google (United States)), Thomas Degris (Google DeepMind (United Kingdom)), Guy Lever, Daan Wierstra (Google DeepMind (United Kingdom)), Nicolas Heess (Google DeepMind (United Kingdom))

HAL (Le Centre pour la Communication Scientifique Directe)

January 1, 2014

Cited by 1,742
