Optimizing agent behavior over long time scales by transporting value

Chia-Chun Hung, Greg Wayne, Josh Abramson (Google DeepMind (United Kingdom)), Mehdi Mirza, Federico Carnevale, Timothy Lillicrap, Yan Wu (Google DeepMind (United Kingdom)), Arun Ahuja
Nature Communications
November 19, 2019
Cited by 17

