Optimizing agent behavior over long time scales by transporting value
Chia-Chun Hung, Greg Wayne, Josh Abramson(Google DeepMind (United Kingdom)), Mehdi Mirza, Federico Carnevale, Timothy Lillicrap, Yan Wu(Google DeepMind (United Kingdom)), Arun Ahuja
Cited by 17
Related Papers
Emergence of Locomotion Behaviours in Rich Environments
|arXiv (Cornell University)|2017|669
DeepMind Control Suite
|arXiv (Cornell University)|2018|523
Addendum: Accurate structure prediction of biomolecular interactions with AlphaFold 3
|Nature|2024|292
The SuperSID project: exploiting high-level information for high-accuracy speaker recognition
|2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).|2004|221