Optimizing agent behavior over long time scales by transporting value

Chia-Chun Hung, Greg Wayne (Google (United Kingdom)), Josh Abramson (Google DeepMind (United Kingdom)), Mehdi Mirza, Federico Carnevale, Timothy Lillicrap (Google DeepMind (United Kingdom)), Yan Wu (Google DeepMind (United Kingdom)), Arun Ahuja

Nature Communications

November 19, 2019

