Linear Least-Squares Algorithms for Temporal Difference Learning

Steven J. Bradtke (University of Massachusetts Amherst), Andrew G. Barto (University of Massachusetts Amherst)
Machine Learning
January 1, 1996
Cited by 639


Related Papers

Introduction to Reinforcement Learning | MIT Press eBooks | 1998 | 6.9k
Reinforcement Learning | IFAC Proceedings Volumes | 1998 | 3k
Learning to act using real-time dynamic programming | Artificial Intelligence | 1995 | 1.1k
Recent Advances in Hierarchical Reinforcement Learning | Discrete Event Dynamic Systems | 2003 | 1k