Linear Least-Squares algorithms for temporal difference learning

Steven J. Bradtke(University of Massachusetts Amherst), Andrew G. Barto(Tata Institute of Fundamental Research)

Machine Learning

January 1, 1996

Cited by 639

Related Papers

|MIT Press eBooks|1998|6.9k

|IFAC Proceedings Volumes|1998|3k

|Psychological Review|1981|1.5k

|Artificial Intelligence|1995|1.1k

|Discrete Event Dynamic Systems|2003|1k