A robot that reinforcement-leams to identify and memorize important previous observations

Boudewijn Bakker; Viktor Zhumatiy; G. Gruener; Jürgen Schmidhuber

doi:10.1109/iros.2003.1250667

A robot that reinforcement-leams to identify and memorize important previous observations

Boudewijn Bakker(Dalle Molle Institute for Artificial Intelligence Research), Viktor Zhumatiy(Dalle Molle Institute for Artificial Intelligence Research), G. Gruener(Swiss Center for Electronics and Microtechnology (Switzerland)), Jürgen Schmidhuber(Dalle Molle Institute for Artificial Intelligence Research)

Unknown

July 8, 2004

10.1109/iros.2003.1250667

Cited by 56

Abstract

It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: (1) reinforcement learning with memory, implemented using an LSTM recurrent neural network whose inputs are discrete events extracted from raw inputs; (2) online exploration and offline policy learning. An experiment with a real robot demonstrates the methodology's feasibility.

Related Papers

Long Short-Term Memory

Sepp Hochreiter, Jürgen Schmidhuber|Neural Computation|1997|97.1k

Reinforcement Learning: A Survey

Leslie Pack Kaelbling, Michael L. Littman, Andrew Moore|Journal of Artificial Intelligence Research|1996|8.8k

Self-improving reactive agents based on reinforcement learning, planning and teaching

Long-Ji Lin|Machine Learning|1992|1.6k

Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming

Richard S. Sutton|Elsevier eBooks|1990|1.4k

Reinforcement Learning with Long Short-Term Memory

Bram Bakker|Unknown|2001|194