A robot that reinforcement-leams to identify and memorize important previous observations

Boudewijn Bakker(Dalle Molle Institute for Artificial Intelligence Research), Viktor Zhumatiy(Dalle Molle Institute for Artificial Intelligence Research), G. Gruener(Swiss Center for Electronics and Microtechnology (Switzerland)), Jürgen Schmidhuber(Dalle Molle Institute for Artificial Intelligence Research)
Unknown
July 8, 2004
Cited by 56

Abstract

It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: (1) reinforcement learning with memory, implemented using an LSTM recurrent neural network whose inputs are discrete events extracted from raw inputs; (2) online exploration and offline policy learning. An experiment with a real robot demonstrates the methodology's feasibility.


Related Papers