Adaptive linear quadratic control using policy iteration | Steven J. Bradtke, Andrew G. Barto, B. Erik Ydstie | Unknown | 2005 | Cited by 417