Non-Cooperative Inverse Reinforcement LearningXiangyuan Zhang, Tamer Başar, Kaiqing Zhang et al.|arXiv (Cornell University)|2019Cited by 15
Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample ComplexityKaiqing Zhang, Tamer Başar, Xiangyuan Zhang et al.|arXiv (Cornell University)|2021Cited by 8