Improving Multimodal Interactive Agents with Reinforcement Learning from Human FeedbackJosh Abramson, Rui Zhu, Arun Ahuja et al.|arXiv (Cornell University)|2022Cited by 5