Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextGemini Robotics Team, Samer Hassan, Zach Gleicher et al.|arXiv (Cornell University)|2024Cited by 282
Imitating Interactive IntelligenceJosh Abramson, Rui Zhu, Arun Ahuja et al.|arXiv (Cornell University)|2020Cited by 43
Creating Multimodal Interactive Agents with Imitation and\n Self-Supervised LearningDeepMind Interactive Agents Team, Rui Zhu, Josh Abramson et al.|arXiv (Cornell University)|2021Cited by 32
Creating Multimodal Interactive Agents with Imitation and Self-Supervised LearningDeepMind Interactive Agents Team, Rui Zhu, Josh Abramson et al.|arXiv (Cornell University)|2021Cited by 15
A data-driven approach for learning to control computersPeter C. Humphreys, Timothy Lillicrap, David Raposo et al.|arXiv (Cornell University)|2022Cited by 7