Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Josh Abramson (Google DeepMind (United Kingdom)), Rui Zhu, Arun Ahuja, Tamara von Glehn, Alex Goldin, Petko Georgiev, Federico Carnevale, Guy Scully, Jirka Lhotka, Nathaniel Wong, Timothy Lillicrap, Alden Hung, Yan Chen (Florida International University), George Powell, Sanjana Srivastava, Adam Santoro, Greg Wayne, Alistair Muldal, Jessica Landon