Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Josh Abramson(Google DeepMind (United Kingdom)), Rui Zhu, Arun Ahuja, Tamara von Glehn, Alex Goldin, Petko Georgiev, Federico Carnevale, Guy Scully, Jirka Lhotka, Nathaniel Wong, Timothy Lillicrap(Google DeepMind (United Kingdom)), Alden Hung, Yan Chen(Florida International University), George Powell, Sanjana Srivastava, Adam Santoro, Greg Wayne(Google (United Kingdom)), Alistair Muldal, Jessica Landon

arXiv (Cornell University)

November 21, 2022

10.48550/arxiv.2211.11602

Cited by 5

Related Papers

Accurate structure prediction of biomolecular interactions with AlphaFold 3

|Nature|2024|13.6k

Emergence of Locomotion Behaviours in Rich Environments

|arXiv (Cornell University)|2017|669

DeepMind Control Suite

|arXiv (Cornell University)|2018|523

Addendum: Accurate structure prediction of biomolecular interactions with AlphaFold 3

|Nature|2024|292

Generative AI in Medical Practice: In-Depth Exploration of Privacy and Security Challenges

|Journal of Medical Internet Research|2024|289