RT-1: Robotics Transformer for Real-World Control at Scale

Anthony Brohan; Noah Brown; Justice Carbajal; Yevgen Chebotar; Joseph Dabis; Chelsea Finn; Keerthana Gopalakrishnan; Karol Hausman; Alexander Herzog; Jasmine Hsu; Julian Ibarz; Brian Ichter; Alex Irpan; Tomas Jackson; Sally Jesmonth; Nikhil Joshi; Ryan Julian; Dmitry Kalashnikov; Yuheng Kuang; Isabel Leal; Kuang-Huei Lee; Sergey Levine; Yao Lu; Utsav Malla; Deeksha Manjunath; Igor Mordatch; Ofir Nachum; Carolina Parada; Jodilyn Peralta; Emily Pérez; Karl Pertsch; Jornell Quiambao; Kanishka Rao; Michael S. Ryoo; Grecia Salazar; Pannag Sanketi; Kevin Sayed; Jaspiar Singh; Sumedh Sontakke; Austin V. Stone; Clayton Tan; Huong Tran; Vincent Vanhoucke; Steve Vega; Quan Vuong; Fei Xia; Ted Xiao; Peng Xu; Sichun Xu; Tianhe Yu; Brianna Zitkovich

doi:10.15607/rss.2023.xix.025

RT-1: Robotics Transformer for Real-World Control at Scale

Anthony Brohan(Google (United States)), Noah Brown(Google (United States)), Justice Carbajal(Google (United States)), Yevgen Chebotar(Google (United States)), Joseph Dabis(Google (United States)), Chelsea Finn(Google (United States)), Keerthana Gopalakrishnan(Google (United States)), Karol Hausman(Google (United States)), Alexander Herzog, Jasmine Hsu(Google (United States)), Julian Ibarz(Google (United States)), Brian Ichter(Google (United States)), Alex Irpan(Google (United States)), Tomas Jackson(Google (United States)), Sally Jesmonth(Google (United States)), Nikhil Joshi(Google (United States)), Ryan Julian, Dmitry Kalashnikov(Google (United States)), Yuheng Kuang(Google (United States)), Isabel Leal(Google (United States)), Kuang-Huei Lee(Google (United States)), Sergey Levine(Google (United States)), Yao Lu(Google (United States)), Utsav Malla(Google (United States)), Deeksha Manjunath(Google (United States)), Igor Mordatch(Google (United States)), Ofir Nachum(Google (United States)), Carolina Parada(Google (United States)), Jodilyn Peralta(Google (United States)), Emily Pérez(Google (United States)), Karl Pertsch(Google (United States)), Jornell Quiambao(Google (United States)), Kanishka Rao(Google (United States)), Michael S. Ryoo(Google (United States)), Grecia Salazar(Google (United States)), Pannag Sanketi(Google (United States)), Kevin Sayed(Google (United States)), Jaspiar Singh(Google (United States)), Sumedh Sontakke(Brain (Germany)), Austin V. Stone(Google (United States)), Clayton Tan(Google (United States)), Huong Tran(Google (United States)), Vincent Vanhoucke(Google (United States)), Steve Vega(Google (United States)), Quan Vuong(Google (United States)), Fei Xia(Google (United States)), Ted Xiao(Google (United States)), Peng Xu(Google (United States)), Sichun Xu(Google (United States)), Tianhe Yu(Google (United States)), Brianna Zitkovich(Google (United States))

Unknown

July 10, 2023

10.15607/rss.2023.xix.025

Cited by 506Open Access

Full Text

Abstract

By transferring knowledge from large, diverse, taskagnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small taskspecific datasets to a high level of performance.While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, where the generalization capabilities of the models are particularly critical due to the difficulty of collecting real-world robotic data.We argue that one of the keys to the success of such general robotic models lies with open-ended task-agnostic training, combined with highcapacity architectures that can absorb all of the diverse, robotic data.In this paper, we present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties.We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks.

Related Papers

No related papers found

Powered by citation graph analysis