Scalability in Perception for Autonomous Driving: Waymo Open Dataset

Pei Sun(Nomor Research (Germany)), Henrik Kretzschmar(Nomor Research (Germany)), Xerxes Dotiwalla(Nomor Research (Germany)), Aurélien Chouard(Nomor Research (Germany)), Vijaysai Patnaik(Nomor Research (Germany)), Paul Tsui(Nomor Research (Germany)), James C. Y. Guo(Nomor Research (Germany)), Yin Zhou(Nomor Research (Germany)), Yuning Chai(Nomor Research (Germany)), Benjamin Caine(Google (United States)), Vijay Vasudevan(Google (United States)), Wei Han(Google (United States)), Jiquan Ngiam(Google (United States)), Hang Zhao(Nomor Research (Germany)), Aleksei Timofeev(Nomor Research (Germany)), Scott Ettinger(Nomor Research (Germany)), Maxim Krivokon(Nomor Research (Germany)), Amy Gao(Nomor Research (Germany)), Aditya Joshi(Nomor Research (Germany)), Yu Zhang(Nomor Research (Germany)), Jonathon Shlens(Google (United States)), Zhifeng Chen(Google (United States)), Dragomir Anguelov(Nomor Research (Germany))
Unknown
June 1, 2020
Cited by 2,989

Abstract

The research community has increasing interest in autonomous driving research, despite the resource intensity of obtaining representative real world data. Existing self-driving datasets are limited in the scale and variation of the environments they capture, even though generalization within and between operating regions is crucial to the over-all viability of the technology. In an effort to help align the research community’s contributions with real-world self-driving problems, we introduce a new large scale, high quality, diverse dataset. Our new dataset consists of 1150 scenes that each span 20 seconds, consisting of well synchronized and calibrated high quality LiDAR and camera data captured across a range of urban and suburban geographies. It is 15x more diverse than the largest camera+LiDAR dataset available based on our proposed diversity metric. We exhaustively annotated this data with 2D (camera image) and 3D (LiDAR) bounding boxes, with consistent identifiers across frames. Finally, we provide strong baselines for 2D as well as 3D detection and tracking tasks. We further study the effects of dataset size and generalization across geographies on 3D detection methods. Find data, code and more up-to-date information at http://www.waymo.com/open.


Related Papers

No related papers found

Powered by citation graph analysis