Bigtable: a distributed storage system for structured data

Fay W. Chang(Google (United States)), Jay B. Dean(Google (United States)), Sanjay Ghemawat(Google (United States)), Wilson C. Hsieh(Google (United States)), Deborah A. Wallach(Google (United States)), Mike Burrows(Google (United States)), Tushar Chandra(Google (United States)), Andrew Fikes(Google (United States)), Robert Gruber(Google (United States))
Unknown
November 6, 2006
Cited by 1,955

Abstract

Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Fi-nance. These applications place very different demands on Bigtable, both in terms of data size (from URLs to web pages to satellite imagery) and latency requirements (from backend bulk processing to real-time data serving). Despite these varied demands, Bigtable has successfully provided a exible, high-performance solution for all of these Google products. In this paper we describe the sim-ple data model provided by Bigtable, which gives clients dynamic control over data layout and format, and we de-scribe the design and implementation of Bigtable. 1


Related Papers

No related papers found

Powered by citation graph analysis