GSA: Genome Sequence Archive

Yanqing Wang; Fuhai Song; Junwei Zhu; Sisi Zhang; Yadong Yang; Tingting Chen; Bixia Tang; Lili Dong; Nan Ding; Qian Zhang; Zhouxian Bai; Xunong Dong; Huanxin Chen; Mingyuan Sun; Shuang Zhai; Yubin Sun; Lei Yu; Lan Li; Jingfa Xiao; Xiangdong Fang; Hongxing Lei; Zhang Zhang; Wenming Zhao

doi:10.1016/j.gpb.2017.01.001

GSA: Genome Sequence Archive

Yanqing Wang(Beijing Institute of Genomics), Fuhai Song(Chinese Academy of Sciences), Junwei Zhu(Beijing Institute of Genomics), Sisi Zhang(Beijing Institute of Genomics), Yadong Yang(Chinese Academy of Sciences), Tingting Chen(Beijing Institute of Genomics), Bixia Tang(Beijing Institute of Genomics), Lili Dong(Beijing Institute of Genomics), Nan Ding(Chinese Academy of Sciences), Qian Zhang(Chinese Academy of Sciences), Zhouxian Bai(Chinese Academy of Sciences), Xunong Dong(Chinese Academy of Sciences), Huanxin Chen(Beijing Institute of Genomics), Mingyuan Sun(Beijing Institute of Genomics), Shuang Zhai(Beijing Institute of Genomics), Yubin Sun(Beijing Institute of Genomics), Lei Yu(Beijing Institute of Genomics), Lan Li(Beijing Institute of Genomics), Jingfa Xiao(Chinese Academy of Sciences), Xiangdong Fang(Chinese Academy of Sciences), Hongxing Lei(Chinese Academy of Sciences), Zhang Zhang(Chinese Academy of Sciences), Wenming Zhao(Fudan University)

Genomics Proteomics & Bioinformatics

February 1, 2017

10.1016/j.gpb.2017.01.001

Cited by 833Open Access

Full Text

Abstract

With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.

David Wheeler|Nucleic Acids Research|2003|11k

A New Initiative on Precision Medicine

Francis S. Collins, Harold Varmus|New England Journal of Medicine|2015|5.2k

Database resources of the National Center for Biotechnology Information

|Nucleic Acids Research|2015|1.6k

GSA: Genome Sequence Archive

Yanqing Wang, Fuhai Song, Junwei Zhu et al.|Genomics Proteomics & Bioinformatics|2017|833

Large-scale whole-genome sequencing of the Icelandic population

Daníel F. Guðbjartsson, Hannes Helgason, Sigurjón A. Guðjónsson et al.|Nature Genetics|2015|832

GSA: Genome Sequence Archive

Abstract

Related Papers