DDBJ launches a new archive database with analytical tools for next-generation sequence data

Eli Kaminuma (Research Organization of Information and Systems), Jun Mashima (National Institute of Genetics), Yuichi Kodama (National Institute of Genetics), Takashi Gojobori (Research Organization of Information and Systems), Osamu Ogasawara (Research Organization of Information and Systems), Kousaku Okubo (National Institute of Genetics), Toshihisa Takagi (National Institute of Genetics), Yasukazu Nakamura (Research Organization of Information and Systems)
Nucleic Acids Research
October 22, 2009
Cited by 100
Open Access

Abstract

The DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) collected and released 1,701,110 entries (1,116,138,614 bases) between July 2008 and June 2009. Highlighted data releases from DDBJ included the complete genome sequence of an endosymbiont within protist cells in the termite gut, and Cap Analysis Gene Expression tags for human and mouse deposited by the Functional Annotation of the Mammalian cDNA consortium. In this period, we started a new user announcement service that uses Really Simple Syndication (RSS) to deliver a daily list of data released from DDBJ. We also attempted a comprehensive visualization of DDBJ release data using a word cloud program. Moreover, we launched a new archive for data from next-generation sequencers, the 'DDBJ Read Archive' (DRA). Concurrently, as a preliminary step, we released a semi-automatic annotation tool for read data registered in DRA, called the 'DDBJ Read Annotation Pipeline'. The pipeline consists of two parts: basic analysis, covering reference genome mapping and de novo assembly, and high-level analysis, covering structural and functional annotation. These new services will aid users' research and provide easier access to DDBJ databases.

