CDD: NCBI's conserved domain database

Aron Marchler‐Bauer(National Institutes of Health), Myra K. Derbyshire(National Center for Biotechnology Information), Noreen R. Gonzales(National Institutes of Health), Shennan Lu(National Institutes of Health), Farideh Chitsaz(National Institutes of Health), Lewis Y. Geer(National Institutes of Health), Renata C. Geer(National Center for Biotechnology Information), Jane He(National Institutes of Health), Marc Gwadz(National Center for Biotechnology Information), David I. Hurwitz(National Institutes of Health), Christopher J. Lanczycki(National Center for Biotechnology Information), Fu Lu(National Center for Biotechnology Information), Gabriele H. Marchler(National Institutes of Health), James S. Song(National Center for Biotechnology Information), Narmada Thanki(National Institutes of Health), Zhouxi Wang(National Center for Biotechnology Information), Roxanne A. Yamashita(National Center for Biotechnology Information), Dachuan Zhang(National Institutes of Health), Chanjuan Zheng(National Center for Biotechnology Information), Stephen H. Bryant(National Institutes of Health)
Nucleic Acids Research
November 20, 2014
Cited by 3,587Open Access
Full Text

Abstract

NCBI's CDD, the Conserved Domain Database, enters its 15(th) year as a public resource for the annotation of proteins with the location of conserved domain footprints. Going forward, we strive to improve the coverage and consistency of domain annotation provided by CDD. We maintain a live search system as well as an archive of pre-computed domain annotation for sequences tracked in NCBI's Entrez protein database, which can be retrieved for single sequences or in bulk. We also maintain import procedures so that CDD contains domain models and domain definitions provided by several collections available in the public domain, as well as those produced by an in-house curation effort. The curation effort aims at increasing coverage and providing finer-grained classifications of common protein domains, for which a wealth of functional and structural data has become available. CDD curation generates alignment models of representative sequence fragments, which are in agreement with domain boundaries as observed in protein 3D structure, and which model the structurally conserved cores of domain families as well as annotate conserved features. CDD can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.


Related Papers

No related papers found

Powered by citation graph analysis