CDD/SPARCLE: the conserved domain database in 2020

Shennan Lu(National Institutes of Health), Jiyao Wang(National Institutes of Health), Farideh Chitsaz(National Institutes of Health), Myra K. Derbyshire(National Institutes of Health), Renata C. Geer(National Institutes of Health), Noreen R. Gonzales(National Institutes of Health), Marc Gwadz(National Institutes of Health), David I. Hurwitz(National Institutes of Health), Gabriele H. Marchler(National Institutes of Health), James S. Song(National Institutes of Health), Narmada Thanki(National Institutes of Health), Roxanne A. Yamashita(National Institutes of Health), Mingzhang Yang(National Institutes of Health), Dachuan Zhang(National Institutes of Health), Chanjuan Zheng(National Institutes of Health), Christopher J. Lanczycki(National Institutes of Health), Aron Marchler‐Bauer(National Institutes of Health)
Nucleic Acids Research
November 13, 2019
Cited by 3,289Open Access
Full Text

Abstract

As NLM's Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, CDD curation staff continues to develop hierarchical classifications of widely distributed protein domain families, and to record conserved sites associated with molecular function, so that they can be mapped onto user queries in support of hypothesis-driven biomolecular research. CDD offers both an archive of pre-computed domain annotations as well as live search services for both single protein or nucleotide queries and larger sets of protein query sequences. CDD staff has continued to characterize protein families via conserved domain architectures and has built up a significant corpus of curated domain architectures in support of naming bacterial proteins in RefSeq. These architecture definitions are available via SPARCLE, the Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.


Related Papers

No related papers found

Powered by citation graph analysis