H

Hua Cheng

Harvard University

ORCID: 0000-0002-0079-8820

Publishes on T-cell and Retrovirus Studies, Animal Disease Management and Epidemiology, Vector-Borne Animal Diseases. 127 papers and 3.3k citations.

127Publications
3.3kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

ECOD: An Evolutionary Classification of Protein Domains
Hua Cheng, R. Dustin Schaeffer, Yuxing Liao et al.|PLoS Computational Biology|2014
Cited by 473Open Access

Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.

Prognostic significance of frequent CLDN18-ARHGAP26/6 fusion in gastric signet-ring cell cancer
Yang Shu, Weihan Zhang, Qianqian Hou et al.|Nature Communications|2018
Cited by 142Open Access

Signet-ring cell carcinoma (SRCC) has specific epidemiology and oncogenesis in gastric cancer, however, with no systematical investigation for prognostic genomic features. Here we report a systematic investigation conducted in 1868 Chinese gastric cancer patients indicating that signet-ring cells content was related to multiple clinical characteristics and treatment outcomes. We thus perform whole-genome sequencing on 32 pairs of SRC samples, and identify frequent CLDN18-ARHGAP26/6 fusion (25%). With 797 additional patients for validation, prevalence of CLDN18-ARHGAP26/6 fusion is noticed to be associated with signet-ring cell content, age at diagnosis, female/male ratio, and TNM stage. Importantly, patients with CLDN18-ARHGAP26/6 fusion have worse survival outcomes, and get no benefit from oxaliplatin/fluoropyrimidines-based chemotherapy, which is consistent with the fact of chemo-drug resistance acquired in CLDN18-ARHGAP26 introduced cell lines. Overall, this study provides insights into the clinical and genomic features of SRCC, and highlights the importance of frequent CLDN18-ARHGAP26/6 fusions in chemotherapy response for SRCC.

CASP9 assessment of free modeling target predictions
Lisa N. Kinch, Shuo Yong Shi, Qian Cong et al.|Proteins Structure Function and Bioinformatics|2011
Cited by 117

We present an overview of the ninth round of Critical Assessment of Protein Structure Prediction (CASP9) "Template free modeling" category (FM). Prediction models were evaluated using a combination of established structural and sequence comparison measures and a novel automated method designed to mimic manual inspection by capturing both global and local structural features. These scores were compared to those assigned manually over a diverse subset of target domains. Scores were combined to compare overall performance of participating groups and to estimate rank significance. Moreover, we discuss a few examples of free modeling targets to highlight the progress and bottlenecks of current prediction methods. Notably, a server prediction model for a single target (T0581) improved significantly over the closest structure template (44% GDT increase). This accomplishment represents the "winner" of the CASP9 FM category. A number of human expert groups submitted slight variations of this model, highlighting a trend for human experts to act as "meta predictors" by correctly selecting among models produced by the top-performing automated servers. The details of evaluation are available at http://prodata.swmed.edu/CASP9/ .