H

Hai Fang

Shanghai Jiao Tong University

ORCID: 0000-0003-3961-8572

Publishes on Bioinformatics and Genomic Networks, Gene expression and cancer classification, Biomedical Text Mining and Ontologies. 166 papers and 9.3k citations.

166Publications
9.3kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

A large-scale evaluation of computational protein function prediction
Predrag Radivojac, Wyatt T. Clark, Tal Oron et al.|Nature Methods|2013
Cited by 1.1kOpen Access

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens
Naihui Zhou, Yuxiang Jiang, Timothy Bergquist et al.|Genome biology|2019
Cited by 478Open Access

BACKGROUND: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. RESULTS: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. CONCLUSION: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

An expanded evaluation of protein function prediction methods shows an improvement in accuracy
Yuxiang Jiang, Tal Oron, Wyatt T. Clark et al.|Genome biology|2016
Cited by 450Open Access

BACKGROUND: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. RESULTS: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. CONCLUSIONS: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.

The association between endometriosis and autoimmune diseases: a systematic review and meta-analysis
Nina Shigesi, Marina Kvaskoff, Shona Kirtley et al.|Human Reproduction Update|2019
Cited by 360Open Access

BACKGROUND: Endometriosis is a chronic gynaecological disorder that affects 2-10% of women of reproductive age. The aetiology of endometriosis is largely under-explored, yet abnormalities in the immune system have been suggested to explain the origin of ectopic endometrial tissues, and an association between endometriosis and autoimmune diseases has been proposed. Evaluation of current evidence investigating the association between endometriosis and autoimmune diseases from population-based studies will facilitate our understanding of the causes and consequences of endometriosis and provide a reference for better healthcare practices population-wide. OBJECTIVE AND RATIONALE: The aim of this study was to systematically review the literature on population-based studies investigating an association between endometriosis and autoimmune diseases and to conduct a meta-analysis of combinable results to investigate the extent and robustness of evidence. SEARCH METHODS: Four electronic databases were searched (MEDLINE, Embase, Web of Science, and CINAHL) from each database inception date until 7 April 2018. Search terms included a combination of database-specific controlled vocabulary terms and free-text terms relating to 'endometriosis' and 'autoimmune diseases'. Study inclusion criteria focused on peer-reviewed published articles that reported an association between endometriosis and autoimmune diseases, excluding case reports/series, review papers, meta-analyses, organizational guidelines, editorial letters, expert opinions, and conference abstracts. Quality assessment of included studies was performed based on GRADE criteria. Key information of eligible studies was abstracted into a standard form. Meta-analysis was performed for autoimmune diseases with combinable study results from at least three studies investigating an association with endometriosis. For cross-sectional studies and case-control studies, raw data from each study were documented to calculate a Mantel-Haenszel odds ratio with 95% CIs. For cohort studies, an inverse variance probability weighted model was used to pool study results to calculate a rate ratio (a hazard ratio or a standardized incidence rate) with 95% CIs. OUTCOMES: A total of 26 published population-based cross-sectional, case-control, and cohort studies that investigated the association between endometriosis and autoimmune diseases met all eligible criteria and were included in the review. The studies quantified an association between endometriosis and several autoimmune diseases, including systemic lupus erythematosus (SLE), Sjögren's syndrome (SS), rheumatoid arthritis (RA), autoimmune thyroid disorder, coeliac disease (CLD), multiple sclerosis (MS), inflammatory bowel disease (IBD), and Addison's disease. However, the quality of the evidence was generally poor due to the high risk of bias in the majority of the chosen study designs and statistical analyses. Only 5 of the 26 studies could provide high-quality evidence, and among these, 4 supported a statistically significant association between endometriosis and at least 1 autoimmune disease: SLE, SS, RA, CLD, MS, or IBD. WIDER IMPLICATIONS: The observed associations between endometriosis and autoimmune diseases suggest that clinicians need to be aware of the potential coexistence of endometriosis and autoimmune diseases when either is diagnosed. Scientists interested in research studies on endometriosis or autoimmune diseases should consider the likelihood of comorbidity when studying these two types of health conditions. Well-designed large prospective cohort studies with confounding control and mediation quantification, as well as genetic and biological studies, are needed to generate further insights into whether endometriosis is a risk factor for, or a consequence of, autoimmune diseases, and whether these two types of disorders share pathophysiological mechanisms even if they arise independently. Such insights may offer opportunities for the development of novel non-hormonal medications such as immuno-modulators or repurposing of existing immunomodulatory therapies for endometriosis.