Y

Yanmei Huang

Huazhong University of Science and Technology

ORCID: 0000-0003-0450-9173

Publishes on Molecular Biology Techniques and Applications, Cancer Genomics and Diagnostics, Genomics and Phylogenetic Studies. 94 papers and 3.5k citations.

94Publications
3.5kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

New Insights into Human Nostril Microbiome from the Expanded Human Oral Microbiome Database (eHOMD): a Resource for the Microbiome of the Human Aerodigestive Tract
Cited by 597Open Access

The eHOMD (http://www.ehomd.org) is a valuable resource for researchers, from basic to clinical, who study the microbiomes and the individual microbes in body sites in the human aerodigestive tract, which includes the nasal passages, sinuses, throat, esophagus, and mouth, and the lower respiratory tract, in health and disease. The eHOMD is an actively curated, web-based, open-access resource. eHOMD provides the following: (i) species-level taxonomy based on grouping 16S rRNA gene sequences at 98.5% identity, (ii) a systematic naming scheme for unnamed and/or uncultivated microbial taxa, (iii) reference genomes to facilitate metagenomic, metatranscriptomic, and proteomic studies and (iv) convenient cross-links to other databases (e.g., PubMed and Entrez). By facilitating the assignment of species names to sequences, the eHOMD is a vital resource for enhancing the clinical relevance of 16S rRNA gene-based microbiome studies, as well as metagenomic studies.

Molecular Criteria for Defining the Naive Human Pluripotent State
Thorold W. Theunissen, Marc Friedli, Yupeng He et al.|Cell stem cell|2016
Cited by 523Open Access

Recent studies have aimed to convert cultured human pluripotent cells to a naive state, but it remains unclear to what extent the resulting cells recapitulate in vivo naive pluripotency. Here we propose a set of molecular criteria for evaluating the naive human pluripotent state by comparing it to the human embryo. We show that transcription of transposable elements provides a sensitive measure of the concordance between pluripotent stem cells and early human development. We also show that induction of the naive state is accompanied by genome-wide DNA hypomethylation, which is reversible except at imprinted genes, and that the X chromosome status resembles that of the human preimplantation embryo. However, we did not see efficient incorporation of naive human cells into mouse embryos. Overall, the different naive conditions we tested showed varied relationships to human embryonic states based on molecular criteria, providing a backdrop for future analysis of naive human pluripotency.

Comparative genome sequencing of <i>Drosophila pseudoobscura</i> : Chromosomal, gene, and <i>cis</i> -element evolution
Stephen Richards, Yue Liu, Brian R. Bettencourt et al.|Genome Research|2005
Cited by 508Open Access

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25-55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species--but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.

Annotation of the Drosophila melanogastereuchromatic genome: a systematic review
Sima Misra, Madeline A. Crosby, Chris Mungall et al.|Genome biology|2002
Cited by 384Open Access

BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences. RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes. CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.