Y

Yixue Li

King University

ORCID: 0000-0002-1198-7176

Publishes on Bioinformatics and Genomic Networks, Machine Learning in Bioinformatics, Genomics and Phylogenetic Studies. 697 papers and 27.3k citations.

697Publications
27.3kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Regulation of Cellular Metabolism by Protein Lysine Acetylation
Shimin Zhao, Wei Xu, Wenqing Jiang et al.|Science|2010
Cited by 1.9k

Protein lysine acetylation has emerged as a key posttranslational modification in cellular regulation, in particular through the modification of histones and nuclear transcription regulators. We show that lysine acetylation is a prevalent modification in enzymes that catalyze intermediate metabolism. Virtually every enzyme in glycolysis, gluconeogenesis, the tricarboxylic acid (TCA) cycle, the urea cycle, fatty acid metabolism, and glycogen metabolism was found to be acetylated in human liver tissue. The concentration of metabolic fuels, such as glucose, amino acids, and fatty acids, influenced the acetylation status of metabolic enzymes. Acetylation activated enoyl-coenzyme A hydratase/3-hydroxyacyl-coenzyme A dehydrogenase in fatty acid oxidation and malate dehydrogenase in the TCA cycle, inhibited argininosuccinate lyase in the urea cycle, and destabilized phosphoenolpyruvate carboxykinase in gluconeogenesis. Our study reveals that acetylation plays a major role in metabolic regulation.

Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022
CNCB-NGDC Members and Partners, Yongbiao Xue, Yīmíng Bào et al.|Nucleic Acids Research|2021
Cited by 1.1kOpen Access

The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global research in both academia and industry. With the explosively accumulated multi-omics data at ever-faster rates, CNCB-NGDC is constantly scaling up and updating its core database resources through big data archive, curation, integration and analysis. In the past year, efforts have been made to synthesize the growing data and knowledge, particularly in single-cell omics and precision medicine research, and a series of resources have been newly developed, updated and enhanced. Moreover, CNCB-NGDC has continued to daily update SARS-CoV-2 genome sequences, variants, haplotypes and literature. Particularly, OpenLB, an open library of bioscience, has been established by providing easy and open access to a substantial number of abstract texts from PubMed, bioRxiv and medRxiv. In addition, Database Commons is significantly updated by cataloguing a full list of global databases, and BLAST tools are newly deployed to provide online sequence search services. All these resources along with their services are publicly accessible at https://ngdc.cncb.ac.cn.

Predicting protein–protein interactions based only on sequences information
Juwen Shen, Jian Zhang, Xiaomin Luo et al.|Proceedings of the National Academy of Sciences|2007
Cited by 1k

Protein-protein interactions (PPIs) are central to most biological processes. Although efforts have been devoted to the development of methodology for predicting PPIs and protein interaction networks, the application of most existing methods is limited because they need information about protein homology or the interaction marks of the protein partners. In the present work, we propose a method for PPI prediction using only the information of protein sequences. This method was developed based on a learning algorithm-support vector machine combined with a kernel function and a conjoint triad feature for describing amino acids. More than 16,000 diverse PPI pairs were used to construct the universal model. The prediction ability of our approach is better than that of other sequence-based PPI prediction methods because it is able to predict PPI networks. Different types of PPI networks have been effectively mapped with our method, suggesting that, even with only sequence information, this method could be applied to the exploration of networks for any newly discovered protein with unknown biological relativity. In addition, such supplementary experimental information can enhance the prediction ability of the method.

Cytosine base editor generates substantial off-target single-nucleotide variants in mouse embryos
Erwei Zuo, Yidi Sun, Wei Wu et al.|Science|2019
Cited by 794Open Access

Spotting off-targets from gene editing Unintended genomic modifications limit the potential therapeutic use of gene-editing tools. Available methods to find off-targets generally do not work in vivo or detect single-nucleotide changes. Three papers in this issue report new methods for monitoring gene-editing tools in vivo (see the Perspective by Kempton and Qi). Wienert et al. followed the recruitment of a DNA repair protein to DNA breaks induced by CRISPR-Cas9, enabling unbiased detection of off-target editing in cellular and animal models. Zuo et al. identified off-targets without the interference of natural genetic heterogeneity by injecting base editors into one blastomere of a two-cell mouse embryo and leaving the other genetically identical blastomere unedited. Jin et al. performed whole-genome sequencing on individual, genome-edited rice plants to identify unintended mutations. Cytosine, but not adenine, base editors induced numerous single-nucleotide variants in both mouse and rice. Science , this issue p. 286 , p. 289 , p. 292 ; see also p. 234

Cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human
Huai‐Dong Song, Changchun Tu, Guowei Zhang et al.|Proceedings of the National Academy of Sciences|2005
Cited by 736Open Access

The genomic sequences of severe acute respiratory syndrome coronaviruses from human and palm civet of the 2003/2004 outbreak in the city of Guangzhou, China, were nearly identical. Phylogenetic analysis suggested an independent viral invasion from animal to human in this new episode. Combining all existing data but excluding singletons, we identified 202 single-nucleotide variations. Among them, 17 are polymorphic in palm civets only. The ratio of nonsynonymous/synonymous nucleotide substitution in palm civets collected 1 yr apart from different geographic locations is very high, suggesting a rapid evolving process of viral proteins in civet as well, much like their adaptation in the human host in the early 2002-2003 epidemic. Major genetic variations in some critical genes, particularly the Spike gene, seemed essential for the transition from animal-to-human transmission to human-to-human transmission, which eventually caused the first severe acute respiratory syndrome outbreak of 2002/2003.