B

Bo Li

Yuhuangding Hospital

ORCID: 0000-0003-0668-1620

Publishes on Single-cell and spatial transcriptomics, Cancer Immunotherapy and Biomarkers, RNA modifications and cancer. 161 papers and 44.4k citations.

161Publications
44.4kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome
Bo Li, Colin N. Dewey|BMC Bioinformatics|2011
Cited by 23.3kOpen Access

BACKGROUND: RNA-Seq is revolutionizing the way transcript abundances are measured. A key challenge in transcript quantification from RNA-Seq data is the handling of reads that map to multiple genes or isoforms. This issue is particularly important for quantification with de novo transcriptome assemblies in the absence of sequenced genomes, as it is difficult to determine which transcripts are isoforms of the same gene. A second significant issue is the design of RNA-Seq experiments, in terms of the number of reads, read length, and whether reads come from one or both ends of cDNA fragments. RESULTS: We present RSEM, an user-friendly software package for quantifying gene and isoform abundances from single-end or paired-end RNA-Seq data. RSEM outputs abundance estimates, 95% credibility intervals, and visualization files and can also simulate RNA-Seq data. In contrast to other existing tools, the software does not require a reference genome. Thus, in combination with a de novo transcriptome assembler, RSEM enables accurate transcript quantification for species without sequenced genomes. On simulated and real data sets, RSEM has superior or comparable performance to quantification methods that rely on a reference genome. Taking advantage of RSEM's ability to effectively use ambiguously-mapping reads, we show that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads. On the other hand, estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene. CONCLUSIONS: RSEM is an accurate and user-friendly software tool for quantifying transcript abundances from RNA-Seq data. As it does not rely on the existence of a reference genome, it is particularly useful for quantification with de novo transcriptome assemblies. In addition, RSEM has enabled valuable guidance for cost-efficient design of quantification experiments with RNA-Seq, which is currently relatively expensive.

Whole-genome analyses resolve early branches in the tree of life of modern birds
Cited by 2kOpen Access

To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago.

RNA-Seq gene expression estimation with read mapping uncertainty
Bo Li, Victor Ruotti, Ron Stewart et al.|Bioinformatics|2009
Cited by 1.3kOpen Access

Abstract Motivation: RNA-Seq is a promising new technology for accurately measuring gene expression levels. Expression estimation with RNA-Seq requires the mapping of relatively short sequencing reads to a reference genome or transcript set. Because reads are generally shorter than transcripts from which they are derived, a single read may map to multiple genes and isoforms, complicating expression analyses. Previous computational methods either discard reads that map to multiple locations or allocate them to genes heuristically. Results: We present a generative statistical model and associated inference methods that handle read mapping uncertainty in a principled manner. Through simulations parameterized by real RNA-Seq data, we show that our method is more accurate than previous methods. Our improved accuracy is the result of handling read mapping uncertainty with a statistical model and the estimation of gene expression levels as the sum of isoform expression levels. Unlike previous methods, our method is capable of modeling non-uniform read distributions. Simulations with our method indicate that a read length of 20–25 bases is optimal for gene-level expression estimation from mouse and maize RNA-Seq data when sequencing throughput is fixed. Availability: An initial C++ implementation of our method that was used for the results presented in this article is available at http://deweylab.biostat.wisc.edu/rsem. Contact: cdewey@biostat.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics on

Resistance to checkpoint blockade therapy through inactivation of antigen presentation
Moshe Sade-Feldman, Yunxin J. Jiao, Jonathan Chen et al.|Nature Communications|2017
Cited by 994Open Access

Treatment with immune checkpoint blockade (CPB) therapies often leads to prolonged responses in patients with metastatic melanoma, but the common mechanisms of primary and acquired resistance to these agents remain incompletely characterized and have yet to be validated in large cohorts. By analyzing longitudinal tumor biopsies from 17 metastatic melanoma patients treated with CPB therapies, we observed point mutations, deletions or loss of heterozygosity (LOH) in beta-2-microglobulin (B2M), an essential component of MHC class I antigen presentation, in 29.4% of patients with progressing disease. In two independent cohorts of melanoma patients treated with anti-CTLA4 and anti-PD1, respectively, we find that B2M LOH is enriched threefold in non-responders (~30%) compared to responders (~10%) and associated with poorer overall survival. Loss of both copies of B2M is found only in non-responders. B2M loss is likely a common mechanism of resistance to therapies targeting CTLA4 or PD1.