M

Mark D. Robinson

SIB Swiss Institute of Bioinformatics

ORCID: 0000-0002-3048-5518

Publishes on Single-cell and spatial transcriptomics, Gene expression and cancer classification, Epigenetics and DNA Methylation. 458 papers and 93.2k citations.

458Publications
93.2kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

<tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data
Cited by 44.3kOpen Access

SUMMARY: It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data. AVAILABILITY: The package is freely available under the LGPL licence from the Bioconductor web site (http://bioconductor.org).

A scaling normalization method for differential expression analysis of RNA-seq data
Mark D. Robinson, Alicia Oshlack|Genome biology|2010
Cited by 8.5kOpen Access

The fine detail provided by sequencing-based transcriptome surveys suggests that RNA-seq is likely to become the platform of choice for interrogating steady state RNA. In order to discover biologically important changes in expression, we show that normalization continues to be an essential step in the analysis. We outline a simple and effective method for performing normalization and show dramatically improved results for inferring differential expression in simulated and publicly available data sets.

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences
Cited by 4.3kOpen Access

<ns4:p> High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transcripts. Several different quantification approaches have been proposed, ranging from simple counting of reads that overlap given genomic regions to more complex estimation of underlying transcript abundances. In this paper, we show that gene-level abundance estimates and statistical inference offer advantages over transcript-level analyses, in terms of performance and interpretability. We also illustrate that while the presence of differential isoform usage can lead to inflated false discovery rates in differential expression analyses on simple count matrices and transcript-level abundance estimates improve the performance in simulated data, the difference is relatively minor in several real data sets. Finally, we provide an R package ( <ns4:italic>tximport</ns4:italic> ) to help users integrate transcript-level abundance estimates from common quantification pipelines into count-based statistical inference engines. </ns4:p>

Systematic Genetic Analysis with Ordered Arrays of Yeast Deletion Mutants
Cited by 2.2k

In Saccharomyces cerevisiae, more than 80% of the approximately 6200 predicted genes are nonessential, implying that the genome is buffered from the phenotypic consequences of genetic perturbation. To evaluate function, we developed a method for systematic construction of double mutants, termed synthetic genetic array (SGA) analysis, in which a query mutation is crossed to an array of approximately 4700 deletion mutants. Inviable double-mutant meiotic progeny identify functional relationships between genes. SGA analysis of genes with roles in cytoskeletal organization (BNI1, ARP2, ARC40, BIM1), DNA synthesis and repair (SGS1, RAD27), or uncharacterized functions (BBC1, NBP2) generated a network of 291 interactions among 204 genes. Systematic application of this approach should produce a global map of gene function.