M

Matthew E. Ritchie

The University of Melbourne

ORCID: 0000-0002-7383-0609

Publishes on Single-cell and spatial transcriptomics, Gene expression and cancer classification, Molecular Biology Techniques and Applications. 269 papers and 58.1k citations.

269Publications
58.1kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

limma powers differential expression analyses for RNA-sequencing and microarray studies
Matthew E. Ritchie, Belinda Phipson, Di Wu et al.|Nucleic Acids Research|2015
Cited by 42.7kOpen Access

limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Opportunities and challenges in long-read sequencing data analysis
Shanika L. Amarasinghe, Shian Su, Xueyi Dong et al.|Genome biology|2020
Cited by 2.6kOpen Access

Long-read technologies are overcoming early limitations in accuracy and throughput, broadening their application domains in genomics. Dedicated analysis tools that take into account the characteristics of long-read data are thus required, but the fast pace of development of such tools can be overwhelming. To assist in the design and analysis of long-read sequencing projects, we review the current landscape of available tools and present an online interactive database, long-read-tools.org, to facilitate their browsing. We further focus on the principles of error correction, base modification detection, and long-read transcriptomics analysis and highlight the challenges that remain.

A comparison of background correction methods for two-colour microarrays
Cited by 932Open Access

MOTIVATION: Microarray data must be background corrected to remove the effects of non-specific binding or spatial heterogeneity across the array, but this practice typically causes other problems such as negative corrected intensities and high variability of low intensity log-ratios. Different estimators of background, and various model-based processing methods, are compared in this study in search of the best option for differential expression analyses of small microarray experiments. RESULTS: Using data where some independent truth in gene expression is known, eight different background correction alternatives are compared, in terms of precision and bias of the resulting gene expression measures, and in terms of their ability to detect differentially expressed genes as judged by two popular algorithms, SAM and limma eBayes. A new background processing method (normexp) is introduced which is based on a convolution model. The model-based correction methods are shown to be markedly superior to the usual practice of subtracting local background estimates. Methods which stabilize the variances of the log-ratios along the intensity range perform the best. The normexp+offset method is found to give the lowest false discovery rate overall, followed by morph and vsn. Like vsn, normexp is applicable to most types of two-colour microarray data. AVAILABILITY: The background correction methods compared in this article are available in the R package limma (Smyth, 2005) from http://www.bioconductor.org. SUPPLEMENTARY INFORMATION: Supplementary data are available from http://bioinf.wehi.edu.au/resources/webReferences.html.

RNA-seq analysis is easy as 1-2-3 with limma, Glimma and edgeR
Charity W. Law, Monther Alhamdoosh, Shian Su et al.|F1000Research|2018
Cited by 653Open Access

<ns3:p>The ability to easily and efficiently analyse RNA-sequencing data is a key strength of the Bioconductor project. Starting with counts summarised at the gene-level, a typical analysis involves pre-processing, exploratory data analysis, differential expression testing and pathway analysis with the results obtained informing future experiments and validation studies. In this workflow article, we analyse RNA-sequencing data from the mouse mammary gland, demonstrating use of the popular <ns3:bold>edgeR</ns3:bold> package to import, organise, filter and normalise the data, followed by the <ns3:bold>limma</ns3:bold> package with its <ns3:italic>voom</ns3:italic> method, linear modelling and empirical Bayes moderation to assess differential expression and perform gene set testing. This pipeline is further enhanced by the <ns3:bold>Glimma</ns3:bold> package which enables interactive exploration of the results so that individual samples and genes can be examined by the user. The complete analysis offered by these three packages highlights the ease with which researchers can turn the raw counts from an RNA-sequencing experiment into biological insights using Bioconductor.</ns3:p>