D

David Gómez-Cabrero

Universidad Publica de Navarra

ORCID: 0000-0003-4186-3788

Publishes on Epigenetics and DNA Methylation, Single-cell and spatial transcriptomics, Bioinformatics and Genomic Networks. 346 papers and 13.7k citations.

346Publications
13.7kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

A survey of best practices for RNA-seq data analysis
Ana Conesa, Pedro Madrigal, Sonia Tarazona et al.|Genome biology|2016
Cited by 2.9kOpen Access

RNA-sequencing (RNA-seq) has a wide variety of applications, but no single analysis pipeline can be used in all cases. We review all of the major steps in RNA-seq data analysis, including experimental design, quality control, read alignment, quantification of gene and transcript levels, visualization, differential gene expression, alternative splicing, functional analysis, gene fusion detection and eQTL mapping. We highlight the challenges associated with each step. We discuss the analysis of small RNAs and the integration of RNA-seq with other functional genomics techniques. Finally, we discuss the outlook for novel technologies that are changing the state of the art in transcriptomics.

A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data
Cited by 1.9kOpen Access

MOTIVATION: The Illumina Infinium 450 k DNA Methylation Beadchip is a prime candidate technology for Epigenome-Wide Association Studies (EWAS). However, a difficulty associated with these beadarrays is that probes come in two different designs, characterized by widely different DNA methylation distributions and dynamic range, which may bias downstream analyses. A key statistical issue is therefore how best to adjust for the two different probe designs. RESULTS: Here we propose a novel model-based intra-array normalization strategy for 450 k data, called BMIQ (Beta MIxture Quantile dilation), to adjust the beta-values of type2 design probes into a statistical distribution characteristic of type1 probes. The strategy involves application of a three-state beta-mixture model to assign probes to methylation states, subsequent transformation of probabilities into quantiles and finally a methylation-dependent dilation transformation to preserve the monotonicity and continuity of the data. We validate our method on cell-line data, fresh frozen and paraffin-embedded tumour tissue samples and demonstrate that BMIQ compares favourably with two competing methods. Specifically, we show that BMIQ improves the robustness of the normalization procedure, reduces the technical variation and bias of type2 probe values and successfully eliminates the type1 enrichment bias caused by the lower dynamic range of type2 probes. BMIQ will be useful as a preprocessing step for any study using the Illumina Infinium 450 k platform. AVAILABILITY: BMIQ is freely available from http://code.google.com/p/bmiq/. CONTACT: a.teschendorff@ucl.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility
Cited by 1.4kOpen Access

We analyzed genetic data of 47,429 multiple sclerosis (MS) and 68,374 control subjects and established a reference map of the genetic architecture of MS that includes 200 autosomal susceptibility variants outside the major histocompatibility complex (MHC), one chromosome X variant, and 32 variants within the extended MHC. We used an ensemble of methods to prioritize 551 putative susceptibility genes that implicate multiple innate and adaptive pathways distributed across the cellular components of the immune system. Using expression profiles from purified human microglia, we observed enrichment for MS genes in these brain-resident immune cells, suggesting that these may have a role in targeting an autoimmune process to the central nervous system, although MS is most likely initially triggered by perturbation of peripheral immune responses.

Data integration in the era of omics: current and future challenges
David Gómez-Cabrero, Imad Abugessaisa, Dieter Maier et al.|BMC Systems Biology|2014
Cited by 423Open Access

To integrate heterogeneous and large omics data constitutes not only a conceptual challenge but a practical hurdle in the daily analysis of omics data. With the rise of novel omics technologies and through large-scale consortia projects, biological systems are being further investigated at an unprecedented scale generating heterogeneous and often large data sets. These data-sets encourage researchers to develop novel data integration methodologies. In this introduction we review the definition and characterize current efforts on data integration in the life sciences. We have used a web-survey to assess current research projects on data-integration to tap into the views, needs and challenges as currently perceived by parts of the research community.

Single-Cell RNA Sequencing Analysis Reveals a Crucial Role for CTHRC1 (Collagen Triple Helix Repeat Containing 1) Cardiac Fibroblasts After Myocardial Infarction
Cited by 262Open Access

BACKGROUND: Cardiac fibroblasts (CFs) have a central role in the ventricular remodeling process associated with different types of fibrosis. Recent studies have shown that fibroblasts do not respond homogeneously to heart injury. Because of the limited set of bona fide fibroblast markers, a proper characterization of fibroblast population heterogeneity in response to cardiac damage is lacking. The purpose of this study was to define CF heterogeneity during ventricular remodeling and the underlying mechanisms that regulate CF function. METHODS: Collagen1α1-GFP (green fluorescent protein)-positive CFs were characterized after myocardial infarction (MI) by single-cell and bulk RNA sequencing, assay for transposase-accessible chromatin sequencing, and functional assays. Swine and patient samples were studied using bulk RNA sequencing. RESULTS: We identified and characterized a unique CF subpopulation that emerges after MI in mice. These activated fibroblasts exhibit a clear profibrotic signature, express high levels of Cthrc1 (collagen triple helix repeat containing 1), and localize into the scar. Noncanonical transforming growth factor-β signaling and different transcription factors including SOX9 are important regulators mediating their response to cardiac injury. Absence of CTHRC1 results in pronounced lethality attributable to ventricular rupture. A population of CFs with a similar transcriptome was identified in a swine model of MI and in heart tissue from patients with MI and dilated cardiomyopathy. CONCLUSIONS: as a novel regulator of the healing scar process and a target for future translational studies.