Evidation Health (United States)
ORCID: 0000-0002-8533-6843Publishes on Gene expression and cancer classification, Bioinformatics and Genomic Networks, Genetic Associations and Epidemiology. 28 papers and 4.7k citations.
Add your photo, update your bio, and get notified when your ranking changes.
High-throughput RNA sequencing (RNA-seq) promises to revolutionize our understanding of genes and their role in human disease by characterizing the RNA content of tissues and cells. The realization of this promise, however, is conditional on the development of effective computational methods for the identification and quantification of transcripts from incomplete and noisy data. In this article, we introduce iReckon, a method for simultaneous determination of the isoforms and estimation of their abundances. Our probabilistic approach incorporates multiple biological and technical phenomena, including novel isoforms, intron retention, unspliced pre-mRNA, PCR amplification biases, and multimapped reads. iReckon utilizes regularized expectation-maximization to accurately estimate the abundances of known and novel isoforms. Our results on simulated and real data demonstrate a superior ability to discover novel isoforms with a significantly reduced number of false-positive predictions, and our abundance accuracy prediction outmatches that of other state-of-the-art tools. Furthermore, we have applied iReckon to two cancer transcriptome data sets, a triple-negative breast cancer patient sample and the MCF7 breast cancer cell line, and show that iReckon is able to reconstruct the complex splicing changes that were not previously identified. QT-PCR validations of the isoforms detected in the MCF7 cell line confirmed all of iReckon's predictions and also showed strong agreement (r(2) = 0.94) with the predicted abundances.
Neurodegenerative disorders such as Alzheimer's disease (AD), Lewy body diseases (LBD), and the amyotrophic lateral sclerosis and frontotemporal dementia (ALS-FTD) spectrum are defined by the accumulation of specific misfolded protein aggregates. However, the mechanisms by which each proteinopathy leads to neurodegeneration remain elusive. We hypothesized that there is a common "pan-neurodegenerative" gene expression signature driving pathophysiology across these clinically and pathologically diverse proteinopathies. To test this hypothesis, we performed a systematic review of human CNS transcriptomics datasets from AD, LBD, and ALS-FTD patients and age-matched controls in the Gene Expression Omnibus (GEO) and ArrayExpress databases, followed by consistent processing of each dataset, meta-analysis, pathway enrichment, and overlap analyses. After applying pre-specified eligibility criteria and stringent data pre-processing, a total of 2600 samples from 26 AD, 21 LBD, and 13 ALS-FTD datasets were included in the meta-analysis. The pan-neurodegenerative gene signature is characterized by an upregulation of innate immunity, cytoskeleton, and transcription and RNA processing genes, and a downregulation of the mitochondrial electron transport chain. Pathway enrichment analyses also revealed the upregulation of neuroinflammation (including Toll-like receptor, TNF, and NFκB signaling) and phagocytosis, and the downregulation of mitochondrial oxidative phosphorylation, lysosomal acidification, and ubiquitin-proteasome pathways. Our findings suggest that neuroinflammation and a failure in both neuronal energy metabolism and protein degradation systems are consistent features underlying neurodegenerative diseases, despite differences in the extent of neuronal loss and brain regions involved.
Rheumatoid arthritis (RA) affects millions world-wide. While anti-TNF treatment is widely used to reduce disease progression, treatment fails in ∼one-third of patients. No biomarker currently exists that identifies non-responders before treatment. A rigorous community-based assessment of the utility of SNP data for predicting anti-TNF treatment efficacy in RA patients was performed in the context of a DREAM Challenge (http://www.synapse.org/RA_Challenge). An open challenge framework enabled the comparative evaluation of predictions developed by 73 research groups using the most comprehensive available data and covering a wide range of state-of-the-art modelling methodologies. Despite a significant genetic heritability estimate of treatment non-response trait (h(2)=0.18, P value=0.02), no significant genetic contribution to prediction accuracy is observed. Results formally confirm the expectations of the rheumatology community that SNP information does not significantly improve predictive performance relative to standard clinical traits, thereby justifying a refocusing of future efforts on collection of other data.