LAC+USC Medical Center
ORCID: 0000-0002-5522-5296Publishes on Epigenetics and DNA Methylation, Genomics and Chromatin Dynamics, Prostate Cancer Treatment and Research. 167 papers and 21.8k citations.
Add your photo, update your bio, and get notified when your ranking changes.
INTRODUCTION Strong genetic associations have been found for a number of psychiatric disorders. However, understanding the underlying molecular mechanisms remains challenging. RATIONALE To address this challenge, the PsychENCODE Consortium has developed a comprehensive online resource and integrative models for the functional genomics of the human brain. RESULTS The base of the pyramidal resource is the datasets generated by PsychENCODE, including bulk transcriptome, chromatin, genotype, and Hi-C datasets and single-cell transcriptomic data from ~32,000 cells for major brain regions. We have merged these with data from Genotype-Tissue Expression (GTEx), ENCODE, Roadmap Epigenomics, and single-cell analyses. Via uniform processing, we created a harmonized resource, allowing us to survey functional genomics data on the brain over a sample size of 1866 individuals. From this uniformly processed dataset, we created derived data products. These include lists of brain-expressed genes, coexpression modules, and single-cell expression profiles for many brain cell types; ~79,000 brain-active enhancers with associated Hi-C loops and topologically associating domains; and ~2.5 million expression quantitative-trait loci (QTLs) comprising ~238,000 linkage-disequilibrium–independent single-nucleotide polymorphisms and of other types of QTLs associated with splice isoforms, cell fractions, and chromatin activity. By using these, we found that >88% of the cross-population variation in brain gene expression can be accounted for by cell fraction changes. Furthermore, a number of disorders and aging are associated with changes in cell-type proportions. The derived data also enable comparison between the brain and other tissues. In particular, by using spectral analyses, we found that the brain has distinct expression and epigenetic patterns, including a greater extent of noncoding transcription than other tissues. The top level of the resource consists of integrative networks for regulation and machine-learning models for disease prediction. The networks include a full gene regulatory network (GRN) for the brain, linking transcription factors, enhancers, and target genes from merging of the QTLs, generalized element-activity correlations, and Hi-C data. By using this network, we link disease genes to genome-wide association study (GWAS) variants for psychiatric disorders. For schizophrenia, we linked 321 genes to the 142 reported GWAS loci. We then embedded the regulatory network into a deep-learning model to predict psychiatric phenotypes from genotype and expression. Our model gives a ~6-fold improvement in prediction over additive polygenic risk scores. Moreover, it achieves a ~3-fold improvement over additive models, even when the gene expression data are imputed, highlighting the value of having just a small amount of transcriptome data for disease prediction. Lastly, it highlights key genes and pathways associated with disorder prediction, including immunological, synaptic, and metabolic pathways, recapitulating de novo results from more targeted analyses. CONCLUSION Our resource and integrative analyses have uncovered genomic elements and networks in the brain, which in turn have provided insight into the molecular mechanisms underlying psychiatric disorders. Our deep-learning model improves disease risk prediction over traditional approaches and can be extended with additional data types (e.g., microRNA and neuroimaging). A comprehensive functional genomic resource for the adult human brain. The resource forms a three-layer pyramid. The bottom layer includes sequencing datasets for traits, such as schizophrenia. The middle layer represents derived datasets, including functional genomic elements and QTLs. The top layer contains integrated models, which link genotypes to phenotypes. DSPN, Deep Structured Phenotype Network; PC1 and PC2, principal components 1 and 2; ref, reference; alt, alternate; H3K27ac, histone H3 acetylation at lysine 27.
Cholangiocarcinoma (CCA) is an aggressive malignancy of the bile ducts, with poor prognosis and limited treatment options. Here, we describe the integrated analysis of somatic mutations, RNA expression, copy number, and DNA methylation by The Cancer Genome Atlas of a set of predominantly intrahepatic CCA cases and propose a molecular classification scheme. We identified an IDH mutant-enriched subtype with distinct molecular features including low expression of chromatin modifiers, elevated expression of mitochondrial genes, and increased mitochondrial DNA copy number. Leveraging the multi-platform data, we observed that ARID1A exhibited DNA hypermethylation and decreased expression in the IDH mutant subtype. More broadly, we found that IDH mutations are associated with an expanded histological spectrum of liver tumors with molecular features that stratify with CCA. Our studies reveal insights into the molecular pathogenesis and heterogeneity of cholangiocarcinoma and provide classification information of potential therapeutic significance.
Importance: African Americans have the highest breast cancer mortality rate. Although racial difference in the distribution of intrinsic subtypes of breast cancer is known, it is unclear if there are other inherent genomic differences that contribute to the survival disparities. Objectives: To investigate racial differences in breast cancer molecular features and survival and to estimate the heritability of breast cancer subtypes. Design, Setting, and Participants: Among a convenience cohort of patients with invasive breast cancer, breast tumor and matched normal tissue sample data (as of September 18, 2015) were obtained from The Cancer Genome Atlas. Main Outcomes and Measures: Breast cancer–free interval, tumor molecular features, and genetic variants. Results: Participants were 930 patients with breast cancer, including 154 black patients of African ancestry (mean [SD] age at diagnosis, 55.66 [13.01] years; 98.1% [n = 151] female) and 776 white patients of European ancestry (mean [SD] age at diagnosis, 59.51 [13.11] years; 99.0% [n = 768] female). Compared with white patients, black patients had a worse breast cancer-free interval (hazard ratio, HR=1.67; 95% CI, 1.02-2.74; P = .043). They had a higher likelihood of basal-like (odds ratio, 3.80; 95% CI, 2.46-5.87; P < .001) and human epidermal growth factor receptor 2 (ERBB2 [formerly HER2])–enriched (odds ratio, 2.22; 95% CI, 1.10-4.47; P = .027) breast cancer subtypes, with the Luminal A subtype as the reference. Blacks had more TP53 mutations and fewer PIK3CA mutations than whites. While most molecular differences were eliminated after adjusting for intrinsic subtype, the study found 16 DNA methylation probes, 4 DNA copy number segments, 1 protein, and 142 genes that were differentially expressed, with the gene-based signature having an excellent capacity for distinguishing breast tumors from black vs white patients (cross-validation C index, 0.878). Using germline genotypes, the heritability of breast cancer subtypes (basal vs nonbasal) was estimated to be 0.436 (P = 1.5 × 10−14). The estrogen receptor–positive polygenic risk score built from 89 known susceptibility variants was higher in blacks than in whites (difference, 0.24; P = 2.3 × 10−5), while the estrogen receptor–negative polygenic risk score was much higher in blacks than in whites (difference, 0.48; P = 2.8 × 10−11). Conclusions and Relevance: On the molecular level, after adjusting for intrinsic subtype frequency differences, this study found a modest number of genomic differences but a significant clinical survival outcome difference between blacks and whites in The Cancer Genome Atlas data set. Moreover, more than 40% of breast cancer subtype frequency differences could be explained by genetic variants. These data could form the basis for the development of molecular targeted therapies to improve clinical outcomes for the specific subtypes of breast cancers that disproportionately affect black women. Findings also indicate that personalized risk assessment and optimal treatment could reduce deaths from aggressive breast cancers for black women.