University of Pennsylvania
ORCID: 0000-0001-5216-4670Publishes on Genetic Associations and Epidemiology, Bioinformatics and Genomic Networks, BRCA gene mutations in cancer. 166 papers and 4.9k citations.
Add your photo, update your bio, and get notified when your ranking changes.
Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver illness with a genetically heterogeneous background that can be accompanied by considerable morbidity and attendant health care costs. The pathogenesis and progression of NAFLD is complex with many unanswered questions. We conducted genome-wide association studies (GWASs) using both adult and pediatric participants from the Electronic Medical Records and Genomics (eMERGE) Network to identify novel genetic contributors to this condition. First, a natural language processing (NLP) algorithm was developed, tested, and deployed at each site to identify 1106 NAFLD cases and 8571 controls and histological data from liver tissue in 235 available participants. These include 1242 pediatric participants (396 cases, 846 controls). The algorithm included billing codes, text queries, laboratory values, and medication records. Next, GWASs were performed on NAFLD cases and controls and case-only analyses using histologic scores and liver function tests adjusting for age, sex, site, ancestry, PC, and body mass index (BMI). Consistent with previous results, a robust association was detected for the PNPLA3 gene cluster in participants with European ancestry. At the PNPLA3-SAMM50 region, three SNPs, rs738409, rs738408, and rs3747207, showed strongest association (best SNP rs738409 p = 1.70 × 10− 20). This effect was consistent in both pediatric (p = 9.92 × 10− 6) and adult (p = 9.73 × 10− 15) cohorts. Additionally, this variant was also associated with disease severity and NAFLD Activity Score (NAS) (p = 3.94 × 10− 8, beta = 0.85). PheWAS analysis link this locus to a spectrum of liver diseases beyond NAFLD with a novel negative correlation with gout (p = 1.09 × 10− 4). We also identified novel loci for NAFLD disease severity, including one novel locus for NAS score near IL17RA (rs5748926, p = 3.80 × 10− 8), and another near ZFP90-CDH1 for fibrosis (rs698718, p = 2.74 × 10− 11). Post-GWAS and gene-based analyses identified more than 300 genes that were used for functional and pathway enrichment analyses. In summary, this study demonstrates clear confirmation of a previously described NAFLD risk locus and several novel associations. Further collaborative studies including an ethnically diverse population with well-characterized liver histologic features of NAFLD are needed to further validate the novel findings.
BACKGROUND: Observational studies have identified height as a strong risk factor for atrial fibrillation, but this finding may be limited by residual confounding. We aimed to examine genetic variation in height within the Mendelian randomization (MR) framework to determine whether height has a causal effect on risk of atrial fibrillation. METHODS AND FINDINGS: In summary-level analyses, MR was performed using summary statistics from genome-wide association studies of height (GIANT/UK Biobank; 693,529 individuals) and atrial fibrillation (AFGen; 65,446 cases and 522,744 controls), finding that each 1-SD increase in genetically predicted height increased the odds of atrial fibrillation (odds ratio [OR] 1.34; 95% CI 1.29 to 1.40; p = 5 × 10-42). This result remained consistent in sensitivity analyses with MR methods that make different assumptions about the presence of pleiotropy, and when accounting for the effects of traditional cardiovascular risk factors on atrial fibrillation. Individual-level phenome-wide association studies of height and a height genetic risk score were performed among 6,567 European-ancestry participants of the Penn Medicine Biobank (median age at enrollment 63 years, interquartile range 55-72; 38% female; recruitment 2008-2015), confirming prior observational associations between height and atrial fibrillation. Individual-level MR confirmed that each 1-SD increase in height increased the odds of atrial fibrillation, including adjustment for clinical and echocardiographic confounders (OR 1.89; 95% CI 1.50 to 2.40; p = 0.007). The main limitations of this study include potential bias from pleiotropic effects of genetic variants, and lack of generalizability of individual-level findings to non-European populations. CONCLUSIONS: In this study, we observed evidence that height is likely a positive causal risk factor for atrial fibrillation. Further study is needed to determine whether risk prediction tools including height or anthropometric risk factors can be used to improve screening and primary prevention of atrial fibrillation, and whether biological pathways involved in height may offer new targets for treatment of atrial fibrillation.
Importance: Hereditary transthyretin (TTR) amyloid cardiomyopathy (hATTR-CM) due to the TTR V122I variant is an autosomal-dominant disorder that causes heart failure in elderly individuals of African ancestry. The clinical associations of carrying the variant, its effect in other African ancestry populations including Hispanic/Latino individuals, and the rates of achieving a clinical diagnosis in carriers are unknown. Objective: To assess the association between the TTR V122I variant and heart failure and identify rates of hATTR-CM diagnosis among carriers with heart failure. Design, Setting, and Participants: Cross-sectional analysis of carriers and noncarriers of TTR V122I of African ancestry aged 50 years or older enrolled in the Penn Medicine Biobank between 2008 and 2017 using electronic health record data from 1996 to 2017. Case-control study in participants of African and Hispanic/Latino ancestry with and without heart failure in the Mount Sinai BioMe Biobank enrolled between 2007 and 2015 using electronic health record data from 2007 to 2018. Exposures: TTR V122I carrier status. Main Outcomes and Measures: The primary outcome was prevalent heart failure. The rate of diagnosis with hATTR-CM among TTR V122I carriers with heart failure was measured. Results: The cross-sectional cohort included 3724 individuals of African ancestry with a median age of 64 years (interquartile range, 57-71); 1755 (47%) were male, 2896 (78%) had a diagnosis of hypertension, and 753 (20%) had a history of myocardial infarction or coronary revascularization. There were 116 TTR V122I carriers (3.1%); 1121 participants (30%) had heart failure. The case-control study consisted of 2307 individuals of African ancestry and 3663 Hispanic/Latino individuals; the median age was 73 years (interquartile range, 68-80), 2271 (38%) were male, 4709 (79%) had a diagnosis of hypertension, and 1008 (17%) had a history of myocardial infarction or coronary revascularization. There were 1376 cases of heart failure. TTR V122I was associated with higher rates of heart failure (cross-sectional cohort: n = 51/116 TTR V122I carriers [44%], n = 1070/3608 noncarriers [30%], adjusted odds ratio, 1.7 [95% CI, 1.2-2.4], P = .006; case-control study: n = 36/1376 heart failure cases [2.6%], n = 82/4594 controls [1.8%], adjusted odds ratio, 1.8 [95% CI, 1.2-2.7], P = .008). Ten of 92 TTR V122I carriers with heart failure (11%) were diagnosed as having hATTR-CM; the median time from onset of symptoms to clinical diagnosis was 3 years. Conclusions and Relevance: Among individuals of African or Hispanic/Latino ancestry enrolled in 2 academic medical center-based biobanks, the TTR V122I genetic variant was significantly associated with heart failure.
The Penn Medicine BioBank (PMBB) is an electronic health record (EHR)-linked biobank at the University of Pennsylvania (Penn Medicine). A large variety of health-related information, ranging from diagnosis codes to laboratory measurements, imaging data and lifestyle information, is integrated with genomic and biomarker data in the PMBB to facilitate discoveries and translational science. To date, 174,712 participants have been enrolled into the PMBB, including approximately 30% of participants of non-European ancestry, making it one of the most diverse medical biobanks. There is a median of seven years of longitudinal data in the EHR available on participants, who also consent to permission to recontact. Herein, we describe the operations and infrastructure of the PMBB, summarize the phenotypic architecture of the enrolled participants, and use body mass index (BMI) as a proof-of-concept quantitative phenotype for PheWAS, LabWAS, and GWAS. The major representation of African-American participants in the PMBB addresses the essential need to expand the diversity in genetic and translational research. There is a critical need for a "medical biobank consortium" to facilitate replication, increase power for rare phenotypes and variants, and promote harmonized collaboration to optimize the potential for biological discovery and precision medicine.