Novel Alzheimer Disease Risk Loci and Pathways in African American Individuals Using the African Genome Resources PanelImportance: Compared with non-Hispanic White individuals, African American individuals from the same community are approximately twice as likely to develop Alzheimer disease. Despite this disparity, the largest Alzheimer disease genome-wide association studies to date have been conducted in non-Hispanic White individuals. In the largest association analyses of Alzheimer disease in African American individuals, ABCA7, TREM2, and an intergenic locus at 5q35 were previously implicated. Objective: To identify additional risk loci in African American individuals by increasing the sample size and using the African Genome Resource panel. Design, Setting, and Participants: This genome-wide association meta-analysis used case-control and family-based data sets from the Alzheimer Disease Genetics Consortium. There were multiple recruitment sites throughout the United States that included individuals with Alzheimer disease and controls of African American ancestry. Analysis began October 2018 and ended September 2019. Main Outcomes and Measures: Diagnosis of Alzheimer disease. Results: A total of 2784 individuals with Alzheimer disease (1944 female [69.8%]) and 5222 controls (3743 female [71.7%]) were analyzed (mean [SD] age at last evaluation, 74.2 [13.6] years). Associations with 4 novel common loci centered near the intracellular glycoprotein trafficking gene EDEM1 (3p26; P = 8.9 × 10-7), near the immune response gene ALCAM (3q13; P = 9.3 × 10-7), within GPC6 (13q31; P = 4.1 × 10-7), a gene critical for recruitment of glutamatergic receptors to the neuronal membrane, and within VRK3 (19q13.33; P = 3.5 × 10-7), a gene involved in glutamate neurotoxicity, were identified. In addition, several loci associated with rare variants, including a genome-wide significant intergenic locus near IGF1R at 15q26 (P = 1.7 × 10-9) and 6 additional loci with suggestive significance (P ≤ 5 × 10-7) such as API5 at 11p12 (P = 8.8 × 10-8) and RBFOX1 at 16p13 (P = 5.4 × 10-7) were identified. Gene expression data from brain tissue demonstrate association of ALCAM, ARAP1, GPC6, and RBFOX1 with brain β-amyloid load. Of 25 known loci associated with Alzheimer disease in non-Hispanic White individuals, only APOE, ABCA7, TREM2, BIN1, CD2AP, FERMT2, and WWOX were implicated at a nominal significance level or stronger in African American individuals. Pathway analyses strongly support the notion that immunity, lipid processing, and intracellular trafficking pathways underlying Alzheimer disease in African American individuals overlap with those observed in non-Hispanic White individuals. A new pathway emerging from these analyses is the kidney system, suggesting a novel mechanism for Alzheimer disease that needs further exploration. Conclusions and Relevance: While the major pathways involved in Alzheimer disease etiology in African American individuals are similar to those in non-Hispanic White individuals, the disease-associated loci within these pathways differ.
A Multiple-Filter-Multiple-Wrapper Approach to Gene Selection and Microarray Data ClassificationYuk Yee Leung, Yeungsam Hung|IEEE/ACM Transactions on Computational Biology and Bioinformatics|2009 Filters and wrappers are two prevailing approaches for gene selection in microarray data analysis. Filters make use of statistical properties of each gene to represent its discriminating power between different classes. The computation is fast but the predictions are inaccurate. Wrappers make use of a chosen classifier to select genes by maximizing classification accuracy, but the computation burden is formidable. Filters and wrappers have been combined in previous studies to maximize the classification accuracy for a chosen classifier with respect to a filtered set of genes. The drawback of this single-filter-single-wrapper (SFSW) approach is that the classification accuracy is dependent on the choice of specific filter and wrapper. In this paper, a multiple-filter-multiple-wrapper (MFMW) approach is proposed that makes use of multiple filters and multiple wrappers to improve the accuracy and robustness of the classification, and to identify potential biomarker genes. Experiments based on six benchmark data sets show that the MFMW approach outperforms SFSW models (generated by all combinations of filters and wrappers used in the corresponding MFMW model) in all cases and for all six data sets. Some of MFMW-selected genes have been confirmed to be biomarkers or contribute to the development of particular cancers by other studies.
HAMR: high-throughput annotation of modified ribonucleotidesRNA is often altered post-transcriptionally by the covalent modification of particular nucleotides; these modifications are known to modulate the structure and activity of their host RNAs. The recent discovery that an RNA methyl-6 adenosine demethylase (FTO) is a risk gene in obesity has brought to light the significance of RNA modifications to human biology. These noncanonical nucleotides, when converted to cDNA in the course of RNA sequencing, can produce sequence patterns that are distinguishable from simple base-calling errors. To determine whether these modifications can be detected in RNA sequencing data, we developed a method that can not only locate these modifications transcriptome-wide with single nucleotide resolution, but can also differentiate between different classes of modifications. Using small RNA-seq data we were able to detect 92% of all known human tRNA modification sites that are predicted to affect RT activity. We also found that different modifications produce distinct patterns of cDNA sequence, allowing us to differentiate between two classes of adenosine and two classes of guanine modifications with 98% and 79% accuracy, respectively. To show the robustness of this method to sample preparation and sequencing methods, as well as to organismal diversity, we applied it to a publicly available yeast data set and achieved similar levels of accuracy. We also experimentally validated two novel and one known 3-methylcytosine (3mC) sites predicted by HAMR in human tRNAs. Researchers can now use our method to identify and characterize RNA modifications using only RNA-seq data, both retrospectively and when asking questions specifically about modified RNA.
Changes in the Transcriptome of Human Astrocytes Accompanying Oxidative Stress-Induced SenescenceElizabeth P. Crowe, Ferit Tüzer, Brian D. Gregory et al.|Frontiers in Aging Neuroscience|2016 Aging is a major risk factor for many neurodegenerative disorders. A key feature of aging biology that may underlie these diseases is cellular senescence. Senescent cells accumulate in tissues with age, undergo widespread changes in gene expression, and typically demonstrate altered, pro-inflammatory profiles. Astrocyte senescence has been implicated in neurodegenerative disease, and to better understand senescence-associated changes in astrocytes, we investigated changes in their transcriptome using RNA sequencing. Senescence was induced in human fetal astrocytes by transient oxidative stress. Brain-expressed genes, including those involved in neuronal development and differentiation, were downregulated in senescent astrocytes. Remarkably, several genes indicative of astrocytic responses to injury were also downregulated, including glial fibrillary acidic protein and genes involved in the processing and presentation of antigens by major histocompatibility complex class II proteins, while pro-inflammatory genes were upregulated. Overall, our findings suggest that senescence-related changes in the function of astrocytes may impact the pathogenesis of age-related brain disorders.
DASHR: database of small human noncoding RNAsSmall non-coding RNAs (sncRNAs) are highly abundant RNAs, typically <100 nucleotides long, that act as key regulators of diverse cellular processes. Although thousands of sncRNA genes are known to exist in the human genome, no single database provides searchable, unified annotation, and expression information for full sncRNA transcripts and mature RNA products derived from these larger RNAs. Here, we present the Database of small human noncoding RNAs (DASHR). DASHR contains the most comprehensive information to date on human sncRNA genes and mature sncRNA products. DASHR provides a simple user interface for researchers to view sequence and secondary structure, compare expression levels, and evidence of specific processing across all sncRNA genes and mature sncRNA products in various human tissues. DASHR annotation and expression data covers all major classes of sncRNAs including microRNAs (miRNAs), Piwi-interacting (piRNAs), small nuclear, nucleolar, cytoplasmic (sn-, sno-, scRNAs, respectively), transfer (tRNAs), and ribosomal RNAs (rRNAs). Currently, DASHR (v1.0) integrates 187 smRNA high-throughput sequencing (smRNA-seq) datasets with over 2.5 billion reads and annotation data from multiple public sources. DASHR contains annotations for ∼ 48,000 human sncRNA genes and mature sncRNA products, 82% of which are expressed in one or more of the curated tissues. DASHR is available at http://lisanwanglab.org/DASHR.