GWAS Atlas: an updated knowledgebase integrating more curated associations in plants and animalsXiaonan Liu, Dongmei Tian, Cuiping Li et al.|Nucleic Acids Research|2022 GWAS Atlas (https://ngdc.cncb.ac.cn/gwas/) is a manually curated resource of genome-wide genotype-to-phenotype associations for a wide range of species. Here, we present an updated implementation of GWAS Atlas by curating and incorporating more high-quality associations, with significant improvements and advances over the previous version. Specifically, the current release of GWAS Atlas incorporates a total of 278,109 curated genotype-to-phenotype associations for 1,444 different traits across 15 species (10 plants and 5 animals) from 830 publications and 3,432 studies. A collection of 6,084 lead SNPs of 439 traits and 486 experiment-validated causal variants of 157 traits are newly added. Moreover, 1,056 trait ontology terms are newly defined, resulting in 1,172 and 431 terms for Plant Phenotype and Trait Ontology and Animal Phenotype and Trait Ontology, respectively. Additionally, it is equipped with four online analysis tools and a submission platform, allowing users to perform data analysis and data submission. Collectively, as a core resource in the National Genomics Data Center, GWAS Atlas provides valuable genotype-to-phenotype associations for a diversity of species and thus plays an important role in agronomic trait study and molecular breeding.
Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationshipsLeveraging molecular networks to discover disease-relevant modules is a long-standing challenge. With the accumulation of interactomes, there is a pressing need for powerful computational approaches to handle the inevitable noise and context-specific nature of biological networks. Here, we introduce Graphene, a two-step self-supervised representation learning framework tailored to concisely integrate multiple molecular networks and adapted to gene functional analysis via downstream re-training. In practice, we first leverage GNN (graph neural network) pre-training techniques to obtain initial node embeddings followed by re-training Graphene using a graph attention architecture, achieving superior performance over competing methods for pathway gene recovery, disease gene reprioritization, and comorbidity prediction. Graphene successfully recapitulates tissue-specific gene expression across disease spectrum and demonstrates shared heritability of common mental disorders. Graphene can be updated with new interactomes or other omics features. Graphene holds promise to decipher gene function under network context and refine GWAS (genome-wide association study) hits and offers mechanistic insights via decoding diseases from genome to networks to phenotypes.