Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2024The National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support the global academic and industrial communities. With the rapid accumulation of multi-omics data at an unprecedented pace, CNCB-NGDC continuously expands and updates core database resources through big data archiving, integrative analysis and value-added curation. Importantly, NGDC collaborates closely with major international databases and initiatives to ensure seamless data exchange and interoperability. Over the past year, significant efforts have been dedicated to integrating diverse omics data, synthesizing expanding knowledge, developing new resources, and upgrading major existing resources. Particularly, several database resources are newly developed for the biodiversity of protists (P10K), bacteria (NTM-DB, MPA) as well as plant (PPGR, SoyOmics, PlantPan) and disease/trait association (CROST, HervD Atlas, HALL, MACdb, BioKA, BioKA, RePoS, PGG.SV, NAFLDkb). All the resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2025The National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), offers a comprehensive suite of database resources to support the global scientific community. Amidst the unprecedented accumulation of multi-omics data, CNCB-NGDC is committed to continually evolving and updating its core database resources through big data archiving, integrative analysis and value-added curation. Over the past year, CNCB-NGDC has expanded its collaborations with international databases and established new subcenters focusing on biodiversity, traditional Chinese medicine and tumor genetics. Substantial efforts have been made toward encompassing a broad spectrum of multi-omics data, developing innovative resources and enhancing existing resources. Notably, new resources have been developed for single-cell omics (scTWAS Atlas), genome and variation (VDGE), health and disease (CVD Atlas, CPMKG, Immunosenescence Inventory, HemAtlas, Cyclicpepedia, IDeAS), biodiversity and biosynthesis (RefMetaPlant, MASH-Ocean) and research tools (CCLHunter). All resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
BioKA: a curated and integrated biomarker knowledgebase for animalsYibo Wang, Yi‐Hao Lin, Sicheng Wu et al.|Nucleic Acids Research|2023 Biomarkers play an important role in various area such as personalized medicine, drug development, clinical care, and molecule breeding. However, existing animals' biomarker resources predominantly focus on human diseases, leaving a significant gap in non-human animal disease understanding and breeding research. To address this limitation, we present BioKA (Biomarker Knowledgebase for Animals, https://ngdc.cncb.ac.cn/bioka), a curated and integrated knowledgebase encompassing multiple animal species, diseases/traits, and annotated resources. Currently, BioKA houses 16 296 biomarkers associated with 951 mapped diseases/traits across 31 species from 4747 references, including 11 925 gene/protein biomarkers, 1784 miRNA biomarkers, 1043 mutation biomarkers, 773 metabolic biomarkers, 357 circRNA biomarkers and 127 lncRNA biomarkers. Furthermore, BioKA integrates various annotations such as GOs, protein structures, protein-protein interaction networks, miRNA targets and so on, and constructs an interactive knowledge network of biomarkers including circRNA-miRNA-mRNA associations, lncRNA-miRNA associations and protein-protein associations, which is convenient for efficient data exploration. Moreover, BioKA provides detailed information on 308 breeds/strains of 13 species, and homologous annotations for 8784 biomarkers across 16 species, and offers three online application tools. The comprehensive knowledge provided by BioKA not only advances human disease research but also contributes to a deeper understanding of animal diseases and supports livestock breeding.
iDog: a multi-omics resource for canids studyYanhu Liu, Yibo Wang, Jiani Sun et al.|Nucleic Acids Research|2024 iDog (https://ngdc.cncb.ac.cn/idog/) is a comprehensive public resource for domestic dogs (Canis lupus familiaris) and wild canids, designed to integrate multi-omics data and provide data services for the worldwide canine research community. Notably, iDog 2.0 features a 15-fold increase in genomic samples, including 29.55 million single nucleotide polymorphisms (SNPs) and 16.54 million insertions/deletions (InDels) from 1929 modern samples and 29.09 million SNPs from 111 ancient Canis samples. Additionally, 43487 breed-specific SNPs and 530 disease/trait-associated variants have been identified and integrated. The platform also includes data from 141 BioProjects involving gene expression analyses and a single-cell transcriptome module containing data from 105 057 Beagle hippocampus cells. iDog 2.0 also includes an epignome module that evaluates DNA methylation patterns across 547 samples and chromatin accessibility across 87 samples for the analysis of gene expression regulation. Additionally, it provies phenotypic data for 897 dog diseases, 3207 genotype-to-phenotype (G2P) pairs, and 349 dog disease-associated genes, along with two newly constructed ontologies for breed and disease standardization. Finally, 13 new analytical tools have been added. Given these enhancements, the updated iDog 2.0 is an invaluable resource for the global cannie research community.
EWAS Open Platform 2026: a deeply integrated resource for epigenome-wide association studiesFei Yang, Zhuang Xiong, Wenting Zong et al.|Nucleic Acids Research|2025 Epigenome-wide association studies (EWAS) has become an indispensable approach for elucidating the epigenetic basis of complex traits. EWAS Open Platform (https://ngdc.cncb.ac.cn/ewas/) includes three main components: EWAS Atlas (curated associations from publications), EWAS Data Hub (normalized DNA methylation array data), and EWAS Toolkit (one-stop analysis services). Here, we present a new release of EWAS Open Platform with the following significant updates and enhancements: (i) Expanded information: EWAS Atlas houses over 800 000 associations and incorporates an additional 17 000 curated causal relationships. EWAS Data Hub contains >180 000 batch-corrected samples, which incorporate data from newly added Illumina MethylationEPIC v2.0 BeadChip (935K) as well as trait-specific methylation profiles. EWAS Toolkit now provides an online batch correction tool and an interactive epigenetic causal network. (ii) Enhanced interoperability: data, knowledge, and toolkit are fully interconnected through a unified retrieval, offering integrated summaries and visualization capabilities. (iii) Artificial intelligence (AI)-based service: the platform is newly equipped with an AI-assisted question-answering service, allowing users to interactively explore EWAS-related questions and generate tailored insights. Taken together, EWAS Open Platform has undergone a significant upgrade across data resources, analytical tools, and service functionalities, offering more advanced support for unraveling complex molecular mechanisms from an epigenomic perspective.