Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global academic and industrial communities. With the explosive accumulation of multi-omics data generated at an unprecedented rate, CNCB-NGDC constantly expands and updates core database resources by big data archive, integrative analysis and value-added curation. In the past year, efforts have been devoted to integrating multiple omics data, synthesizing the growing knowledge, developing new resources and upgrading a set of major resources. Particularly, several database resources are newly developed for infectious diseases and microbiology (MPoxVR, KGCoV, ProPan), cancer-trait association (ASCancer Atlas, TWAS Atlas, Brain Catalog, CCAS) as well as tropical plants (TCOD). Importantly, given the global health threat caused by monkeypox virus and SARS-CoV-2, CNCB-NGDC has newly constructed the monkeypox virus resource, along with frequent updates of SARS-CoV-2 genome sequences, variants as well as haplotypes. All the resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2024The National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support the global academic and industrial communities. With the rapid accumulation of multi-omics data at an unprecedented pace, CNCB-NGDC continuously expands and updates core database resources through big data archiving, integrative analysis and value-added curation. Importantly, NGDC collaborates closely with major international databases and initiatives to ensure seamless data exchange and interoperability. Over the past year, significant efforts have been dedicated to integrating diverse omics data, synthesizing expanding knowledge, developing new resources, and upgrading major existing resources. Particularly, several database resources are newly developed for the biodiversity of protists (P10K), bacteria (NTM-DB, MPA) as well as plant (PPGR, SoyOmics, PlantPan) and disease/trait association (CROST, HervD Atlas, HALL, MACdb, BioKA, BioKA, RePoS, PGG.SV, NAFLDkb). All the resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
Database Resources of the National Genomics Data Center in 2020Zhang Zhang, Wenming Zhao, Jingfa Xiao et al.|Nucleic Acids Research|2019 The National Genomics Data Center (NGDC) provides a suite of database resources to support worldwide research activities in both academia and industry. With the rapid advancements in higher-throughput and lower-cost sequencing technologies and accordingly the huge volume of multi-omics data generated at exponential scales and rates, NGDC is continually expanding, updating and enriching its core database resources through big data integration and value-added curation. In the past year, efforts for update have been mainly devoted to BioProject, BioSample, GSA, GWH, GVM, NONCODE, LncBook, EWAS Atlas and IC4R. Newly released resources include three human genome databases (PGG.SNV, PGG.Han and CGVD), eLMSG, EWAS Data Hub, GWAS Atlas, iSheep and PADS Arsenal. In addition, four web services, namely, eGPS Cloud, BIG Search, BIG Submission and BIG SSO, have been significantly improved and enhanced. All of these resources along with their services are publicly accessible at https://bigd.big.ac.cn.
iSheep: an Integrated Resource for Sheep Genome, Variant and PhenotypeZhonghuang Wang, Qianghui Zhu, Xin Li et al.|Frontiers in Genetics|2021 DATA REPORT article Front. Genet., 17 August 2021 | https://doi.org/10.3389/fgene.2021.714852
Comparative genomic sequencing to characterize <i>Mycoplasma pneumoniae</i> genome, typing, and drug resistanceYue Jiang, Hailong Kang, Haiwei Dou et al.|Microbiology Spectrum|2024 ABSTRACT To analyze the characteristics of Mycoplasma pneumoniae as well as macrolide antibiotic resistance through whole-genome sequencing and comparative genomics. Thirteen clinical strains isolated from 2003 to 2019 were selected, 10 of which were resistant to erythromycin (MIC >64 µg/mL), including 8 P1-type I and 2 P1-type II. Three were sensitive (<1 µg/mL) and P1-type II. One resistant strain had an A→G point mutation at position 2064 in region V of the 23S rRNA, the others had it at position 2063, while the three sensitive strains had no mutation here. Genome assembly and comparative genome analysis revealed a high level of genome consistency within the P1 type, and the primary differences in genome sequences concentrated in the region encoding the P1 protein. In P1-type II strains, three specific gene mutations were identified: C162A and A430G in L4 gene and T1112G mutation in the CARDS gene. Clinical information showed seven cases were diagnosed with severe pneumonia, all of which were infected with drug-resistant strains. Notably, BS610A4 and CYM219A1 exhibited a gene multi-copy phenomenon and shared a conserved functional domain with the DUF31 protein family. Clinically, the patients had severe refractory pneumonia, with pleural effusion, necessitating treatment with glucocorticoids and bronchoalveolar lavage. The primary variations between strains occur among different P1-types, while there is a high level of genomic consistency within P1-types. Three mutation loci associated with specific types were identified, and no specific genetic alterations directly related to clinical presentation were observed. IMPORTANCE Mycoplasma pneumoniae is an important pathogen of community-acquired pneumonia, and macrolide resistance brings difficulties to clinical treatment. We analyzed the characteristics of M. pneumoniae as well as macrolide antibiotic resistance through whole-genome sequencing and comparative genomics. The work addressed primary variations between strains that occur among different P1-types, while there is a high level of genomic consistency within P1-types. In P1-type II strains, three specific gene mutations were identified: C162A and A430G in L4 gene and T1112G mutation in the CARDS gene. All the strains isolated from severe pneumonia cases were drug-resistant, two of which exhibited a gene multi-copy phenomenon, sharing a conserved functional domain with the DUF31 protein family. Three mutation loci associated with specific types were identified, and no specific genetic alterations directly related to clinical presentation were observed.