LncTar: a tool for predicting the RNA targets of long noncoding RNAsJianwei Li, Wei Ma, Pan Zeng et al.|Briefings in Bioinformatics|2014 Long noncoding RNAs (lncRNAs) represent a big category of noncoding RNA molecules, and increasing studies have shown that they play important roles in various critical biological processes. They show a diversity of functions through diverse mechanisms, among which regulating RNA molecules is one of the most popular ones. Given the big number of lncRNAs, it becomes urgent and important to predict the RNA targets of lncRNAs in a large scale for the comprehensive understanding of lncRNA functions and action mechanisms. Although several methods have been developed to predict RNA-RNA interactions, none of them can be used to predict the RNA targets of lncRNAs in a large scale. Here we presented a tool, LncTar, which shows the ability to efficiently predict the RNA targets of lncRNAs in a large scale. To test the accuracy of LncTar, we applied it to 10 experimentally supported lncRNA-mRNA interactions. As a result, LncTar successfully predicted 8 (80%) of the 10 lncRNA-mRNA pairs, suggesting that LncTar has a reliable accuracy. Finally, we believe that LncTar could be an efficient tool for the fast identification of the RNA targets of lncRNAs. LncTar is freely available at http://www.cuilab.cn/lnctar.
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2025The National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), offers a comprehensive suite of database resources to support the global scientific community. Amidst the unprecedented accumulation of multi-omics data, CNCB-NGDC is committed to continually evolving and updating its core database resources through big data archiving, integrative analysis and value-added curation. Over the past year, CNCB-NGDC has expanded its collaborations with international databases and established new subcenters focusing on biodiversity, traditional Chinese medicine and tumor genetics. Substantial efforts have been made toward encompassing a broad spectrum of multi-omics data, developing innovative resources and enhancing existing resources. Notably, new resources have been developed for single-cell omics (scTWAS Atlas), genome and variation (VDGE), health and disease (CVD Atlas, CPMKG, Immunosenescence Inventory, HemAtlas, Cyclicpepedia, IDeAS), biodiversity and biosynthesis (RefMetaPlant, MASH-Ocean) and research tools (CCLHunter). All resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
An analysis of human microbe–disease associationsWei Ma, Lu Zhang, Pan Zeng et al.|Briefings in Bioinformatics|2016 The microbiota living in the human body has critical impacts on our health and disease, but a systems understanding of its relationships with disease remains limited. Here, we use a large-scale text mining-based manually curated microbe-disease association data set to construct a microbe-based human disease network and investigate the relationships between microbes and disease genes, symptoms, chemical fragments and drugs. We reveal that microbe-based disease loops are significantly coherent. Microbe-based disease connections have strong overlaps with those constructed by disease genes, symptoms, chemical fragments and drugs. Moreover, we confirm that the microbe-based disease analysis is able to predict novel connections and mechanisms for disease, microbes, genes and drugs. The presented network, methods and findings can be a resource helpful for addressing some issues in medicine, for example, the discovery of bench knowledge and bedside clinical solutions for disease mechanism understanding, diagnosis and therapy.
Analysis of gut microbiota profiles and microbe-disease associations in children with autism spectrum disorders in ChinaMengxiang Zhang, Wei Ma, Juan Zhang et al.|Scientific Reports|2018 Autism spectrum disorder (ASD) is a set of complex neurodevelopmental disorders. Recent studies reported that children with ASD have altered gut microbiota profiles compared with typical development (TD) children. However, few studies on gut bacteria of children with ASD have been conducted in China. Here, in order to elucidate changes of fecal microbiota in children with ASD, 16S rRNA sequencing was conducted and the 16S rRNA (V3-V4) gene tags were amplified. We investigated differences in fecal microbiota between 35 children with ASD and 6 TD children. At the phylum level, the fecal microbiota of ASD group indicated a significant increase of the Bacteroidetes/Firmicutes ratio. At the genus level, we found that the relative abundance of Sutterella, Odoribacter and Butyricimonas was much more abundant in the ASD group whereas the abundance of Veillonella and Streptococcus was decreased significantly compared to the control group. Functional analysis demonstrated that butyrate and lactate producers were less abundant in the ASD group. In addition, we downloaded the association data set of microbe-disease from human microbe-disease association database and constructed a human disease network including ASD using our gut microbiome results. In this microbe-disease network based on microbe similarity of diseases, we found that ASD is positively correlated with periodontal, negatively related to type 1 diabetes. Therefore, these results suggest that microbe-based disease analysis is able to predict novel connection between ASD and other diseases and may play a role in revealing the pathogenesis of ASD.
Database Resources of the BIG Data Center in 2018Xingjian Xu, Lili Hao, Junwei Zhu et al.|Nucleic Acids Research|2017 The BIG Data Center at Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences provides freely open access to a suite of database resources in support of worldwide research activities in both academia and industry. With the vast amounts of omics data generated at ever-greater scales and rates, the BIG Data Center is continually expanding, updating and enriching its core database resources through big-data integration and value-added curation, including BioCode (a repository archiving bioinformatics tool codes), BioProject (a biological project library), BioSample (a biological sample library), Genome Sequence Archive (GSA, a data repository for archiving raw sequence reads), Genome Warehouse (GWH, a centralized resource housing genome-scale data), Genome Variation Map (GVM, a public repository of genome variations), Gene Expression Nebulas (GEN, a database of gene expression profiles based on RNA-Seq data), Methylation Bank (MethBank, an integrated databank of DNA methylomes), and Science Wikis (a series of biological knowledge wikis for community annotations). In addition, three featured web services are provided, viz., BIG Search (search as a service; a scalable inter-domain text search engine), BIG SSO (single sign-on as a service; a user access control system to gain access to multiple independent systems with a single ID and password) and Gsub (submission as a service; a unified submission service for all relevant resources). All of these resources are publicly accessible through the home page of the BIG Data Center at http://bigd.big.ac.cn.