Chen Ye

The sequence and de novo assembly of the giant panda genome

Ruiqiang Li, Wei Fan, Geng Tian et al.|Nature|2009

Cited by 1.2k|Open Access

Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.

The genome of the giant panda, specifically of the female Beijing Olympics mascot Jingjing, has been determined using short-read sequencing technology, a first for such a complex genome. It consists of some 2.4 billion DNA base pairs, compared to 3 billion in humans, and contains around 21,000 protein-encoding genes, similar to the human genome. Genomic diversity reflected in the sequence is high, raising hopes that despite a population of only about 2,500, conservation efforts can keep the species from extinction. Intriguingly, the panda appears to have all the genes needed for a carnivorous digestive system but lacks digestive cellulase genes. It may therefore depend on its gut microbiome to handle its famously limited bamboo diet. Taste may be a diet-limiting factor: loss of function of the T1R1 gene means that pandas may not experience the umami taste associated with high-protein foods. Technical aspects of this work pave the way for the use of next-generation sequencing for rapid de novo assembly of large eukaryotic genomes.

Here, a draft sequence of the giant panda genome is assembled using next-generation sequencing technology alone. Genome analysis reveals a low divergence rate in comparison with dog and human genomes and insights into panda-specific traits; for example, the giant panda's bamboo diet may be more dependent on its gut microbiome than its own genetic composition.

The Genomes of Oryza sativa: A History of Duplications

Jun Yu, Jun Wang, Wei Lin et al.|PLoS Biology|2005

Cited by 1k|Open Access

We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000-40,000. Only 2%-3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.

Single-Cell Exome Sequencing and Monoclonal Evolution of a JAK2-Negative Myeloproliferative Neoplasm

Yong Hou, Luting Song, Ping Zhu et al.|Cell|2012

Cited by 592|Open Access

The sheep genome illuminates biology of the rumen and lipid metabolism

Yu Jiang, Min Xie, Wenbin Chen et al.|Science|2014

Cited by 556|Open Access

Sheep (Ovis aries) are a major source of meat, milk, and fiber in the form of wool and represent a distinct class of animals that have a specialized digestive organ, the rumen, that carries out the initial digestion of plant material. We have developed and analyzed a high-quality reference sheep genome and transcriptomes from 40 different tissues. We identified highly expressed genes encoding keratin cross-linking proteins associated with rumen evolution. We also identified genes involved in lipid metabolism that had been amplified and/or had altered tissue expression patterns. This may be in response to changes in the barrier lipids of the skin, an interaction between lipid metabolism and wool synthesis, and an increased role of volatile fatty acids in ruminants compared with nonruminant animals.

Oncofetal long noncoding RNA PVT1 promotes proliferation and stem cell-like property of hepatocellular carcinoma cells by stabilizing NOP2

Fang Wang, Ji‐hang Yuan, Shao-Bing Wang et al.|Hepatology|2014

Cited by 458|Open Access

Many protein-coding oncofetal genes are highly expressed in murine and human fetal liver and silenced in adult liver. The protein products of these hepatic oncofetal genes have been used as clinical markers for the recurrence of hepatocellular carcinoma (HCC) and as therapeutic targets for HCC. Herein we examined the expression profiles of long noncoding RNAs (lncRNAs) found in fetal and adult liver in mice. Many fetal hepatic lncRNAs were identified; one of these, lncRNA-mPvt1, is an oncofetal RNA that was found to promote cell proliferation, cell cycling, and the expression of stem cell-like properties of murine cells. Interestingly, we found that human lncRNA-hPVT1 was up-regulated in HCC tissues and that patients with higher lncRNA-hPVT1 expression had a poor clinical prognosis. The protumorigenic effects of lncRNA-hPVT1 on cell proliferation, cell cycling, and stem cell-like properties of HCC cells were confirmed both in vitro and in vivo by gain-of-function and loss-of-function experiments. Moreover, mRNA expression profile data showed that lncRNA-hPVT1 up-regulated a series of cell cycle genes in SMMC-7721 cells. By RNA pulldown and mass spectrum experiments, we identified NOP2 as an RNA-binding protein that binds to lncRNA-hPVT1. We confirmed that lncRNA-hPVT1 up-regulated NOP2 by enhancing the stability of NOP2 proteins and that lncRNA-hPVT1 function depends on the presence of NOP2.

CONCLUSION: Our study demonstrates that the expression of many lncRNAs is up-regulated in early liver development and that the fetal liver can be used to search for new diagnostic markers for HCC. LncRNA-hPVT1 promotes cell proliferation, cell cycling, and the acquisition of stem cell-like properties in HCC cells by stabilizing NOP2 protein. Regulation of the lncRNA-hPVT1/NOP2 pathway may have beneficial effects on the treatment of HCC.


Top publications by citations