What is a gene, post-ENCODE? History and updated definition

Mark Gerstein(Yale University), Can Bruce(Yale University), Joel Rozowsky(Yale University), Deyou Zheng(Yale University), Jiang Du(Yale University), Jan O. Korbel(Yale University), Olof Emanuelsson(Stockholm University), Zhengdong D. Zhang(Yale University), Sherman M. Weissman(Yale University), M Snyder(Yale University)
Genome Research
June 1, 2007
Cited by 685Open Access
Full Text

Abstract

While sequencing of the human genome surprised us with how many protein-coding genes there are, it did not fundamentally change our perspective on what a gene is. In contrast, the complex patterns of dispersed regulation and pervasive transcription uncovered by the ENCODE project, together with non-genic conservation and the abundance of noncoding RNA genes, have challenged the notion of the gene. To illustrate this, we review the evolution of operational definitions of a gene over the past century--from the abstract elements of heredity of Mendel and Morgan to the present-day ORFs enumerated in the sequence databanks. We then summarize the current ENCODE findings and provide a computational metaphor for the complexity. Finally, we propose a tentative update to the definition of a gene: A gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products. Our definition side-steps the complexities of regulation and transcription by removing the former altogether from the definition and arguing that final, functional gene products (rather than intermediate transcripts) should be used to group together entities associated with a single gene. It also manifests how integral the concept of biological function is in defining genes.


Related Papers

No related papers found

Powered by citation graph analysis