The Complete Genome Sequence of <i>Escherichia coli</i> K-12

Frederick R. Blattner(University of Wisconsin–Madison), Guy Plunkett(University of Wisconsin–Madison), Craig A. Bloch(University of Wisconsin–Madison), Nicole T. Perna(University of Wisconsin–Madison), Valerie Burland(University of Wisconsin–Madison), Monica Riley(University of Wisconsin–Madison), Julio Collado‐Vides(University of Wisconsin–Madison), Jeremy D. Glasner(University of Wisconsin–Madison), Christopher K. Rode(University of Wisconsin–Madison), George F. Mayhew(University of Wisconsin–Madison), J. W. GREGOR(University of Wisconsin–Madison), N. Wayne Davis(University of Wisconsin–Madison), Heather Kirkpatrick(University of Wisconsin–Madison), Michael A. Goeden(University of Wisconsin–Madison), Debra J. Rose(University of Wisconsin–Madison), Bob Mau(University of Wisconsin–Madison), Ying Shao(University of Wisconsin–Madison)
Science
September 5, 1997
Cited by 7,506

Abstract

The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.


Related Papers

No related papers found

Powered by citation graph analysis