Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths

Marie Touchon(Centre National de la Recherche Scientifique), Claire Hoede(Inserm), Olivier Tenaillon(Inserm), Valérie Barbe(Commissariat à l'Énergie Atomique et aux Énergies Alternatives), Simon Baeriswyl(Délégation Paris 5), Philippe Bidet(Délégation Paris 7), Édouard Bingen(Délégation Paris 7), Stéphane Bonacorsi(Délégation Paris 7), Christiane Bouchier(Inserm), Odile Bouvet(Inserm), Alexandra Calteau(Centre National de la Recherche Scientifique), Hélène Chiapello(Mathématiques et Informatique Appliquées du Génome à l'Environnement), Olivier Clermont(Inserm), Stéphane Cruveiller(Centre National de la Recherche Scientifique), Antoine Danchin(Centre National de la Recherche Scientifique), Médéric Diard(Délégation Paris 5), Carole Dossat(Commissariat à l'Énergie Atomique et aux Énergies Alternatives), Meriem El Karoui(Université Paris Cité), Eric Frapy(Délégation Paris 5), Louis Garry(Inserm), Jean Marc Ghigo(Centre National de la Recherche Scientifique), Anne Marie Gilles(Centre National de la Recherche Scientifique), James R. Johnson(University of Minnesota), Chantal Le Bouguénec(Institut Pasteur), Mathilde Lescat(Inserm), Sophie Mangenot(Commissariat à l'Énergie Atomique et aux Énergies Alternatives), Vanessa Martinez-Jéhanne(Institut Pasteur), Ivan Matić(Délégation Paris 5), Xavier Nassif(Délégation Paris 5), Sophie Oztas(Commissariat à l'Énergie Atomique et aux Énergies Alternatives), Marie‐Agnès Petit(Mathématiques et Informatique Appliquées du Génome à l'Environnement), Christophe Pichon(Institut Pasteur), Zoé Rouy(Centre National de la Recherche Scientifique), Claude Saint Ruf(Délégation Paris 5), Dominique Schneider(Centre National de la Recherche Scientifique), Jérôme Tourret(Inserm), Benoît Vacherie(Commissariat à l'Énergie Atomique et aux Énergies Alternatives), David Vallenet(Centre National de la Recherche Scientifique), Claudine Médigue(Centre National de la Recherche Scientifique), Eduardo P. C. Rocha(Centre National de la Recherche Scientifique), Érick Denamur(Inserm)
PLoS Genetics
January 22, 2009
Cited by 1,199Open Access
Full Text

Abstract

The Escherichia coli species represents one of the best-studied model organisms, but also encompasses a variety of commensal and pathogenic strains that diversify by high rates of genetic change. We uniformly (re-) annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Within the approximately 18,000 families of orthologous genes, we found approximately 2,000 common to all strains. Although recombination rates are much higher than mutation rates, we show, both theoretically and using phylogenetic inference, that this does not obscure the phylogenetic signal, which places the B2 phylogenetic group and one group D strain at the basal position. Based on this phylogeny, we inferred past evolutionary events of gain and loss of genes, identifying functional classes under opposite selection pressures. We found an important adaptive role for metabolism diversification within group B2 and Shigella strains, but identified few or no extraintestinal virulence-specific genes, which could render difficult the development of a vaccine against extraintestinal infections. Genome flux in E. coli is confined to a small number of conserved positions in the chromosome, which most often are not associated with integrases or tRNA genes. Core genes flanking some of these regions show higher rates of recombination, suggesting that a gene, once acquired by a strain, spreads within the species by homologous recombination at the flanking genes. Finally, the genome's long-scale structure of recombination indicates lower recombination rates, but not higher mutation rates, at the terminus of replication. The ensuing effect of background selection and biased gene conversion may thus explain why this region is A+T-rich and shows high sequence divergence but low sequence polymorphism. Overall, despite a very high gene flow, genes co-exist in an organised genome.


Related Papers

No related papers found

Powered by citation graph analysis