Roary: rapid large-scale prokaryote pan genome analysis

Andrew J. Page(Wellcome Sanger Institute), Carla Cummins(Wellcome Sanger Institute), Martin Hunt(Wellcome Sanger Institute), Vanessa Wong(University of Cambridge), Sandra Reuter(University of Cambridge), Matthew T. G. Holden(University of St Andrews), María Fookes(Wellcome Sanger Institute), Daniel Falush(Swansea University), Jacqueline A. Keane(Wellcome Sanger Institute), Julian Parkhill(Wellcome Sanger Institute)
Bioinformatics
July 20, 2015
Cited by 5,862Open Access
Full Text

Abstract

UNLABELLED: A typical prokaryote population sequencing study can now consist of hundreds or thousands of isolates. Interrogating these datasets can provide detailed insights into the genetic structure of prokaryotic genomes. We introduce Roary, a tool that rapidly builds large-scale pan genomes, identifying the core and accessory genes. Roary makes construction of the pan genome of thousands of prokaryote samples possible on a standard desktop without compromising on the accuracy of results. Using a single CPU Roary can produce a pan genome consisting of 1000 isolates in 4.5 hours using 13 GB of RAM, with further speedups possible using multiple processors. AVAILABILITY AND IMPLEMENTATION: Roary is implemented in Perl and is freely available under an open source GPLv3 license from http://sanger-pathogens.github.io/Roary CONTACT: roary@sanger.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Related Papers

No related papers found

Powered by citation graph analysis