PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species

Derrick E. Fouts(J. Craig Venter Institute), Lauren Brinkac(J. Craig Venter Institute), Erin Beck(J. Craig Venter Institute), Jason Inman(J. Craig Venter Institute), Granger Sutton(J. Craig Venter Institute)
Nucleic Acids Research
August 13, 2012
Cited by 224Open Access
Full Text

Abstract

Pan-genome ortholog clustering tool (PanOCT) is a tool for pan-genomic analysis of closely related prokaryotic species or strains. PanOCT uses conserved gene neighborhood information to separate recently diverged paralogs into orthologous clusters where homology-only clustering methods cannot. The results from PanOCT and three commonly used graph-based ortholog-finding programs were compared using a set of four publicly available strains of the same bacterial species. All four methods agreed on ∼70% of the clusters and ∼86% of the proteins. The clusters that did not agree were inspected for evidence of correctness resulting in 85 high-confidence manually curated clusters that were used to compare all four methods.


Related Papers

No related papers found

Powered by citation graph analysis