Swarm: robust and fast clustering method for amplicon-based studies

Frédéric Mahé(Centre National de la Recherche Scientifique), Torbjørn Rognes(Oslo University Hospital), Christopher Quince(University of Glasgow), Colomban de Vargas(Centre National de la Recherche Scientifique), Micah Dunthorn(University of Kaiserslautern)
PeerJ
September 25, 2014
Cited by 934Open Access
Full Text

Abstract

Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters' internal structure and amplicon abundances to refine its results. This fast, scalable, and input-order independent approach reduces the influence of clustering parameters and produces robust operational taxonomic units.


Related Papers