Twelve years of SAMtools and BCFtools

Petr Danecek(Wellcome Sanger Institute), James Bonfield(Wellcome Sanger Institute), Jennifer Liddle(Wellcome Sanger Institute), John Marshall(University of Glasgow), Valeriu Ohan(Wellcome Sanger Institute), Martin Pollard(Wellcome Sanger Institute), Andrew Whitwham(Wellcome Sanger Institute), Thomas Keane(European Bioinformatics Institute), Shane McCarthy(Wellcome Sanger Institute), Robert M. Davies(Wellcome Sanger Institute), Heng Li(Harvard University)
GigaScience
January 29, 2021
Cited by 15,531Open Access
Full Text

Abstract

BACKGROUND: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. FINDINGS: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. CONCLUSION: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.


Related Papers