Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Arang Rhie(National Institutes of Health), Brian P. Walenz(National Institutes of Health), Sergey Koren(National Institutes of Health), Adam M. Phillippy(National Institutes of Health)
Genome biology
September 14, 2020
Cited by 3,418Open Access
Full Text

Abstract

Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.


Related Papers

No related papers found

Powered by citation graph analysis