Merqury: reference-free quality and phasing assessment for genome assemblies

Recent long-read assemblies often exceed the quality of available reference genomes, which makes validation against a reference challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness.
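The k-mer comparison described above can be illustrated with a minimal Python sketch. It is not Merqury's implementation: it uses plain in-memory sets of distinct k-mers rather than an efficient k-mer counter, and the function names (canonical_kmers, qv_estimate, completeness) are hypothetical. The sketch assumes that k-mers found only in the assembly indicate base errors, and that completeness is the fraction of reliable read k-mers recovered in the assembly. How "reliable" read k-mers are selected (for example, by a count threshold) is left to the caller.

```python
import math

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_kmers(seq, k):
    """Return the set of canonical k-mers in seq.

    A canonical k-mer is the lexicographically smaller of a k-mer and its
    reverse complement. K-mers containing non-ACGT characters are skipped.
    """
    seq = seq.upper()
    kmers = set()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) - set("ACGT"):
            continue
        revcomp = kmer.translate(_COMPLEMENT)[::-1]
        kmers.add(min(kmer, revcomp))
    return kmers


def qv_estimate(asm_kmers, read_kmers, k):
    """Phred-scaled consensus quality from k-mer survival (a sketch).

    Assumption: a k-mer is error-free only if all k of its bases are
    correct, so the per-base accuracy is (shared / total) ** (1 / k).
    """
    total = len(asm_kmers)
    asm_only = len(asm_kmers - read_kmers)
    if asm_only == 0:
        return math.inf  # no assembly-only k-mers were observed
    per_base_accuracy = (1 - asm_only / total) ** (1 / k)
    return -10 * math.log10(1 - per_base_accuracy)


def completeness(asm_kmers, reliable_read_kmers):
    """Fraction of reliable read k-mers that also occur in the assembly."""
    found = len(reliable_read_kmers & asm_kmers)
    return found / len(reliable_read_kmers)
```

With real data the two sets would be built from the assembly and the read set at the same k. The set-based version above is only practical for toy inputs.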

For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, are provided for evaluating assembly quality. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.
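The trio-based evaluation can be sketched in the same spirit. The example below is an illustrative assumption rather than Merqury's algorithm: it takes parent-specific k-mers that are also seen in the child as haplotype markers, labels each marker position along a contig, and counts every change of label between consecutive markers as a switch. All function names are hypothetical. For brevity, k-mers are not canonicalized and no tolerance for short-range switches is applied.

```python
def parent_specific(mother_kmers, father_kmers, child_kmers):
    """Return (maternal, paternal) marker sets.

    A marker is a k-mer present in exactly one parent and inherited by
    the child.
    """
    maternal = (mother_kmers - father_kmers) & child_kmers
    paternal = (father_kmers - mother_kmers) & child_kmers
    return maternal, paternal


def marker_track(seq, k, maternal, paternal):
    """List (position, label) for each marker k-mer found along seq."""
    track = []
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in maternal:
            track.append((i, "M"))
        elif kmer in paternal:
            track.append((i, "P"))
    return track


def count_switches(track):
    """Count label changes between consecutive markers."""
    labels = [label for _, label in track]
    return sum(1 for a, b in zip(labels, labels[1:]) if a != b)
```

In this simplified view, a run of consecutive markers carrying the same label corresponds to a phase block, and each label change ends one block and starts the next.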

Authors: Arang Rhie, Brian P. Walenz, Sergey Koren, Adam M. Phillippy