NanoCLUST: a species-level analysis of 16S rRNA nanopore sequencing data

Summary NanoCLUST is an analysis pipeline for classification of amplicon-based full-length 16S rRNA nanopore reads. It is characterized by an unsupervised read clustering step, based on Uniform Manifold Approximation and Projection (UMAP), followed by the construction of a polished read and subsequent Blast classification.

Here we demonstrate that NanoCLUST performs better than other state-of-the-art software in the characterization of two commercial mock communities, enabling accurate bacterial identification and abundance profile estimation at species level resolution.

Authors: Héctor Rodríguez-Pérez, Laura Ciuffreda, Carlos Flores