Main menu

NCM 2022: SeqScreen-Nano: functional and taxonomic characterization of long read metagenomic data


Affordable long read sequencing has enabled a wide variety of metagenomic analysis tasks, from obtaining high quality genome assemblies to identifying structural variants. Though long reads offer better resolution, accurate assignment of functional and taxonomic labels to Oxford Nanopore sequences remains an open challenge. Here we present a solution to this challenge, building upon SeqScreen and adapting it to identify Functions of Sequences of Concern (FunSoCs) on Oxford Nanopore data. The taxonomic assignment over the entire read is carried out using a combination of a majority voting heuristic and greedy weighted min-set cover approach and refined using a reference-based approach that uses breadth of coverage information to separate closely related species in the sample. We show that on simulated and synthetic metagenomic data, SeqScreen-Nano can identify Open Reading Frames (ORFs) across the length of raw Oxford Nanopore reads and use it to accurately assign functional and taxonomic labels.

Download the PDF

Getting started

Buy a MinION starter pack Nanopore store Sequencing service providers Channel partners

Nanopore technology

Subscribe to Nanopore updates Resources and publications What is the Nanopore Community

About Oxford Nanopore

News Company timeline Sustainability Leadership team Media resources & contacts For investors For partners Working at Oxford Nanopore Current vacancies Commercial information BSI 27001 accreditationBSI 90001 accreditationBSI mark of trust
Spanish flag