Clairvoyante: a multi-task convolutional deep neural network for variant calling in Single Molecule Sequencing

Return to Resources

The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5%-15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieved 99.73%, 97.68% and 95.36% precision on known variants, and 98.65%, 92.57%, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than two hours on a standard server. Furthermore, we identified 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source (https://github.com/aquaskyline/Clairvoyante), with modules to train, utilize and visualize the model.

Authors: Ruibang Luo, Fritz J Sedlazeck, Tak-Wah Lam, Michael Schatz

Nature Communications - Full text

Discover nanopore sequencing

Explore products

Research

Techniques

Focus areas

Resources

Documentation

Nanopore Learning

Company

News & Events

Global partners

Clairvoyante: a multi-task convolutional deep neural network for variant calling in Single Molecule Sequencing

Download

Getting started

Nanopore technology

About Oxford Nanopore

Contact us

London Calling 2025

Discover nanopore sequencing

Explore products

Research

Techniques

Focus areas

Resources

Documentation

Nanopore Learning

Company

News & Events

Global partners

Clairvoyante: a multi-task convolutional deep neural network for variant calling in Single Molecule Sequencing

Download