Nanopore sequencing of the pharmacogene CYP2D6 allows simultaneous haplotyping and detection of duplications
13th March 2019 - BioRxiv
The accurate genotyping of CYP2D6 is hindered by the very polymorphic nature of the gene, high homology with its pseudogene CYP2D7, and the occurrence of structural variations. Long read sequencing offers the promise of overcoming some of these challenges, along with the advantage of straightforward variant phasing. We have established methods for sequencing and analysis of DNA amplicons containing the whole CYP2D6 gene, using the GridION nanopore sequencer.
Materials and methods
Seven reference and 25 clinical samples covering various haplotypes including gene duplication were barcoded and sequenced over two sequencing runs. Sequenced raw reads were analyzed using a pipeline of bioinformatics tools including two mapping tools and two variant calling tools.
Using minimap2 and nanopolish (mapping and variant calling tools respectively) resulted in the most accurate variant detection. Haplotypes of all samples could be detected accurately to the subvariant level in most samples, while the remaining samples had either novel variants or variant patterns not matched to the current PharmVar CYP2D6 haplotype database. Allele duplication could also be determined by analyzing the allelic balance between the sample haplotypes.
Nanopore sequencing of CYP2D6 offers a high throughput method for genotyping, accurate haplotyping, and detection of new variants and duplicated alleles.