First record of a tandem-repeat region within the mitochondrial genome of Clonorchis sinensis using a long-read sequencing approach

Background

Mitochondrial genomes provide useful genetic markers for systematic and population genetic studies of parasitic helminths. Although many such genome sequences have been published and deposited in public databases, there is evidence that some of them are incomplete relating to an inability of conventional techniques to reliably sequence non-coding (repetitive) regions. In the present study, we characterise the complete mitochondrial genome—including the long, non-coding region—of the carcinogenic Chinese liver fluke, Clonorchis sinensis, using long-read sequencing.

Methods

The mitochondrial genome was sequenced from total high molecular-weight genomic DNA isolated from a pool of 100 adult worms of C. sinensis using the MinION sequencing platform (Oxford Nanopore Technologies), and assembled and annotated using an informatic approach.

Results

From > 93,500 long-reads, we assembled a 18,304 bp-mitochondrial genome for C. sinensis. Within this genome we identified a novel non-coding region of 4,549 bp containing six tandem-repetitive units of 719–809 bp each. Given that genomic DNA from pooled worms was used for sequencing, some variability in length/sequence in this tandem-repetitive region was detectable, reflecting population variation.

Conclusions

For C. sinensis, we report the complete mitochondrial genome, which includes a long (> 4.5 kb) tandem-repetitive region. The discovery of this non-coding region using a nanopore-sequencing/informatic approach now paves the way to investigating the nature and extent of length/sequence variation in this region within and among individual worms, both within and among C. sinensis populations, and to exploring whether this region has a functional role in the regulation of replication and transcription, akin to the mitochondrial control region in mammals. Although applied to C. sinensis, the technological approach established here should be broadly applicable to characterise complex tandem-repetitive or homo-polymeric regions in the mitochondrial genomes of a wide range of taxa.

Authors: Liina Kinkar, Neil D. Young , Woon-Mok Sohn, Andreas J. Stroehlein, Pasi K. Korhonen, Robin B. Gasser