periscope: sub-genomic RNA identification in SARS-CoV-2 ARTIC network nanopore sequencing data

We have developed periscope, a tool for the detection and quantification of sub-genomic RNA in ARTIC network protocol generated Nanopore SARS-CoV-2 sequence data.

We applied periscope to 1155 SARS-CoV-2 sequences from Sheffield, UK. Using a simple local alignment to detect reads which contain the leader sequence we were able to identify and quantify reads arising from canonical and non-canonical sub-genomic RNA. We were able to detect all canonical sub-genomic RNAs at expected abundances, with the exception of ORF10, suggesting that this is not a functional ORF. A number of recurrent non-canonical sub-genomic RNAs are detected.

We show that the results are reproducible using technical replicates and determine the optimum number of reads for sub-genomic RNA analysis. Finally variants found in genomic RNA are transmitted to sub-genomic RNAs with high fidelity in most cases.

This tool can be applied to tens of thousands of sequences worldwide to provide the most comprehensive analysis of SARS-CoV-2 sub-genomic RNA to date.

Authors: Matthew Daniel Parker, Benjamin B Lindsey, Shay Leary, Silvana Gaudieri, Abha Chopra, Matthew Wyles, Adrienn Angyal, Luke R Green, Paul Parsons, Rachel M Tucker, Rebecca Brown, Danielle Groves, Katie Johnson, Laura Carrilero, Joe Heffer, David Partridge, Cariad Evans, Mohammad Razza, Alexanda J Keeley, Nikki Smith, Dennis Wang, Simon Mallal, Thushan I de Silva