Main menu

NCM 2022: Direct high-throughput deconvolution of unnatural bases via nanopore sequencing and bootstrapped learning


The discovery of synthetic xeno-nucleic acids (XNAs) that can basepair as unnatural bases (UBs) to expand the genetic alphabet has spawned interest in many applications, from synthetic biology to DNA storage. However, the inability to read XNAs in a direct, high-throughput manner has been a significant limitation for xenobiology. We demonstrate that XNA-containing templates can be directly and robustly sequenced (>2.3 million reads/flowcell, similar to DNA controls) on a MinION sequencer from Oxford Nanopore Technologies to obtain signal data that is significantly different relative to natural bases (median fold-change >5). To enable training of machine learning models that directly call XNAs along with natural bases, we developed a framework to synthesize a complex pool of 1,024 XNA-containing templates with different sequence contexts and high XNA purity (>90% on average). Combining this with a bootstrapping approach to filter non-XNA-containing templates during machine learning, we show that a model can be trained to call natural as well as unnatural bases with high accuracy (>91%). These results highlight the versatility of nanopore sequencing as a platform for interrogating nucleic acids for xenobiology applications, and the potential to study genetic material beyond those that use canonical bases.

Authors: Mauricio Lisboa Perez

入门指南

购买 MinION 启动包 Nanopore 商城 测序服务提供商 全球代理商

纳米孔技术

订阅 Nanopore 更新 资源库及发表刊物 什么是 Nanopore 社区

关于 Oxford Nanopore

新闻 公司历程 可持续发展 领导团队 媒体资源和联系方式 投资者 合作者 在 Oxford Nanopore 工作 职位空缺 商业信息 BSI 27001 accreditationBSI 90001 accreditationBSI mark of trust
Chinese flag