Main menu

AnVIL: An overlap-aware genome assembly scaffolder for linked reads


10X Genomics Chromium linked reads contain information that can be used to link sequences together into scaffolds in draft genome assemblies. Existing software for this purpose perform the scaffolding by joining sequences together with a gap between them, not considering potential contig overlaps. Such overlaps can be particularly prominent in genome drafts assembled from long-read sequencing data where an overlap-layout-consensus (OLC) algorithm has been used. Ignoring overlapping contig ends may result in genes and other features being incomplete or fragmented in the resulting scaffolds.

We developed the application AnVIL to generate scaffolds from genome drafts using 10X Chromium data, with a focus on minimizing the number of gaps in resulting scaffolds by incorporating an OLC step to resolve junctions between linked contigs. We tested the performance of AnVIL on three published and simulated datasets and compared to the previously published tools ARCS and ARKS.

The results revealed that AnVIL performed similarly considering contiguity statistics, and the advantage of the overlapping step was revealed by fewer long and short variants in AnVIL produced scaffolds, in addition to a higher proportion of completely assembled LTR retrotransposons. We expect AnVIL to have broad applicability in genome assembly projects that utilize 10X Chromium linked reads.

Authors: Markus Hiltunen, Martin Ryberg, Hanna Johannesson

入門

MinION Starter Packを購入 ナノポア製品の販売 シークエンスサービスプロバイダー グローバルディストリビューター

ナノポア技術

ナノポアの最新ニュースを購読 リソースと発表文献 Nanopore Communityとは

Oxford Nanoporeについて

ニュース 会社沿革 持続可能性 経営陣 メディアリソース & お問い合わせ先 投資家向け パートナー向け Oxford Nanopore社で働く 現在の募集状況 営業上の情報 BSI 27001 accreditationBSI 90001 accreditationBSI mark of trust
Japanese flag