Main menu

GraphUnzip: unzipping assembly graphs with long reads and Hi-C


Long reads and Hi-C have revolutionized the field of genome assembly as they have made highly continuous assemblies accessible for challenging genomes. As haploid chromosome-level assemblies are now commonly achieved for all types of organisms, phasing assemblies has become the new frontier for genome reconstruction. Several tools have already been released using long reads and/or Hi-C to phase assemblies, but they all start from a linear sequence, and are ill-suited for non-model organisms with high levels of heterozygosity.

We present GraphUnzip, a fast, memory-efficient and accurate tool to unzip assembly graphs into their constituent haplotypes using long reads and/or Hi-C data. As GraphUnzip only connects sequences in the assembly graph that already had a potential link based on overlaps, it yields high-quality gap-less supercontigs. To demonstrate the efficiency of GraphUnzip, we tested it on a simulated diploid Escherichia coli genome, and on two real datasets for the genomes of the rotifer Adineta vaga and the potato Solanum tuberosum. In all cases, GraphUnzip yielded highly continuous phased assemblies.

Authors: Roland Faure, Nadège Guiglielmoni, Jean-François Flot

入門

MinION Starter Packを購入 ナノポア製品の販売 シークエンスサービスプロバイダー グローバルディストリビューター

ナノポア技術

ナノポアの最新ニュースを購読 リソースと発表文献 Nanopore Communityとは

Oxford Nanoporeについて

ニュース 会社沿革 持続可能性 経営陣 メディアリソース & お問い合わせ先 投資家向け パートナー向け Oxford Nanopore社で働く 現在の募集状況 営業上の情報 BSI 27001 accreditationBSI 90001 accreditationBSI mark of trust
Japanese flag