Main menu

Beware of Ogres: grass pea and the challenges of assembling large legume genomes


Grass pea (Lathyrus sativus) is exceptionally resilient to drought, flooding, and salinity. However, it contains a toxin which, when a lot of the plant is consumed over months, can cause paralysis from the waist down.

The problem of the presence of the toxin can be tackled through plant breeding; this requires an understanding of e.g. the genetics of toxin synthesis, for which a genome assembly is needed.

The grass pea genome is 6.3 Gb and highly repetitive, featuring ‘Ogre elements’, spanning up to 25 kbp.

Short-read assembly of grass pea produced a 6.2 Gb assembly across 1.6 million contigs.

Scaffolding the assembly with paired-end short reads increased contiguity, but introduced 2 billion Ns into the assembly.

Long-read nanopore sequencing on PromethION to 36x coverage + polishing with short reads produced a 6.2 Gb assembly ‘with no Ns’ in 163 contigs, with almost 3-fold improvement in contig N50 vs the scaffolded short-read assembly.

Gene annotation revealed 45k protein-coding genes & >75k transcripts. BUSCO completeness was 82-90%.

Authors: Peter Emmrich

入门指南

购买 MinION 启动包 Nanopore 商城 测序服务提供商 全球代理商

纳米孔技术

订阅 Nanopore 更新 资源库及发表刊物 什么是 Nanopore 社区

关于 Oxford Nanopore

新闻 公司历程 可持续发展 领导团队 媒体资源和联系方式 投资者 合作者 在 Oxford Nanopore 工作 职位空缺 商业信息 BSI 27001 accreditationBSI 90001 accreditationBSI mark of trust
Chinese flag