Fig. 1 Pore-C a) overview of laboratory workflow for plants b) multi-contact reads
Nanopore reads can reach hundreds of kilobases in length, which greatly assists the process of reconstructing genomes, resulting in highly contiguous de novo assemblies. Assembly contiguity can be increased further by the addition of data from Pore-C, a technique which investigates the folded state of a genome. Here we exploit the fact that genomic regions which are close in 3D space also tend to be close in the primary sequence. In the laboratory protocol, genomic DNA is first cross-linked to nuclear proteins, preserving the spatial proximity of loci. Restriction digestion and proximity ligation are used to join cross-linked fragments, which are then sequenced (Fig. 1a). The long nanopore Pore-C reads are derived from multiple genomic loci (Fig. 1b).