Highly contiguous assemblies of 101 drosophilid genomes

Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus.

Recent advances in long read sequencing allow high quality genome assemblies for tens or even hundreds of species to be generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of high-quality assemblies for 101 lines of 95 drosophilid species encompassing 14 species groups and 35 sub-groups with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies.

These assemblies, along with detailed wet lab protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution within this key group.

Authors: Bernard Y. Kim, Jeremy R. Wang, Danny E. Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A. Comeault, David Peede, Emmanuel R. R. D’Agostino, Julianne Pelaez, Jessica M. Aguilar,, Diler Haji, Teruyuki Matsunaga, Ellie E. Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Pavle Erić, Jian-jun Gao, Takehiro K. Katoh, Masanori J. Toda,, Hideaki Watabe, Masayoshi Watada, Jeremy S. Davis, Leonie C. Moyle, Giulia Manoli, Enrico Bertolini, Vladimír Košťál, R. Scott Hawley,, Aya Takahashi, Corbin D. Jones, Donald K. Price, Noah Whiteman, Artyom Kopp, Daniel R. Matute, Dmitri A. Petrov