Genome of Solanum pimpinellifolium provides insights into structural variants during tomato breeding

Solanum pimpinellifolium (SP) is the wild progenitor of cultivated tomato. Because of its remarkable stress tolerance and intense flavor, SP has been used as an important germplasm donor in modern breeding of tomato. Here we present a high-quality chromosome-scale genome sequence of SP LA2093. Genome comparison identifies more than 92,000 high-confidence structural variants (SVs) between LA2093 and the modern cultivar, Heinz 1706.

Genotyping these SVs in ~600 representative tomato accessions unravels alleles under selection during tomato domestication, improvement and modern breeding, and discovers numerous novel SVs underlying genes known to regulate important breeding traits such as fruit weight and lycopene content. Expression quantitative trait locus (eQTL) analysis detects hotspots harboring master regulators controlling important fruit quality traits, including cuticular wax accumulation and flavonoid biosynthesis, and novel SVs contributing to these complex regulatory networks.

The LA2093 genome sequence and the identified SVs provide rich resources for future research and biodiversity-based breeding.

Authors: Xin Wang, Lei Gao, Chen Jiao, Stefanos Stravoravdis, Prashant S. Hosmani, Surya Saha, Jing Zhang, Samantha Mainiero, Susan R. Strickler, Carmen Catala, Gregory B. Martin, Lukas A. Mueller, Julia Vrebalov, James J. Giovannoni, Shan Wu, Zhangjun Fei