A chromosome scale tomato genome built from complementary PacBio and Nanopore sequences alone reveals extensive linkage drag during breeding

SUMMARY The assembly and scaffolding of plant crop genomes facilitate the characterization of genetically diverse cultivated and wild germplasm. The cultivated tomato (Solanum lycopersicum) has been improved through the introgression of genetic material from related wild species, including resistanc...

Full description

Saved in:
Bibliographic Details
Published inThe Plant journal : for cell and molecular biology Vol. 110; no. 2; pp. 572 - 588
Main Authors Rengs, Willem M. J., Schmidt, Maximilian H.‐W., Effgen, Sieglinde, Le, Duyen Bao, Wang, Yazhong, Zaidan, Mohd Waznul Adly Mohd, Huettel, Bruno, Schouten, Henk J., Usadel, Björn, Underwood, Charles J.
Format Journal Article
LanguageEnglish
Published England Blackwell Publishing Ltd 01.04.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:SUMMARY The assembly and scaffolding of plant crop genomes facilitate the characterization of genetically diverse cultivated and wild germplasm. The cultivated tomato (Solanum lycopersicum) has been improved through the introgression of genetic material from related wild species, including resistance to pandemic strains of tobacco mosaic virus (TMV) from Solanum peruvianum. Here we applied PacBio HiFi and ONT Nanopore sequencing to develop independent, highly contiguous and complementary assemblies of an inbred TMV‐resistant tomato variety. We show specific examples of how HiFi and ONT datasets can complement one another to improve assembly contiguity. We merged the HiFi and ONT assemblies to generate a long‐read‐only assembly where all 12 chromosomes were represented as 12 contiguous sequences (N50 = 68.5 Mbp). This chromosome scale assembly did not require scaffolding using an orthogonal data type. The merged assembly was validated by chromosome conformation capture data and is highly consistent with previous tomato genome assemblies that made use of genetic maps and Hi‐C for scaffolding. Our long‐read‐only assembly reveals that a complex series of structural variants linked to the TMV resistance gene likely contributed to linkage drag of a 64.1‐Mbp region of the S. peruvianum genome during tomato breeding. Through marker studies and ONT‐based comprehensive haplotyping we show that this minimal introgression region is present in six cultivated tomato hybrid varieties developed in three commercial breeding programs. Our results suggest that complementary long read technologies can facilitate the rapid generation of near‐complete genome sequences. Significance Statement Modern cultivated tomato (Solanum lycopersicum) hybrids are resistant to economically devastating pandemic strains of tobacco mosaic virus (TMV), due to an introgressed resistance gene from Solanum peruvianum. Our ‘long‐read‐only’ tomato genome assembly reveals, for the first time, that a complex series of structural variants in the TMV resistance locus likely contributed to a massive linkage drag of a 64.1‐Mbp region of ‘wild DNA’ from S. peruvianum during tomato breeding.
Bibliography:
These authors contributed equally to this work.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0960-7412
1365-313X
DOI:10.1111/tpj.15690