Haplotype-resolved assembly of a tetraploid potato genome using long reads and low-depth offspring data

Potato is one of the world's major staple crops and like many important crop plants it has a polyploid genome. Polyploid haplotype assembly poses a major computational challenge, hindering the use of genomic data in breeding strategies. Here, we introduce a novel strategy for the assembly of po...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Rebecca Serra Mari, Schrinner, Sven, Finkers, Richard, Arens, Paul, Schmidt, Maximilian H-W, Usadel, Björn, Klau, Gunnar W, Marschall, Tobias
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 10.05.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Potato is one of the world's major staple crops and like many important crop plants it has a polyploid genome. Polyploid haplotype assembly poses a major computational challenge, hindering the use of genomic data in breeding strategies. Here, we introduce a novel strategy for the assembly of polyploid genomes and present an assembly of the autotetraploid potato cultivar Altus. Our method uses low-depth sequencing data from an offspring population, which is available in many plant breeding settings, to achieve chromosomal clustering and haplotype phasing directly on the assembly graph. This involves a novel strategy for the analysis of k-mers unique to specific graph nodes. Our approach generates assemblies of individual chromosomes with phased haplotig N50 values of up to 13 Mb and haplotig lengths of up to 31 Mb. This major advance provides high-quality assemblies with haplotype-specific sequence resolution of whole chromosome arms and can be applied in common breeding scenarios where collections of offspring are available. Competing Interest Statement The authors have declared no competing interest.
DOI:10.1101/2022.05.10.491293