Inference of Chromosome-Length Haplotypes Using Genomic Data of Three or a Few More Single Gametes

Bibliographic Details
Published in Molecular biology and evolution Vol. 37; no. 12; pp. 3684-3698
Main Authors Li, Ruidong; Qu, Han; Chen, Jinfeng; Wang, Shibo; Chater, John M; Zhang, Le; Wei, Julong; Zhang, Yuan-Ming; Xu, Chenwu; Zhong, Wei-De; Zhu, Jianguo; Lu, Jianming; Feng, Yuanfa; Chen, Weiming; Ma, Renyuan; Ferrante, Sergio Pietro; Roose, Mikeal L; Jia, Zhenyu
Format Journal Article
Language English
Published United States: Oxford University Press, 16.12.2020
Summary: Compared with genomic data of individual markers, haplotype data provide higher resolution for DNA variants, advancing our knowledge in genetics and evolution. Although many computational and experimental phasing methods have been developed for analyzing diploid genomes, it remains challenging to reconstruct chromosome-scale haplotypes at low cost, which constrains the utility of this valuable genetic resource. Gamete cells, the natural packaging of haploid complements, are ideal materials for phasing entire chromosomes because the majority of the haplotypic allele combinations has been preserved. Therefore, compared with the current diploid-based phasing methods, using haploid genomic data of single gametes may substantially reduce the complexity in inferring the donor’s chromosomal haplotypes. In this study, we developed the first easy-to-use R package, Hapi, for inferring chromosome-length haplotypes of individual diploid genomes with only a few gametes. Hapi outperformed other phasing methods when analyzing both simulated and real single gamete cell sequencing data sets. The results also suggested that chromosome-scale haplotypes may be inferred by using as few as three gametes, which has pushed the boundary to its possible limit. The single gamete cell sequencing technology allied with the cost-effective Hapi method will make large-scale haplotype-based genetic studies feasible and affordable, promoting the use of haplotype data in a wide range of research.
Funding: USDOE Office of Science (SC); National Institute of Food and Agriculture (NIFA); National Key Basic Research Program of China; National Natural Science Foundation of China (NSFC)
Grant numbers: AC02-05CH11231; 2013-67013-21110; 2015CB553706; 81571427; 201803040001; 201707010291; 81660426; 2017-5803
Ruidong Li and Han Qu contributed equally to this work.
ISSN: 0737-4038
EISSN: 1537-1719
DOI: 10.1093/molbev/msaa176