psBLUP: incorporating marker proximity for improving genomic prediction accuracy

Genomic selection entails the estimation of phenotypic traits of interest for plants without phenotype based on the association between single-nucleotide polymorphisms (SNPs) and phenotypic traits for plants with phenotype. Typically, the number of SNPs far exceeds the number of samples (high-dimens...

Full description

Saved in:

Bibliographic Details
Published in	Euphytica Vol. 218; no. 5
Main Authors	Bartzis, Georgios, Peeters, Carel F. W., Eeuwijk, Fred van
Format	Journal Article
Language	English
Published	Dordrecht Springer Netherlands 01.05.2022 Springer Springer Nature B.V
Subjects	Analysis Arabidopsis thaliana Biomedical and Life Sciences Biotechnology Genomics Life Sciences Markers Nucleotides Phenotypes Plant Genetics and Genomics Plant Pathology Plant Physiology Plant Sciences Proximity Regression models Regularization Regularization methods Single nucleotide polymorphisms Single-nucleotide polymorphism High-dimensional data BLUP Genomic selection Proximity smoothing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Genomic selection entails the estimation of phenotypic traits of interest for plants without phenotype based on the association between single-nucleotide polymorphisms (SNPs) and phenotypic traits for plants with phenotype. Typically, the number of SNPs far exceeds the number of samples (high-dimensionality) and, therefore, usage of regularization methods is common. The most common approach to estimate marker-trait associations uses the genomic best linear unbiased predictor (GBLUP) method, where a mixed model is fitted to the data. GBLUP has also been alternatively parameterized as a ridge regression model (RRBLUP). GBLUP/RRBLUP is based on the assumption of independence between predictor variables. However, it is to be expected that variables will be associated due to their genetic proximity. Here, we propose a regularized linear model (namely psBLUP: proximity smoothed BLUP) that explicitly models the dependence between predictor effects. We show that psBLUP can improve accuracy compared to the standard methods on both Arabidopsis thaliana data and Barley data.
ISSN:	0014-2336 1573-5060
DOI:	10.1007/s10681-022-03006-y