Calibration and validation of predicted genomic breeding values in an advanced cycle maize population

Bibliographic Details
Published in	Theoretical and applied genetics, Vol. 134, no. 9, pp. 3069-3081
Main Authors	Auinger, Hans-Jürgen; Lehermeier, Christina; Gianola, Daniel; Mayer, Manfred; Melchinger, Albrecht E.; da Silva, Sofia; Knaak, Carsten; Ouzunova, Milena; Schön, Chris-Carolin
Format	Journal Article
Language	English
Published	Berlin/Heidelberg: Springer Berlin Heidelberg, 01.09.2021 (Springer; Springer Nature B.V.)
Subjects	Agriculture; Biochemistry; Biomedical and Life Sciences; Biotechnology; Breeding; Calibration; Corn; Dry matter; Genomics; Life Sciences; Original; Original Article; Phenotypes; Plant Biochemistry; Plant Breeding/Biotechnology; Plant Genetics and Genomics; Predictions; Single nucleotide polymorphisms; Germany
Summary:	Key message: Model training on data from all selection cycles yielded the highest prediction accuracy by attenuating specific effects of individual cycles. Expected reliability was a robust predictor of accuracies obtained with different calibration sets. Abstract: The transition from phenotypic to genome-based selection requires a profound understanding of the factors that determine genomic prediction accuracy. We analysed experimental data from a commercial maize breeding programme to investigate whether genomic measures can assist in identifying optimal calibration sets for model training. The data set consisted of six contiguous selection cycles comprising testcrosses of 5968 doubled haploid lines genotyped with a minimum of 12,000 SNP markers. We evaluated genomic prediction accuracies in two independent prediction sets in combination with calibration sets differing in sample size and genomic measures (effective sample size, average maximum kinship, expected reliability, number of common polymorphic SNPs and linkage phase similarity). Our results indicate that, across selection cycles, prediction accuracies were as high as 0.57 for grain dry matter yield and 0.76 for grain dry matter content. Including data from all selection cycles in model training yielded the best results because interactions between calibration and prediction sets, as well as the effects of different testers and specific years, were attenuated. Among genomic measures, the expected reliability of genomic breeding values was the best predictor of empirical accuracies obtained with different calibration sets. For grain yield, a large difference between expected and empirical reliability was observed in one prediction set. We propose to use this difference as guidance for determining the weight that phenotypic data of a given selection cycle should receive in model retraining and for selection when both genomic breeding values and phenotypes are available.
Bibliography:	Communicated by Jose Crossa.
ISSN:	0040-5752; 1432-2242
DOI:	10.1007/s00122-021-03880-5
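
Two of the genomic measures named in the summary, average maximum kinship and expected reliability, can be illustrated with a minimal sketch. It assumes a GBLUP-type model with a known kinship matrix and a single assumed heritability `h2`; these are common textbook definitions and may differ from the exact formulas used in the article. The function names, the argument layout and the toy data are hypothetical.

```python
import numpy as np


def average_max_kinship(K, calib, pred):
    """Mean, over prediction-set lines, of the largest kinship coefficient
    each line shares with any calibration-set line (assumed definition)."""
    K_pc = K[np.ix_(pred, calib)]          # prediction x calibration block
    return float(K_pc.max(axis=1).mean())


def expected_reliability(K, calib, pred, h2):
    """Expected reliability of genomic breeding values under GBLUP
    (assumed formula): r2_i = k_i' (K_cc + lam * I)^-1 k_i / K_ii,
    with lam = (1 - h2) / h2."""
    lam = (1.0 - h2) / h2
    K_cc = K[np.ix_(calib, calib)]
    K_pc = K[np.ix_(pred, calib)]
    V = K_cc + lam * np.eye(len(calib))
    sol = np.linalg.solve(V, K_pc.T)       # shape: n_calib x n_pred
    numerator = np.einsum("ij,ji->i", K_pc, sol)
    return numerator / np.diag(K)[pred]


if __name__ == "__main__":
    # Toy data only: random 0/2 genotypes standing in for doubled haploid lines.
    rng = np.random.default_rng(1)
    M = rng.choice([0.0, 2.0], size=(30, 200))
    Z = M - M.mean(axis=0)
    K = Z @ Z.T / Z.shape[1]               # simple marker-based kinship
    calib, pred = list(range(20)), list(range(20, 30))
    print("average maximum kinship:", average_max_kinship(K, calib, pred))
    print("mean expected reliability:",
          expected_reliability(K, calib, pred, h2=0.5).mean())
```

Under these assumptions, candidate calibration sets could be compared by computing both quantities for the same prediction set and ranking the calibration sets by mean expected reliability.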