Evaluating dimensionality reduction for genomic prediction

The development of genomic selection (GS) methods has allowed plant breeding programs to select favorable lines using genomic data before performing field trials. Improvements in genotyping technology have yielded high-dimensional genomic marker data which can be difficult to incorporate into statis...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in genetics Vol. 13; p. 958780
Main Authors Manthena, Vamsi, Jarquín, Diego, Varshney, Rajeev K., Roorkiwal, Manish, Dixit, Girish Prasad, Bharadwaj, Chellapilla, Howard, Reka
Format Journal Article
LanguageEnglish
Published Frontiers Media S.A 14.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The development of genomic selection (GS) methods has allowed plant breeding programs to select favorable lines using genomic data before performing field trials. Improvements in genotyping technology have yielded high-dimensional genomic marker data which can be difficult to incorporate into statistical models. In this paper, we investigated the utility of applying dimensionality reduction (DR) methods as a pre-processing step for GS methods. We compared five DR methods and studied the trend in the prediction accuracies of each method as a function of the number of features retained. The effect of DR methods was studied using three models that involved the main effects of line, environment, marker, and the genotype by environment interactions. The methods were applied on a real data set containing 315 lines phenotyped in nine environments with 26,817 markers each. Regardless of the DR method and prediction model used, only a fraction of features was sufficient to achieve maximum correlation. Our results underline the usefulness of DR methods as a key pre-processing step in GS models to improve computational efficiency in the face of ever-increasing size of genomic data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Alan Cleary, National Center for Genome Resources, United States
This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics
Reviewed by: Pierce Rafter, Teagasc, Ireland
Edited by: Zitong Li, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Australia
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2022.958780