A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts

One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in genetics Vol. 13; p. 899523
Main Authors Pärna, Katri, Nolte, Ilja M, Snieder, Harold, Fischer, Krista, Marnetto, Davide, Pagani, Luca
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 18.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
These authors have contributed equally to this work
Diptavo Dutta, Johns Hopkins University, United States
This article was submitted to Evolutionary and Population Genetics, a section of the journal Frontiers in Genetics
Reviewed by: Arslan A. Zaidi, University of Pennsylvania, United States
Edited by: Charleston Chiang, University of Southern California, United States
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2022.899523