A Bayesian genomic selection approach incorporating prior feature ordering and population structures with application to coronary artery disease

Coronary artery disease is one of the most common types of cardiovascular disease. Death from coronary heart disease is influenced by genetic factors in both women and men. In this article, we propose a novel Bayesian variable selection framework for the identification of important genetic variants...

Full description

Saved in:
Bibliographic Details
Published inStatistical methods in medical research p. 9622802231181231
Main Authors Dai, Xiaotian, Lu, Xuewen, Chekouo, Thierry
Format Journal Article
LanguageEnglish
Published England 01.08.2023
Subjects
Online AccessGet more information

Cover

Loading…
More Information
Summary:Coronary artery disease is one of the most common types of cardiovascular disease. Death from coronary heart disease is influenced by genetic factors in both women and men. In this article, we propose a novel Bayesian variable selection framework for the identification of important genetic variants associated with coronary artery disease disease status. Instead of treating each feature independently as in conventional Bayesian variable selection methods, we propose an innovative prior for the inclusion probabilities of genetic variants that accounts for their ordering structure. We assume that neighboring variants are more likely to be selected together as they tend to be highly correlated and have similar biological functions. Additionally, we propose to group participating subjects based on underlying population structure and fit separate regressions, so that the regression coefficients can better reflect different disease risks in different population groups. Our approach borrows strength across regression models through an innovative prior inspired by the Markov random fields. The proposed framework can improve variable selection and prediction performances as demonstrated in the simulation studies. We also apply the proposed framework to the CATHeterization GENetics data with binary Coronary artery disease disease status.
ISSN:1477-0334
DOI:10.1177/09622802231181231