Bayesian variable selection for high dimensional predictors and self-reported outcomes

The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic...

Full description

Saved in:
Bibliographic Details
Published inBMC medical informatics and decision making Vol. 20; no. 1; pp. 212 - 11
Main Authors Gu, Xiangdong, Tadesse, Mahlet G, Foulkes, Andrea S, Ma, Yunsheng, Balasubramanian, Raji
Format Journal Article
LanguageEnglish
Published England BioMed Central Ltd 07.09.2020
BioMed Central
BMC
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
AbstractList Abstract Background The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. Methods We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women’s Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Results Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women’s Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Conclusions Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
Background The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. Methods We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Results Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Conclusions Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports. Keywords: Bayesian variable selection, Self-reports, High dimensional data
The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error.BACKGROUNDThe onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error.We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women.METHODSWe adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women.Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement.RESULTSSimulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement.Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.CONCLUSIONSVariable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women's Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women's Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
Background The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are cost-effective; however, they are subject to error. Diagnosis of silent events may also occur through the use of imperfect laboratory-based diagnostic tests. In this paper, we describe an approach for variable selection in high dimensional datasets for settings in which the outcome is observed with error. Methods We adapt the spike and slab Bayesian Variable Selection approach in the context of error-prone, self-reported outcomes. The performance of the proposed approach is studied through simulation studies. An illustrative application is included using data from the Women’s Health Initiative SNP Health Association Resource, which includes extensive genotypic (>900,000 SNPs) and phenotypic data on 9,873 African American and Hispanic American women. Results Simulation studies show improved sensitivity of our proposed method when compared to a naive approach that ignores error in the self-reported outcomes. Application of the proposed method resulted in discovery of several single nucleotide polymorphisms (SNPs) that are associated with risk of type 2 diabetes in a dataset of 9,873 African American and Hispanic participants in the Women’s Health Initiative. There was little overlap among the top ranking SNPs associated with type 2 diabetes risk between the racial groups, adding support to previous observations in the literature of disease associated genetic loci that are often not generalizable across race/ethnicity populations. The adapted Bayesian variable selection algorithm is implemented in R. The source code for the simulations are available in the Supplement. Conclusions Variable selection accuracy is reduced when the outcome is ascertained by error-prone self-reports. For this setting, our proposed algorithm has improved variable selection performance when compared to approaches that neglect to account for the error-prone nature of self-reports.
ArticleNumber 212
Audience Academic
Author Tadesse, Mahlet G
Ma, Yunsheng
Gu, Xiangdong
Foulkes, Andrea S
Balasubramanian, Raji
Author_xml – sequence: 1
  givenname: Xiangdong
  surname: Gu
  fullname: Gu, Xiangdong
– sequence: 2
  givenname: Mahlet G
  surname: Tadesse
  fullname: Tadesse, Mahlet G
– sequence: 3
  givenname: Andrea S
  surname: Foulkes
  fullname: Foulkes, Andrea S
– sequence: 4
  givenname: Yunsheng
  surname: Ma
  fullname: Ma, Yunsheng
– sequence: 5
  givenname: Raji
  surname: Balasubramanian
  fullname: Balasubramanian, Raji
BackLink https://www.ncbi.nlm.nih.gov/pubmed/32894123$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAURiNURB_wB1igSGzYpPhtZ4NUKh6VKrEBttaNfTPjURIPdqZV_z2emRY6FUJe2Lo-91jX-k6roylOWFWvKTmn1Kj3mbKW0oYw0hDKGG9un1UnVGjWqFboo0fn4-o05xUhVBsuX1THnJlWUMZPqp8f4Q5zgKm-gRSgG7DOOKCbQ5zqPqZ6GRbL2ocRp1xKMNTrhD64OaZcw-S3dN8kXMc0o6_jZnZxxPyyet7DkPHV_X5W_fj86fvl1-b625ery4vrxknF58bznhAptFZKOXQ9cA2mBUJES32neiYZlcIbQR3zfUcASec4OIqibY0m_Ky62nt9hJVdpzBCurMRgt0VYlpYSHNwA1pgRHvkXHbOiU7LTkokFJ1rTacRaHF92LvWm25E73CaEwwH0sObKSztIt5YLYyWrSyCd_eCFH9tMM92DNnhMMCEcZMtE4IoJajRBX37BF3FTSrfu6MKpgWVf6kFlAHC1MfyrttK7YXissiUaQt1_g-qLI9jcCUyfSj1g4Y3jwf9M-FDKgpg9oBLMeeEvXVhhm0kijkMlhK7DaDdB9CWANpdAO1taWVPWh_s_2n6DZxy3Ys
CitedBy_id crossref_primary_10_1051_itmconf_20257601001
Cites_doi 10.2307/2534008
10.1080/01621459.1993.10476353
10.1214/11-AOAS463
10.1158/1055-9965.EPI-11-0524
10.1016/S0197-2456(97)00078-0
10.1080/01621459.2013.869223
10.1177/1740774508091749
10.1093/biomet/86.4.843
10.1214/13-BA846
10.1198/016214504000001565
10.1371/journal.pcbi.1002822
10.1214/15-AOAS810
10.1093/biomet/93.4.877
10.1214/08-AOAS169
10.1111/j.2517-6161.1976.tb01597.x
10.1007/s11033-013-2635-y
10.1111/j.0006-341X.2001.01048.x
10.1111/j.0006-341X.2003.00109.x
10.1214/11-AOAS455
10.2307/2530698
10.1198/016214504000000566
10.1111/j.0006-341X.2004.00233.x
10.1214/09-BA403
10.1201/b14832
10.1198/jasa.2010.tm08177
10.1093/bioinformatics/btl362
10.1111/jdi.12805
10.1007/s10985-010-9154-0
10.1038/jhg.2013.21
10.1080/01621459.1988.10478694
10.1093/biomet/90.1.171
10.1016/S0197-2456(00)00053-2
10.1093/bioinformatics/19.1.90
10.1371/journal.pone.0058655
10.4093/dmj.2015.39.3.188
10.1198/016214507000000554
10.1111/biom.12620
10.2337/db18-513-P
ContentType Journal Article
Copyright COPYRIGHT 2020 BioMed Central Ltd.
2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
The Author(s) 2020
Copyright_xml – notice: COPYRIGHT 2020 BioMed Central Ltd.
– notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: The Author(s) 2020
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QO
7SC
7X7
7XB
88C
88E
8AL
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M0T
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
Q9U
7X8
5PM
DOA
DOI 10.1186/s12911-020-01223-w
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Healthcare Administration Database (Alumni)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Journals
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One Community College
ProQuest Central Korea
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
ProQuest SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
ProQuest Health & Medical Collection
Healthcare Administration Database
Medical Database
Biological Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database (Proquest)
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
ProQuest Health Management (Alumni Edition)
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest Health Management
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList


MEDLINE - Academic
MEDLINE
Publicly Available Content Database
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Women's Studies
EISSN 1472-6947
EndPage 11
ExternalDocumentID oai_doaj_org_article_a207de335bcc4b75b55e01ecc98b7ea1
PMC7487595
A635066689
32894123
10_1186_s12911_020_01223_w
Genre Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NHLBI NIH HHS
  grantid: HHSN268201100046C
– fundername: WHI NIH HHS
  grantid: HHSN268201100001C
– fundername: NIA NIH HHS
  grantid: HHSN271201100004C
– fundername: WHI NIH HHS
  grantid: HHSN268201100003C
– fundername: WHI NIH HHS
  grantid: HHSN268201100002C
– fundername: WHI NIH HHS
  grantid: HHSN268201100004C
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
6PF
7X7
88E
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
AAWTL
AAYXX
ABDBF
ABUWG
ACGFO
ACGFS
ACIWK
ACPRK
ACUHS
ADBBV
ADUKV
AENEX
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
AQUVI
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CITATION
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
IHR
INH
INR
ITC
K6V
K7-
KQ8
LK8
M0T
M1P
M48
M7P
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
PROAC
PSQYO
RBZ
RNS
ROL
RPM
RSV
SMD
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XSB
-A0
3V.
ACRMQ
ADINQ
C24
CGR
CUY
CVF
ECM
EIF
M0N
NPM
PMFND
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
P64
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQUKI
PRINS
Q9U
7X8
5PM
PUEGO
ID FETCH-LOGICAL-c563t-d3f005477666cecfa37a89a00491db6f252154d841c2dfb0ae0bc3ac1e4998703
IEDL.DBID DOA
ISSN 1472-6947
IngestDate Wed Aug 27 01:31:18 EDT 2025
Thu Aug 21 18:04:30 EDT 2025
Fri Jul 11 16:05:52 EDT 2025
Fri Jul 25 10:43:52 EDT 2025
Tue Jun 17 21:19:52 EDT 2025
Tue Jun 10 20:44:19 EDT 2025
Thu Jan 02 22:57:49 EST 2025
Tue Jul 01 04:05:51 EDT 2025
Thu Apr 24 23:03:30 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Self-reports
Bayesian variable selection
High dimensional data
Language English
License Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c563t-d3f005477666cecfa37a89a00491db6f252154d841c2dfb0ae0bc3ac1e4998703
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
OpenAccessLink https://doaj.org/article/a207de335bcc4b75b55e01ecc98b7ea1
PMID 32894123
PQID 2444067415
PQPubID 42572
PageCount 11
ParticipantIDs doaj_primary_oai_doaj_org_article_a207de335bcc4b75b55e01ecc98b7ea1
pubmedcentral_primary_oai_pubmedcentral_nih_gov_7487595
proquest_miscellaneous_2440664187
proquest_journals_2444067415
gale_infotracmisc_A635066689
gale_infotracacademiconefile_A635066689
pubmed_primary_32894123
crossref_citationtrail_10_1186_s12911_020_01223_w
crossref_primary_10_1186_s12911_020_01223_w
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2020-09-07
PublicationDateYYYYMMDD 2020-09-07
PublicationDate_xml – month: 09
  year: 2020
  text: 2020-09-07
  day: 07
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle BMC medical informatics and decision making
PublicationTitleAlternate BMC Med Inform Decis Mak
PublicationYear 2020
Publisher BioMed Central Ltd
BioMed Central
BMC
Publisher_xml – name: BioMed Central Ltd
– name: BioMed Central
– name: BMC
References W Bush (1223_CR33) 2012; 8
S Dixit (1223_CR35) 2013; 8
K Lee (1223_CR15) 2003; 19
1223_CR40
G Anderson (1223_CR1) 1998; 19
J Neuhaus (1223_CR11) 1999; 86
N Jeoung (1223_CR36) 2015; 39
R Balasubramanian (1223_CR5) 2003; 90
S Snapinn (1223_CR8) 1998; 54
S Kim (1223_CR20) 2006; 93
D Dunson (1223_CR21) 2008; 103
T Mitchell (1223_CR13) 1988; 83
R Balasubramanian (1223_CR4) 2001; 57
E George (1223_CR14) 1993; 88
Y Chang (1223_CR41) 2018; 9
1223_CR29
1223_CR34
V Rockova (1223_CR26) 2014; 109
V Rockova (1223_CR25) 2014; 9
K McKeown (1223_CR6) 2010; 16
1223_CR32
S Habib (1223_CR39) 2018; Supplement 1
T Cook (1223_CR10) 2004; 99
M Tadesse (1223_CR19) 2005; 100
R Jacobs (1223_CR27) 2017; 28
K Margolis (1223_CR30) 2008; 5
D Finkelstein (1223_CR3) 1986; 42
N Sha (1223_CR17) 2006; 22
X Gu (1223_CR12) 2015; 9
Y Guan (1223_CR18) 2011; 5
S Chen (1223_CR28) 2017; 73
F Stingo (1223_CR23) 2011; 5
C Collares (1223_CR42) 2013; 40
C Hutter (1223_CR43) 2011; 20
B Turnbull (1223_CR2) 1976; 38
S Hasstedt (1223_CR38) 2013; 58
F Li (1223_CR22) 2010; 105
R O’Hara (1223_CR24) 2009; 4
T Cook (1223_CR9) 2000; 21
H Ishwaran (1223_CR31) 2008; 2
A Meier (1223_CR7) 2003; 59
N Sha (1223_CR16) 2004; 60
1223_CR37
References_xml – volume: 54
  start-page: 209
  year: 1998
  ident: 1223_CR8
  publication-title: Biometrics
  doi: 10.2307/2534008
– volume: 88
  start-page: 881
  issue: 423
  year: 1993
  ident: 1223_CR14
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.1993.10476353
– volume: 5
  start-page: 1978
  issue: 3
  year: 2011
  ident: 1223_CR23
  publication-title: Ann Appl Stat
  doi: 10.1214/11-AOAS463
– volume: 20
  start-page: 1950
  year: 2011
  ident: 1223_CR43
  publication-title: Cancer Epidemiol Biomarkers Prev
  doi: 10.1158/1055-9965.EPI-11-0524
– ident: 1223_CR32
– volume: 19
  start-page: 61
  issue: 1
  year: 1998
  ident: 1223_CR1
  publication-title: Control Clin Trials
  doi: 10.1016/S0197-2456(97)00078-0
– volume: 109
  start-page: 828
  year: 2014
  ident: 1223_CR26
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.2013.869223
– volume: 5
  start-page: 240
  year: 2008
  ident: 1223_CR30
  publication-title: Clin Trials
  doi: 10.1177/1740774508091749
– volume: 86
  start-page: 843
  year: 1999
  ident: 1223_CR11
  publication-title: Biometrika
  doi: 10.1093/biomet/86.4.843
– volume: 9
  start-page: 221
  year: 2014
  ident: 1223_CR25
  publication-title: Bayesian Anal
  doi: 10.1214/13-BA846
– volume: 100
  start-page: 602
  issue: 470
  year: 2005
  ident: 1223_CR19
  publication-title: J Am Stat Assoc
  doi: 10.1198/016214504000001565
– volume: 8
  start-page: e1X002822
  year: 2012
  ident: 1223_CR33
  publication-title: PLoS Comput Biol
  doi: 10.1371/journal.pcbi.1002822
– volume: 9
  start-page: 714
  issue: 2
  year: 2015
  ident: 1223_CR12
  publication-title: Ann Appl Stat
  doi: 10.1214/15-AOAS810
– volume: 93
  start-page: 877
  issue: 4
  year: 2006
  ident: 1223_CR20
  publication-title: Biometrika
  doi: 10.1093/biomet/93.4.877
– volume: 2
  start-page: 841
  issue: 3
  year: 2008
  ident: 1223_CR31
  publication-title: Ann Appl Stat
  doi: 10.1214/08-AOAS169
– ident: 1223_CR40
– volume: 38
  start-page: 290
  year: 1976
  ident: 1223_CR2
  publication-title: J R Stat Soc Ser B Methodol
  doi: 10.1111/j.2517-6161.1976.tb01597.x
– volume: 40
  start-page: 5351
  year: 2013
  ident: 1223_CR42
  publication-title: Mol Biol Rep
  doi: 10.1007/s11033-013-2635-y
– volume: 57
  start-page: 1048
  year: 2001
  ident: 1223_CR4
  publication-title: Biometrics
  doi: 10.1111/j.0006-341X.2001.01048.x
– volume: 59
  start-page: 947
  year: 2003
  ident: 1223_CR7
  publication-title: Biometrics
  doi: 10.1111/j.0006-341X.2003.00109.x
– volume: 5
  start-page: 1780
  issue: 3
  year: 2011
  ident: 1223_CR18
  publication-title: Ann Appl Stat
  doi: 10.1214/11-AOAS455
– volume: 42
  start-page: 845
  year: 1986
  ident: 1223_CR3
  publication-title: Biometrics
  doi: 10.2307/2530698
– volume: 99
  start-page: 1140
  year: 2004
  ident: 1223_CR10
  publication-title: J Am Stat Assoc
  doi: 10.1198/016214504000000566
– volume: 60
  start-page: 812
  issue: 3
  year: 2004
  ident: 1223_CR16
  publication-title: Biometrics
  doi: 10.1111/j.0006-341X.2004.00233.x
– ident: 1223_CR34
– volume: 4
  start-page: 85
  issue: 1
  year: 2009
  ident: 1223_CR24
  publication-title: Bayesian Anal
  doi: 10.1214/09-BA403
– volume: 28
  start-page: 1
  issue: 4
  year: 2017
  ident: 1223_CR27
  publication-title: Stat Methods Med Res
– ident: 1223_CR29
  doi: 10.1201/b14832
– volume: 105
  start-page: 1202
  issue: 491
  year: 2010
  ident: 1223_CR22
  publication-title: J Am Stat Assoc
  doi: 10.1198/jasa.2010.tm08177
– volume: 22
  start-page: 2262
  issue: 18
  year: 2006
  ident: 1223_CR17
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btl362
– volume: 9
  start-page: 1067
  issue: 5
  year: 2018
  ident: 1223_CR41
  publication-title: J Diabetes Inv
  doi: 10.1111/jdi.12805
– volume: 16
  start-page: 215
  year: 2010
  ident: 1223_CR6
  publication-title: Lifetime Data Anal
  doi: 10.1007/s10985-010-9154-0
– volume: 58
  start-page: 378
  issue: 6
  year: 2013
  ident: 1223_CR38
  publication-title: J Hum Genet
  doi: 10.1038/jhg.2013.21
– volume: 83
  start-page: 1023
  issue: 404
  year: 1988
  ident: 1223_CR13
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.1988.10478694
– volume: 90
  start-page: 171
  year: 2003
  ident: 1223_CR5
  publication-title: Biometrika
  doi: 10.1093/biomet/90.1.171
– volume: 21
  start-page: 208
  year: 2000
  ident: 1223_CR9
  publication-title: Control Clin Trials
  doi: 10.1016/S0197-2456(00)00053-2
– volume: 19
  start-page: 90
  issue: 1
  year: 2003
  ident: 1223_CR15
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/19.1.90
– volume: 8
  start-page: e58655
  issue: 3
  year: 2013
  ident: 1223_CR35
  publication-title: Plos ONE
  doi: 10.1371/journal.pone.0058655
– volume: 39
  start-page: 188
  year: 2015
  ident: 1223_CR36
  publication-title: Diabetes Metab J
  doi: 10.4093/dmj.2015.39.3.188
– ident: 1223_CR37
– volume: 103
  start-page: 534
  issue: 482
  year: 2008
  ident: 1223_CR21
  publication-title: J Am Stat Assoc
  doi: 10.1198/016214507000000554
– volume: 73
  start-page: 603
  year: 2017
  ident: 1223_CR28
  publication-title: Biometrics
  doi: 10.1111/biom.12620
– volume: Supplement 1
  start-page: 513
  year: 2018
  ident: 1223_CR39
  publication-title: Diabetes
  doi: 10.2337/db18-513-P
SSID ssj0017835
Score 2.2312703
Snippet The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes are...
Background The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported outcomes...
Abstract Background The onset of silent diseases such as type 2 diabetes is often registered through self-report in large prospective cohorts. Self-reported...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage 212
SubjectTerms African Americans
Algorithms
Bayes Theorem
Bayesian analysis
Bayesian variable selection
Clinical trials
Computer simulation
Datasets
Diabetes
Diabetes mellitus
Diabetes mellitus (non-insulin dependent)
Diabetes Mellitus, Type 2 - diagnosis
Diabetes Mellitus, Type 2 - epidemiology
Diabetes Mellitus, Type 2 - genetics
Diagnostic systems
Diagnostic tests
Feature selection
Female
Generalized linear models
Health informatics
Health risks
High dimensional data
Hispanic American women
Hispanic Americans
Human papillomavirus
Humans
Laboratories
Medical errors
Minority & ethnic groups
Nucleotides
Patient Reported Outcome Measures
Pediatrics
Polymorphism, Single Nucleotide
Prospective Studies
Random variables
Self Report
Self-reports
Single nucleotide polymorphisms
Single-nucleotide polymorphism
Source code
Type 2 diabetes
Women's studies
Womens health
SummonAdditionalLinks – databaseName: Health & Medical Collection
  dbid: 7X7
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfR3bitQwNOgK4ot4t7pKBMEHCds0SdM-ya64LML65Mq8hdyqwtKO0xkX_95z0kzdIuzr5GRoziXnknMh5G1TOmHrjjOca8tk1TnmpLZMtrKubJA8RoxDnn-pzy7k55Va5YDbmNMq93diuqjD4DFGfgRqCHQP6r8P618Mp0bh62oeoXGb3MHWZZjSpVezw8UxqrEvlGnqoxF0GwYEK0zFArXIrhbKKPXs__9mvqaalmmT1_TQ6QNyPxuQ9Hii-ENyK_aPyN3z_ET-mHw7sX8iVkbS3-AGY2EUHdOsGyAABQuVYoNiGrCp_9SQg643uBmn7lDbB4Tu2PSUEAMddlvATxyfkIvTT18_nrE8PIF5VYstC6JDc0xr8E989J0V2jatRY-AY-ldBXpbydBI7qvQudLG0nlhPY_gA4EQi6fkoB_6-JxQx11tcU8HxIPb07VB6coqB7ZU5WtXEL7HovG5szgOuLg0ycNoajNh3gDmTcK8uSrI-3nPeuqrcSP0CRJnhsSe2OmHYfPdZBEztip1iEIo5710WjmlYsmBRdvG6Wh5Qd4haQ1KLnyet7kAAQ6JPbDMMdhe6M01bUEOF5AgcX65vGcOkyV-NP_4syBv5mXciVlsfRx2CQb-QfJGF-TZxEvzkQR4vhLMiILoBZctzrxc6X_-SP3ANTqdrXpx82e9JPeqxPpYtHZIDrabXXwFBtXWvU5S8xeTyB_E
  priority: 102
  providerName: ProQuest
– databaseName: Scholars Portal Journals: Open Access
  dbid: M48
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3Nb9UwDLfGkBAXxDcdAwUJiQMKNG3StAeENsQ0IT1OPLRblK8C0tQ33gfb_nvstH2sYuLEtbGrxrHrn5PYBnhZ5660VSs49bXlsmgdd1JbLhtZFTZIESPtQ84-V8dz-elEnezA2O5oEODq2tCO-knNl6dvLn5evkeDf5cMvq7ertBn0UZfQVes0N3x8xtwEz2TJkOdyT-nCrTLMSbOXMs3cU6phv_ff-orrmp6jfKKXzq6C3cGQMkOeg24Bzuxuw-3ZsOR-QP4emgvI2VKsl8YFlOiFFul3je4IAwRK6OCxSxQkf--QAc7WxIzdeFhtgtE3fL-aCEGttisUUfj6iHMjz5--XDMh2YK3KuqXPNQtgTPtMZ4xUff2lLburEUIQhKxSvQjysZail8EVqX25g7X1ovIsZEaNTlI9jtFl18AswJV1niaXEx8W_qmqB0YZVDbFX4ymUgRikaP1Qap4YXpyZFHHVleskblLxJkjfnGbze8pz1dTb-SX1Ii7OlpBrZ6cFi-c0MJmdskesQy1I576XTyikVc4Eq29RORysyeEVLa0i38PO8HRIScJJUE8scIBaj6K5uMtifUKIF-unwqBxmVGCDsAmxEuG1DF5sh4mTbrV1cbFJNPgGKWqdweNel7ZTKjESlggrMtATLZvMeTrS_fie6oNrCkIbtfc_hPQUbhfJQCjVbR9218tNfIYwbO2eJ9v6DXGsMLk
  priority: 102
  providerName: Scholars Portal
Title Bayesian variable selection for high dimensional predictors and self-reported outcomes
URI https://www.ncbi.nlm.nih.gov/pubmed/32894123
https://www.proquest.com/docview/2444067415
https://www.proquest.com/docview/2440664187
https://pubmed.ncbi.nlm.nih.gov/PMC7487595
https://doaj.org/article/a207de335bcc4b75b55e01ecc98b7ea1
Volume 20
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3di9QwEB_0BPFF_LbnuUQQfJBwbZM06eOt3HoIe8jhyeJLSNIUBeket7se9987k3aXLYK--JKHZrLbTGY6v0kyMwBvTe6Fq9qCU11bLsvWcy-147KWVekaWcRI-5Dz8-rsUn5aqMVeqS-6E9anB-4Zd-zKXDdRCOVDkF4rr1TMC_zj2ngdXXJ80OZtnanh_ID2M7YhMqY6XqFVo63Aki5hoUHkNyMzlLL1__lN3jNK4wuTexZo9ggeDtCRnfSv_BjuxO4J3J8Ph-NP4evU3UaKiWS_0AGmkCi2SlVukPUMsSmj1MSsoXT-fSoOdnVNg6neDnNdQ9Qt7w8RYsOWmzVKY1w9g8vZ6ZcPZ3wom8CDqsSaN6IlIKY1eiYhhtYJ7UztyBcoKOiuRIutZGNkEcqm9bmLuQ_ChSKi94PqK57DQbfs4ktgvvCVozEtLht-N33dKF065RFFlaHyGRRbLtow5BSn0hY_bfItTGV7zlvkvE2ctzcZvN-NueozavyVekqLs6OkbNjpAcqIHWTE_ktGMnhHS2tJZ_H1ghtCD3CSlP3KniDqIj_O1BkcjShR18K4eyscdtD1lUWAhKiIkFkGb3bdNJLur3VxuUk0-AuyMDqDF70s7aYk0OeVCCAy0CMpG8153NP9-J4ygWtyN2t1-D-Y9AoelElBKKjtCA7W15v4GgHX2k_grl5obM3s4wTuTU_PP19Mkr5hO5cG24vpt98_0S67
linkProvider Directory of Open Access Journals
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB6VIgEXxJtAASOBOKCosePEyQGhFqi2tNtTi_ZmbMcBJJQsm11W_VP8RmbyWBoh9dZrPLZie2a-GdszA_Aqi2xs0pKHVNc2lKK0oZXKhDKXqTCF5N7TOeT0JJ2cyc-zZLYFf4ZYGHpWOejEVlEXtaMz8l2EIcQewr_3818hVY2i29WhhEbHFkf-fI0uW_Pu8CPu72shDj6dfpiEfVWB0CVpvAyLuCQ7RSk03J13pYmVyXJDpjKnmDSBgJbIIpPciaK0kfGRdbFx3KNzgNwd47jX4DoCb0QSpWYbB4_TKcoQmJOluw1iKR1ACnr6hTAcrkfg19YI-B8JLkDh-JnmBdw7uAO3e4OV7XUcdhe2fHUPbkz7K_n78GXfnHuKxGS_0e2mQCzWtLV1cMMZWsSMEiKzgooIdAlA2HxBnanKDzNVQdRl2F1d-ILVqyXuh28ewNmVLOtD2K7qyj8GZrlNDfUpkVlQW9u8SJQwiUXbTbjUBsCHVdSuz2ROBTV-6tajyVLdrbzGldftyut1AG83feZdHo9LqfdpczaUlIO7_VAvvulepLURkSp8HCfWOWlVYpPERxxFIs-s8oYH8Ia2VpOmwN9zpg94wElSzi29h7YeeY9ZHsDOiBIl3I2bB-bQvYZp9D95CODlppl60qu5yterlgZHkDxTATzqeGkzpRg9bYlmSwBqxGWjOY9bqh_f2_zjipzcPHly-W-9gJuT0-mxPj48OXoKt0QrBhQwtwPby8XKP0NjbmmftxLE4OtVi-xfJaVcSw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bayesian+variable+selection+for+high+dimensional+predictors+and+self-reported+outcomes&rft.jtitle=BMC+medical+informatics+and+decision+making&rft.au=Xiangdong+Gu&rft.au=Mahlet+G+Tadesse&rft.au=Andrea+S+Foulkes&rft.au=Yunsheng+Ma&rft.date=2020-09-07&rft.pub=BMC&rft.eissn=1472-6947&rft.volume=20&rft.issue=1&rft.spage=1&rft.epage=11&rft_id=info:doi/10.1186%2Fs12911-020-01223-w&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_a207de335bcc4b75b55e01ecc98b7ea1
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1472-6947&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1472-6947&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1472-6947&client=summon