An efficient and accurate frailty model approach for genome-wide survival association analysis controlling for population structure and relatedness in large-scale biobanks

Abstract With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated w...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Dey, Rounak, Zhou, Wei, Kiiskinen, Tuomo, Havulinna, Aki, Elliott, Amanda, Karjalainen, Juha, Kurki, Mitja, Qin, Ashley, Finngen, Lee, Seunggeun, Palotie, Aarno, Neale, Benjamin, Daly, Mark, Lin, Xihong
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 01.11.2020
Cold Spring Harbor Laboratory
Edition1.1
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Abstract With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated with ages of onset, disease progression and lifespan. We developed an efficient and accurate frailty (random effects) model approach for genome-wide survival association analysis of censored time-to-event (TTE) phenotypes in large biobanks by accounting for both population structure and relatedness. Our method utilizes state-of-the-art optimization strategies to reduce the computational cost. The saddlepoint approximation is used to allow for analysis of heavily censored phenotypes (>90%) and low frequency variants (down to minor allele count 20). We demonstrated the performance of our method through extensive simulation studies and analysis of five TTE phenotypes, including lifespan, with heavy censoring rates (90.9% to 99.8%) on ~400,000 UK Biobank participants with white British ancestry and ~180,000 samples in FinnGen, respectively. We further performed genome-wide association analysis for 871 TTE phenotypes in UK Biobank and presented the genome-wide scale phenome-wide association (PheWAS) results with the PheWeb browser. Competing Interest Statement B.M.N. is on the scientific advisory board of Deep Genomics, and is a consultant for CAMP4 Therapeutics, Takeda and Biogen. X.L. is a consultant to AbbVie Pharmaceuticals and Verily Life Sciences. M.J.D. is a founder of Maze Therapeutics and on the scientific advisory board of BC Platforms.
AbstractList Abstract With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated with ages of onset, disease progression and lifespan. We developed an efficient and accurate frailty (random effects) model approach for genome-wide survival association analysis of censored time-to-event (TTE) phenotypes in large biobanks by accounting for both population structure and relatedness. Our method utilizes state-of-the-art optimization strategies to reduce the computational cost. The saddlepoint approximation is used to allow for analysis of heavily censored phenotypes (>90%) and low frequency variants (down to minor allele count 20). We demonstrated the performance of our method through extensive simulation studies and analysis of five TTE phenotypes, including lifespan, with heavy censoring rates (90.9% to 99.8%) on ~400,000 UK Biobank participants with white British ancestry and ~180,000 samples in FinnGen, respectively. We further performed genome-wide association analysis for 871 TTE phenotypes in UK Biobank and presented the genome-wide scale phenome-wide association (PheWAS) results with the PheWeb browser. Competing Interest Statement B.M.N. is on the scientific advisory board of Deep Genomics, and is a consultant for CAMP4 Therapeutics, Takeda and Biogen. X.L. is a consultant to AbbVie Pharmaceuticals and Verily Life Sciences. M.J.D. is a founder of Maze Therapeutics and on the scientific advisory board of BC Platforms.
With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated with ages of onset, disease progression and lifespan. We developed an efficient and accurate frailty (random effects) model approach for genome-wide survival association analysis of censored time-to-event (TTE) phenotypes in large biobanks by accounting for both population structure and relatedness. Our method utilizes state-of-the-art optimization strategies to reduce the computational cost. The saddlepoint approximation is used to allow for analysis of heavily censored phenotypes (>90%) and low frequency variants (down to minor allele count 20). We demonstrated the performance of our method through extensive simulation studies and analysis of five TTE phenotypes, including lifespan, with heavy censoring rates (90.9% to 99.8%) on ~400,000 UK Biobank participants with white British ancestry and ~180,000 samples in FinnGen, respectively. We further performed genome-wide association analysis for 871 TTE phenotypes in UK Biobank and presented the genome-wide scale phenome-wide association (PheWAS) results with the PheWeb browser.
Author Dey, Rounak
Kurki, Mitja
Kiiskinen, Tuomo
Elliott, Amanda
Zhou, Wei
Palotie, Aarno
Finngen
Neale, Benjamin
Havulinna, Aki
Daly, Mark
Karjalainen, Juha
Qin, Ashley
Lin, Xihong
Lee, Seunggeun
Author_xml – sequence: 1
  givenname: Rounak
  surname: Dey
  fullname: Dey, Rounak
– sequence: 2
  givenname: Wei
  surname: Zhou
  fullname: Zhou, Wei
– sequence: 3
  givenname: Tuomo
  surname: Kiiskinen
  fullname: Kiiskinen, Tuomo
– sequence: 4
  givenname: Aki
  surname: Havulinna
  fullname: Havulinna, Aki
– sequence: 5
  givenname: Amanda
  surname: Elliott
  fullname: Elliott, Amanda
– sequence: 6
  givenname: Juha
  surname: Karjalainen
  fullname: Karjalainen, Juha
– sequence: 7
  givenname: Mitja
  surname: Kurki
  fullname: Kurki, Mitja
– sequence: 8
  givenname: Ashley
  surname: Qin
  fullname: Qin, Ashley
– sequence: 9
  fullname: Finngen
– sequence: 10
  givenname: Seunggeun
  surname: Lee
  fullname: Lee, Seunggeun
– sequence: 11
  givenname: Aarno
  surname: Palotie
  fullname: Palotie, Aarno
– sequence: 12
  givenname: Benjamin
  surname: Neale
  fullname: Neale, Benjamin
– sequence: 13
  givenname: Mark
  surname: Daly
  fullname: Daly, Mark
– sequence: 14
  givenname: Xihong
  surname: Lin
  fullname: Lin, Xihong
BookMark eNpNkMtOHDEQRa2ISDzCB7CzlE02PfjVPd1LhAJBQmID61bZLg8mHrtjd0-Yb-In4zAsWNVV1am6qntKjmKKSMgFZyvOGb8UTFTFVpKvZNsLqb6QE9ENoukFa48-6WNyXsoLY0wMHZdrdULeriJF57zxGGcK0VIwZskwI3UZfJj3dJssBgrTlBOYZ-pSphuMaYvNX2-RliXv_A4qUUoyHmafYj0EYV98oSbFOacQfNy8b05pWsKBKXNezLxkfLfNWNtoI5ZCfaQB8gabYiAg1T5piL_LN_LVQSh4_lHPyNPNz8frX839w-3d9dV9ozlTqmn1oJUycj243rIWWyl70J21Dh1bgxCoB2aUNGBkNyitWi3rqCLMOmdRnpEfh7vVOL_63Thlv4W8H__nPHI2Sj4ecq7o9wNaw_mzYJnHl7Tk-nwZRcu6vmed6uQ_uGeE1A
ContentType Paper
Copyright 2020. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
2020, Posted by Cold Spring Harbor Laboratory
Copyright_xml – notice: 2020. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: 2020, Posted by Cold Spring Harbor Laboratory
DBID 8FE
8FH
ABUWG
AFKRA
AZQEC
BBNVY
BENPR
BHPHI
CCPQU
DWQXO
GNUQQ
HCIFZ
LK8
M7P
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
FX.
DOI 10.1101/2020.10.31.358234
DatabaseName ProQuest SciTech Collection
ProQuest Natural Science Collection
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
ProQuest Central Essentials - QC
Biological Science Collection
ProQuest Central
Natural Science Collection
ProQuest One Community College
ProQuest Central
ProQuest Central Student
SciTech Premium Collection
Biological Sciences
Biological Science Database
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
bioRxiv
DatabaseTitle Publicly Available Content Database
ProQuest Central Student
ProQuest One Academic Middle East (New)
ProQuest Biological Science Collection
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Natural Science Collection
Biological Science Database
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest One Academic UKI Edition
Natural Science Collection
ProQuest Central Korea
Biological Science Collection
ProQuest Central (New)
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList Publicly Available Content Database

Database_xml – sequence: 1
  dbid: FX.
  name: bioRxiv
  url: https://www.biorxiv.org/
  sourceTypes: Open Access Repository
– sequence: 2
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 2692-8205
Edition 1.1
ExternalDocumentID 2020.10.31.358234v1
Genre Working Paper/Pre-Print
GeographicLocations United Kingdom--UK
GeographicLocations_xml – name: United Kingdom--UK
GroupedDBID 8FE
8FH
ABUWG
AFKRA
ALMA_UNASSIGNED_HOLDINGS
AZQEC
BBNVY
BENPR
BHPHI
CCPQU
DWQXO
GNUQQ
HCIFZ
LK8
M7P
NQS
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PROAC
RHI
FX.
ID FETCH-LOGICAL-b1044-5b9b44c379f8d05e5338ab6ddfef07a22eb90c43cac3694b45b3fefab60dffde3
IEDL.DBID FX.
ISSN 2692-8205
IngestDate Tue Jan 07 19:00:28 EST 2025
Fri Jul 25 09:17:48 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
License This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-b1044-5b9b44c379f8d05e5338ab6ddfef07a22eb90c43cac3694b45b3fefab60dffde3
Notes SourceType-Working Papers-1
ObjectType-Working Paper/Pre-Print-1
content type line 50
Competing Interest Statement: B.M.N. is on the scientific advisory board of Deep Genomics, and is a consultant for CAMP4 Therapeutics, Takeda and Biogen. X.L. is a consultant to AbbVie Pharmaceuticals and Verily Life Sciences. M.J.D. is a founder of Maze Therapeutics and on the scientific advisory board of BC Platforms.
ORCID 0000-0002-0949-8752
0000-0002-6540-8280
0000-0001-7719-0859
0000-0003-1513-6077
0000-0002-4787-8959
0000-0002-8097-3878
OpenAccessLink https://www.biorxiv.org/content/10.1101/2020.10.31.358234
PQID 2506880646
PQPubID 2050091
PageCount 31
ParticipantIDs biorxiv_primary_2020_10_31_358234
proquest_journals_2506880646
PublicationCentury 2000
PublicationDate 20201101
PublicationDateYYYYMMDD 2020-11-01
PublicationDate_xml – month: 11
  year: 2020
  text: 20201101
  day: 01
PublicationDecade 2020
PublicationPlace Cold Spring Harbor
PublicationPlace_xml – name: Cold Spring Harbor
PublicationTitle bioRxiv
PublicationYear 2020
Publisher Cold Spring Harbor Laboratory Press
Cold Spring Harbor Laboratory
Publisher_xml – name: Cold Spring Harbor Laboratory Press
– name: Cold Spring Harbor Laboratory
References Staley (2020.10.31.358234v1.14) 2017; 25
Green, Symons (2020.10.31.358234v1.12) 1983; 36
McGilchrist (2020.10.31.358234v1.25) 1993; 49
Svishcheva, Axenovich, Belonogova, van Duijn, Aulchenko (2020.10.31.358234v1.17) 2012; 44
Kaplan, Meier (2020.10.31.358234v1.51) 1992
Barndorff-Nielsen (2020.10.31.358234v1.62) 1990; 52
Wu (2020.10.31.358234v1.59) 2011; 89
Gilmour, Thompson, Cullis (2020.10.31.358234v1.38) 1995; 51
Moreno-Grau (2020.10.31.358234v1.50) 2019; 15
Udler (2020.10.31.358234v1.47) 2010; 19
Bycroft (2020.10.31.358234v1.15) 2018; 562
Jiang (2020.10.31.358234v1.18) 2019; 51
Lee, Go (2020.10.31.358234v1.2) 1997; 18
Stone (2020.10.31.358234v1.48) 1997; 275
Kang (2020.10.31.358234v1.58) 2010; 42
Rovio (2020.10.31.358234v1.53) 2005; 4
Phipps (2020.10.31.358234v1.6) 2016; 37
Breslow, Clayton (2020.10.31.358234v1.37) 1993; 88
Zhou (2020.10.31.358234v1.19) 2018; 50
Smith, Nielson, Woodard, Seidenberg, Rao (2020.10.31.358234v1.55) 2013; 3
Dey (2020.10.31.358234v1.35) 2019; 43
Tsuruta, Misztal, Stranden (2020.10.31.358234v1.39) 2001; 79
Lee, Lim (2020.10.31.358234v1.10) 2019; 17
Ripatti, Palmgren (2020.10.31.358234v1.28) 2000; 56
Loh (2020.10.31.358234v1.16) 2015; 47
Klein (2020.10.31.358234v1.24) 1992; 48
Petersen, Andersen, Gill (2020.10.31.358234v1.26) 1996; 50
Nelson (2020.10.31.358234v1.44) 2017; 49
Kasza, Wraith, Lamb, Wolfe (2020.10.31.358234v1.4) 2014; 19
Daniels (2020.10.31.358234v1.36) 1954; 25
Li (2020.10.31.358234v1.56) 2020; 52
Breslow (2020.10.31.358234v1.61) 1972; 34
Dg, Bl De, Sb, Ka (2020.10.31.358234v1.3) 1995; 72
Therneau, Grambsch (2020.10.31.358234v1.60) 2000
Kuonen (2020.10.31.358234v1.63) 1999; 86
Chen (2020.10.31.358234v1.20) 2016; 98
Dey, Schmidt, Abecasis, Lee (2020.10.31.358234v1.34) 2017; 101
Bi, Fritsche, Mukherjee, Kim, Lee (2020.10.31.358234v1.11) 2020; 107
Deloukas (2020.10.31.358234v1.45) 2012; 45
Denny (2020.10.31.358234v1.40) 2013; 31
Yang, Zaitlen, Goddard, Visscher, Price (2020.10.31.358234v1.57) 2014; 46
Cox (2020.10.31.358234v1.1) 1972; 34
Wu (2020.10.31.358234v1.9) 2014; 63
Vaupel, Manton, Stallard (2020.10.31.358234v1.21) 1979; 16
Clayton, Cuzick (2020.10.31.358234v1.23) 1985; 148
Walter (2020.10.31.358234v1.42) 2015; 526
Therneau, Grambsch, Pankratz (2020.10.31.358234v1.29) 2003; 12
He (2020.10.31.358234v1.32) 2020
Ma, Blackwell, Boehnke, Scott, Go (2020.10.31.358234v1.33) 2013; 37
Therneau (2020.10.31.358234v1.30) 2019
Kulminski (2020.10.31.358234v1.8) 2016; 12
Korsgaard, Andersen (2020.10.31.358234v1.27) 1998; 25
Schuit, Feskens, Launer, Kromhout (2020.10.31.358234v1.54) 2001; 33
Hougaard (2020.10.31.358234v1.22) 1995; 1
Gagliano Taliun (2020.10.31.358234v1.43) 2020; 52
Meyer (2020.10.31.358234v1.46) 2013; 93
McCarthy (2020.10.31.358234v1.41) 2016; A3
Johnson (2020.10.31.358234v1.7) 2016; 7
Wolters (2020.10.31.358234v1.52) 2019; 14
He, Kulminski (2020.10.31.358234v1.31) 2020; 215
He (2020.10.31.358234v1.5) 2016; 6
Abecasis, Cherny, Cookson, Cardon (2020.10.31.358234v1.64) 2001; 30
Callas, Pastides, Hosmer (2020.10.31.358234v1.13) 1998; 33
Burdon (2020.10.31.358234v1.49) 2011; 43
References_xml – year: 2020
  ident: 2020.10.31.358234v1.32
  article-title: coxmeg: Cox Mixed-Effects Models for Genome-Wide Association Studies
– volume: 275
  start-page: 668
  year: 1997
  end-page: 670
  ident: 2020.10.31.358234v1.48
  article-title: Identification of a Gene That Causes Primary Open Angle Glaucoma
  publication-title: Science (American Association for the Advancement of Science)
– volume: 33
  start-page: 33
  year: 1998
  end-page: 47
  ident: 2020.10.31.358234v1.13
  article-title: Empirical comparisons of proportional hazards, Poisson, and logistic regression modeling of occupational cohort data
  publication-title: American Journal of Industrial Medicine
– volume: 45
  start-page: 25
  year: 2012
  end-page: 33
  ident: 2020.10.31.358234v1.45
  article-title: Large-scale association analysis identifies new risk loci for coronary artery disease
  publication-title: Nature genetics
– volume: 19
  start-page: 483
  year: 2014
  end-page: 492
  ident: 2020.10.31.358234v1.4
  publication-title: Survival analysis of time-to-event data in respiratory health research studies
– volume: 52
  start-page: 485
  year: 1990
  end-page: 496
  ident: 2020.10.31.358234v1.62
  article-title: Approximate Interval Probabilities
  publication-title: Journal of the Royal Statistical Society. Series B (Methodological)
– volume: 31
  start-page: 1102
  year: 2013
  end-page: 10
  ident: 2020.10.31.358234v1.40
  article-title: Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data
  publication-title: Nat Biotechnol
– volume: 42
  start-page: 348
  year: 2010
  end-page: 354
  ident: 2020.10.31.358234v1.58
  article-title: Variance component model to account for sample structure in genome-wide association studies
  publication-title: Nature genetics
– volume: 93
  start-page: 1046
  year: 2013
  end-page: 1060
  ident: 2020.10.31.358234v1.46
  article-title: Fine-Scale Mapping of the FGFR2 Breast Cancer Risk Locus: Putative Functional Variants Differentially Bind FOXA1 and E2F1
  publication-title: American journal of human genetics
– volume: 43
  start-page: 574
  year: 2011
  end-page: 578
  ident: 2020.10.31.358234v1.49
  article-title: Genome-wide association study identifies susceptibility loci for open angle glaucoma at TMCO1 and CDKN2B-AS1
  publication-title: Nature genetics
– volume: 34
  start-page: 216
  year: 1972
  end-page: 217
  ident: 2020.10.31.358234v1.61
  article-title: Discussion of the paper by D. R. Cox
  publication-title: Journal of the Royal Statistical Society. Series B (Methodological)
– volume: 101
  start-page: 37
  year: 2017
  end-page: 49
  ident: 2020.10.31.358234v1.34
  article-title: A Fast and Accurate Algorithm to Test for Binary Phenotypes and Its Application to PheWAS
  publication-title: Am J Hum Genet
– volume: 37
  start-page: 87
  year: 2016
  end-page: 95
  ident: 2020.10.31.358234v1.6
  article-title: Common genetic variation and survival after colorectal cancer diagnosis: a genome-wide analysis
  publication-title: Carcinogenesis
– volume: 49
  start-page: 221
  year: 1993
  end-page: 5
  ident: 2020.10.31.358234v1.25
  article-title: REML estimation for survival models with frailty
  publication-title: Biometrics
– volume: 49
  start-page: 1385
  year: 2017
  end-page: 1391
  ident: 2020.10.31.358234v1.44
  article-title: Association analyses based on false discovery rate implicate new loci for coronary artery disease
  publication-title: Nature Genetics
– volume: 50
  start-page: 193
  year: 1996
  end-page: 211
  ident: 2020.10.31.358234v1.26
  article-title: Variance components models for survival data
  publication-title: Statistica Neerlandica
– year: 2000
  ident: 2020.10.31.358234v1.60
  publication-title: Modeling Survival Data: Extending the Cox Model
– volume: 3
  start-page: 54
  year: 2013
  end-page: 83
  ident: 2020.10.31.358234v1.55
  article-title: Physical activity and brain function in older adults at increased risk for Alzheimer’s disease
  publication-title: Brain Sci
– volume: 86
  start-page: 929
  year: 1999
  end-page: 935
  ident: 2020.10.31.358234v1.63
  article-title: Saddlepoint Approximations for Distributions of Quadratic Forms in Normal Variables
  publication-title: Biometrika
– volume: 7
  year: 2016
  ident: 2020.10.31.358234v1.7
  article-title: Genome-wide association study identifies variation at 6q25.1 associated with survival in multiple myeloma
  publication-title: Nature Communications
– volume: 15
  start-page: 1333
  year: 2019
  end-page: 1347
  ident: 2020.10.31.358234v1.50
  article-title: Genome-wide association analysis of dementia and its clinical endophenotypes reveal novel loci associated with Alzheimer’s disease and three causality networks: The GR@ACE project
  publication-title: Alzheimers Dement
– volume: 16
  start-page: 439
  year: 1979
  end-page: 454
  ident: 2020.10.31.358234v1.21
  article-title: The impact of heterogeneity in individual frailty on the dynamics of mortality
  publication-title: Demography
– volume: 33
  start-page: 772
  year: 2001
  end-page: 7
  ident: 2020.10.31.358234v1.54
  article-title: Physical activity and cognitive decline, the role of the apolipoprotein e4 allele
  publication-title: Med Sci Sports Exerc
– volume: 50
  start-page: 1335
  year: 2018
  end-page: 1341
  ident: 2020.10.31.358234v1.19
  article-title: Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies
  publication-title: Nat Genet
– volume: 12
  start-page: e1006314
  year: 2016
  ident: 2020.10.31.358234v1.8
  article-title: Pleiotropic Associations of Allelic Variants in a 2q22 Region with Risks of Major Human Diseases and Mortality.(Research Article)(Report)
  publication-title: PLoS Genetics
– volume: 36
  start-page: 715
  year: 1983
  end-page: 723
  ident: 2020.10.31.358234v1.12
  article-title: A comparison of the logistic risk function and the proportional hazards model in prospective epidemiologic studies
  publication-title: Journal of Chronic Diseases
– volume: 88
  start-page: 9
  year: 1993
  end-page: 25
  ident: 2020.10.31.358234v1.37
  article-title: Approximate Inference in Generalized Linear Mixed Models
  publication-title: Journal of the American Statistical Association
– volume: 52
  start-page: 969
  year: 2020
  end-page: 983
  ident: 2020.10.31.358234v1.56
  article-title: Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale
  publication-title: Nature Genetics
– volume: 89
  start-page: 82
  year: 2011
  end-page: 93
  ident: 2020.10.31.358234v1.59
  article-title: Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test
  publication-title: American journal of human genetics
– volume: 6
  start-page: n/a-n/a
  year: 2016
  ident: 2020.10.31.358234v1.5
  article-title: Genome-wide time-to-event analysis on smoking progression stages in a familybased study
  publication-title: Brain and Behavior
– volume: 52
  start-page: 550
  year: 2020
  end-page: 552
  ident: 2020.10.31.358234v1.43
  article-title: Exploring and visualizing large-scale genetic associations by using PheWeb
  publication-title: Nature Genetics
– volume: 79
  start-page: 1166
  year: 2001
  end-page: 72
  ident: 2020.10.31.358234v1.39
  article-title: Use of the preconditioned conjugate gradient algorithm as a generic solver for mixed-model equations in animal breeding applications
  publication-title: J Anim Sci
– volume: 562
  start-page: 203
  year: 2018
  end-page: 209
  ident: 2020.10.31.358234v1.15
  article-title: The UK Biobank resource with deep phenotyping and genomic data
  publication-title: Nature
– volume: A3
  start-page: 1279
  year: 2016
  end-page: 1283
  ident: 2020.10.31.358234v1.41
  article-title: A reference panel of 64,976 haplotypes for genotype imputation
  publication-title: Nature Genetics
– volume: 44
  start-page: 1166
  year: 2012
  end-page: 70
  ident: 2020.10.31.358234v1.17
  article-title: Rapid variance components-based method for whole-genome association analysis
  publication-title: Nat Genet
– volume: 4
  start-page: 705
  year: 2005
  end-page: 11
  ident: 2020.10.31.358234v1.53
  article-title: Leisure-time physical activity at midlife and the risk of dementia and Alzheimer’s disease
  publication-title: Lancet Neurol
– volume: 148
  start-page: 82
  year: 1985
  end-page: 108
  ident: 2020.10.31.358234v1.23
  article-title: Multivariate Generalizations of the Proportional Hazards Model
  publication-title: Journal of the Royal Statistical Society: Series A (General)
– year: 2019
  ident: 2020.10.31.358234v1.30
  article-title: coxme: Mixed Effects Cox Models
– volume: 72
  start-page: 511
  year: 1995
  ident: 2020.10.31.358234v1.3
  article-title: Review of survival analyses published in cancer journals
  publication-title: British Journal of Cancer
– volume: 25
  start-page: 225
  year: 1998
  end-page: 269
  ident: 2020.10.31.358234v1.27
  article-title: The Additive Genetic Gamma Frailty Model
  publication-title: Scandinavian Journal of Statistics
– volume: 34
  start-page: 187
  year: 1972
  end-page: 220
  ident: 2020.10.31.358234v1.1
  article-title: Regression Models and Life-Tables
  publication-title: Journal of the Royal Statistical Society. Series B (Methodological)
– volume: 30
  start-page: 97
  year: 2001
  end-page: 101
  ident: 2020.10.31.358234v1.64
  article-title: Merlin—rapid analysis of dense genetic maps using sparse gene flow trees
  publication-title: Nature genetics
– volume: 25
  start-page: 854
  year: 2017
  end-page: 862
  ident: 2020.10.31.358234v1.14
  article-title: A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
  publication-title: European journal of human genetics: EJHG
– volume: 56
  start-page: 1016
  year: 2000
  end-page: 22
  ident: 2020.10.31.358234v1.28
  article-title: Estimation of multivariate frailty models using penalized partial likelihood
  publication-title: Biometrics
– volume: 17
  start-page: e41
  year: 2019
  end-page: e41
  ident: 2020.10.31.358234v1.10
  article-title: Review of statistical methods for survival analysis using genomic data
  publication-title: Genomics & informatics
– volume: 14
  start-page: e0219668
  year: 2019
  ident: 2020.10.31.358234v1.52
  article-title: The impact of APOE genotype on survival: Results of 38,537 participants from six population-based cohorts (E2-CHARGE)
  publication-title: PLoS ONE
– volume: 51
  start-page: 1440
  year: 1995
  end-page: 1450
  ident: 2020.10.31.358234v1.38
  article-title: Average Information REML: An Efficient Algorithm for Variance Parameter Estimation in Linear Mixed Models
  publication-title: Biometrics
– volume: 107
  start-page: 222
  year: 2020
  end-page: 233
  ident: 2020.10.31.358234v1.11
  article-title: A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank
  publication-title: Am J Hum Genet
– volume: 46
  start-page: 100
  year: 2014
  end-page: 106
  ident: 2020.10.31.358234v1.57
  article-title: Advantages and pitfalls in the application of mixed-model association methods
  publication-title: Nature genetics
– volume: 215
  start-page: 41
  year: 2020
  end-page: 58
  ident: 2020.10.31.358234v1.31
  article-title: Fast Algorithms for Conducting Large-Scale GWAS of Age-at-Onset Traits Using Cox Mixed-Effects Models
  publication-title: Genetics
– volume: 25
  start-page: 631
  year: 1954
  end-page: 650
  ident: 2020.10.31.358234v1.36
  article-title: Saddlepoint Approximations in Statistics
  publication-title: Ann. Math. Statist.
– year: 1992
  ident: 2020.10.31.358234v1.51
  publication-title: Nonparametric Estimation from Incomplete Observations
– volume: 526
  start-page: 82
  year: 2015
  end-page: 90
  ident: 2020.10.31.358234v1.42
  article-title: The UK10K project identifies rare variants in health and disease
  publication-title: Nature
– volume: 63
  start-page: 152
  year: 2014
  ident: 2020.10.31.358234v1.9
  article-title: Genome-wide association study of survival in patients with pancreatic adenocarcinoma
  publication-title: Gut
– volume: 51
  start-page: 1749
  year: 2019
  end-page: 2
  ident: 2020.10.31.358234v1.18
  article-title: A resource-efficient tool for mixed model association analysis of large-scale data
  publication-title: Nature Genetics
– volume: 12
  start-page: 156
  year: 2003
  end-page: 175
  ident: 2020.10.31.358234v1.29
  article-title: Penalized Survival Models and Frailty
  publication-title: Journal of computational and graphical statistics
– volume: 37
  start-page: 539
  year: 2013
  end-page: 50
  ident: 2020.10.31.358234v1.33
  article-title: Recommended joint and meta-analysis strategies for case-control association testing of single low-count variants
  publication-title: Genet Epidemiol
– volume: 43
  start-page: 462
  year: 2019
  end-page: 476
  ident: 2020.10.31.358234v1.35
  article-title: Robust meta-analysis of biobank-based genome-wide association studies with unbalanced binary phenotypes
  publication-title: Genet Epidemiol
– volume: 18
  start-page: 105
  year: 1997
  end-page: 34
  ident: 2020.10.31.358234v1.2
  article-title: Survival analysis in public health research
  publication-title: Annual Review of Public Health
– volume: 48
  start-page: 795
  year: 1992
  end-page: 806
  ident: 2020.10.31.358234v1.24
  article-title: Semiparametric estimation of random effects using the Cox model based on the EM algorithm
  publication-title: Biometrics
– volume: 47
  start-page: 284
  year: 2015
  end-page: 90
  ident: 2020.10.31.358234v1.16
  article-title: Efficient Bayesian mixed-model analysis increases association power in large cohorts
  publication-title: Nat Genet
– volume: 19
  start-page: 2507
  year: 2010
  end-page: 2515
  ident: 2020.10.31.358234v1.47
  article-title: Fine scale mapping of the breast cancer 16q12 locus
  publication-title: Human molecular genetics
– volume: 98
  start-page: 653
  year: 2016
  end-page: 666
  ident: 2020.10.31.358234v1.20
  article-title: Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models
  publication-title: The American Journal of Human Genetics
– volume: 1
  start-page: 255
  year: 1995
  end-page: 273
  ident: 2020.10.31.358234v1.22
  article-title: Frailty models for survival data
  publication-title: Lifetime data analysis
SSID ssj0002961374
Score 1.6079967
SecondaryResourceType preprint
Snippet Abstract With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding...
With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the...
SourceID biorxiv
proquest
SourceType Open Access Repository
Aggregation Database
SubjectTerms Association analysis
Biobanks
Computer applications
Electronic medical records
Genetic analysis
Genetic diversity
Genomes
Genomics
Life span
Phenotypes
Population structure
Survival
SummonAdditionalLinks – databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1bS8MwFA66MfDNK06nRPC12kuaNk-isjEExxAHeytJcwrF2c11U_eb_JOetNn2IPicNqU5J-eWnO8j5NqLYk-maeSAFhoTFB44wiDQasY9T6QBD5Qp6D8PeH_Ensbh2BbcSnutcm0TK0Otp6mpkd-iq-aoa5zxu9mHY1ijzOmqpdDYJU00wXHcIM2H7mD4sqmy-ALdVQXF7HOBW993Q3u0iapoEn-TvmLeemMaRg15ckvl0_l3_vnHNFf-prdPmkM5g_kB2YHikLRqwsjVEfm5LyhUoA_oK6gsNMUVWBq0B5rNZT5ZrGhFbUPXUOEUY1JqcFjfwfnKNdByibYBtYvKrVxwohqZhNqL66ZFvXpztqH3ojXQ7HIO1WerFhjQxk7SvKATc5_cKVHeQPHXlCzeymMy6nVfH_uOpVtwFOZkzAmVUIylQSSyWLshYCAYS8W1ziBzI-n7oISbsiCVKELBFAtVgEP4iKuzTENwQhrFtIBTQiMPMx3A0EthRKiAC8k95cWuzFKMN13RJld2nZNZDaqRGFkkmJEEXlLLok06awkkdl-VyVYLzv4fPid7Zsa6a7BDGrhGcIHhw0JdWh35BX5nxwQ
  priority: 102
  providerName: ProQuest
Title An efficient and accurate frailty model approach for genome-wide survival association analysis controlling for population structure and relatedness in large-scale biobanks
URI https://www.proquest.com/docview/2506880646
https://www.biorxiv.org/content/10.1101/2020.10.31.358234
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1bS8MwFA66IfjmFadzRPC1o1mytHlU2RDBMcTB3krSnEJxdqPd1P0m_6QnbTcFffC1bZImJzn5vuRcCLlmQch0HAceWGWRoEjuKReB1grJmIq55MYd6D-O5P1EPEz70x-pvpxZpUnn-Uf6Vt7jO4Nt1L7V4vaZ4-qOcSLV7DofTy52SROnlHBrcjjtbo9Xegr3qUDU95h_lkTEW7f0Sw-Xm8vwgDTHegH5IdmB7IjsVdkh18fk8yajUEZ4wF-jyPgpdnflQjvQJNfpbLmmZR4buokLThGAUhd09RW899QCLVaoCHAqUf0tBKyoCkNCayt1549ellxsc3nRKqrsKoey2dLfBaxTijTN6MwZj3sFChcods3o7KU4IZPh4Pnu3qtzK3gGCZjw-kYZIWIeqCS0fh8Q9YXaSGsTSPxA93pglB8LHmuUlxJG9A3HV_iJb5PEAj8ljWyewRmhAUNaA4izDMI_A1JpyQwLfZ3ECC591SJX9ThHiyqCRuRkESH94CyqZNEi7Y0EonoRFRGiM4nqRQp5_o8qLsi-e1b5CbZJAwcKLhEwLE2HNG8Ho_FTp5wiX0NOwRY
linkProvider Cold Spring Harbor Laboratory Press
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9wwEB7RXaH21tJWhdLiSu0xbRI7zvpQoT5AS4EVqkDiltrxRIqA7HbDFvY3Vepv7EwecEDixtmJk3i-zMuebwDeR-kosnmeBuiNpwBFy8AwA61XOopMLrV0nNA_nOjxifpxmpyuwL--FoaPVfY6sVHUfppzjvwTmWpNWNNKb89-B9w1indX-xYaLSz2cXlFIVv9ee87yfdDHO_uHH8bB11XgcBR6KGCxBmnVC5TU4x8mCD5OyPrtPcFFmFq4xidCXMlc0tvapRTiZM0RJeEvig8Spr3EQyV1GE8gOHXncnRz5usTmzIPDbUz7E2pGriMOm2Ugn6nGjgcJni5I9coMrNmlddOZ1fl3_umILGvu0-heGRneH8GaxgtQarbYPK5XP4-6US2JBMkG0StvKCVnzB7BKimNvy_HIpmlY6oqcmF-QDC-Z9vcDgqvQo6gXpIkKzsLc4oIlaJhTRHZTnkvjmztlNOzHREtsu5tg8tim5Qc96WZSVOOfz60FN-EJBn-ZsdVa_gJMHEcRLGFTTCl-BSCOKrJBcPUceqENtrI5cNAptkZN_G5p1eNetczZrSTwylkVGEZCMslYW67DZSyDr_uM6u0Xdxv3DW_B4fHx4kB3sTfZfwxOeva1Y3IQBrRe-Idfl0r3t8CLg10ND9D8nLQYq
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9wwEB7BIlBvfVCVdksHqRyzimOvsz5wqAorKAVxAGlvwY4nUsQ2rDZs6f4mbv2FHSdZeqCHXjgnceyZsf199jwAPot0JGyepxF545mgaBmZkIHWKy2EyaWWLhzon53r4yv1bTKcrMHvVSxMcKt05e38V_mzuccPDtu8-raTOxaBqwfGyVRzEGI8pRqEY-rBzBedY-UpLe-ZttUHJ4es4_0kGR9dfj2OusoCkWP6oaKhM06pXKamGPl4SIx5RtZp7wsq4tQmCTkT50rmlntrlFNDJ_kRvxL7ovAkud112GBbVqFcxHgyeDzXSQxvkKnqLlD_2WWG2t0Qn2wAza42fgkbF3ZG81ewRtVr2GzLUi7fwMOXCqlJLcEyQVt5ZDkvQk4JLOa2nN4tsSmgg6uE5MjIF0O21x8U3ZeesF7wCsQ2jPav9rmhNv8Jdu7xIRC--XL2WEQM23S2izk1v20CbciH1RjLCqfBaz2q2aoIeWjOVjf1Nlw9i-jfQq-6regdYCqYTxEDPMe405E2VgsnRrEtcka1sdmBvU7O2axN3ZEFXWTMe6TIWl3sQH-lgaybvXXGsFDzuqaVfv8fTXyCrYvDcfb95Pz0A7wIj9tYxT70WGb0kUHLndttrATh-rnN8g9hTAG2
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=An+efficient+and+accurate+frailty+model+approach+for+genome-wide+survival+association+analysis+controlling+for+population+structure+and+relatedness+in+large-scale+biobanks&rft.jtitle=bioRxiv&rft.au=Dey%2C+Rounak&rft.au=Zhou%2C+Wei&rft.au=Kiiskinen%2C+Tuomo&rft.au=Havulinna%2C+Aki&rft.date=2020-11-01&rft.pub=Cold+Spring+Harbor+Laboratory&rft.eissn=2692-8205&rft_id=info:doi/10.1101%2F2020.10.31.358234&rft.externalDocID=2020.10.31.358234v1
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2692-8205&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2692-8205&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2692-8205&client=summon