sEBM: Scaling Event Based Models to Predict Disease Progression via Implicit Biomarker Selection and Clustering
The Event Based Model (EBM) is a probabilistic generative model to explore biomarker changes occurring as a disease progresses. Disease progression is hypothesized to occur through a sequence of biomarker dysregulation "events". The EBM estimates the biomarker dysregulation event sequence....
Published in | Information processing in medical imaging : proceedings of the ... conference Vol. 13939; p. 208 |
---|---|
Main Authors | Tandon, Raghav; Kirkpatrick, Anna; Mitchell, Cassie S |
Format | Journal Article |
Language | English |
Published | Germany, 01.01.2023 |
Subjects | |
ISSN | 1011-2499 |
DOI | 10.1007/978-3-031-34048-2_17 |
Abstract | The Event Based Model (EBM) is a probabilistic generative model for exploring the biomarker changes that occur as a disease progresses. Disease progression is hypothesized to occur through a sequence of biomarker dysregulation "events", and the EBM estimates this event sequence. It computes the data likelihood for a given dysregulation sequence and then evaluates the posterior distribution over sequences. Because the posterior distribution is intractable, Markov Chain Monte Carlo (MCMC) is used to draw samples from it. However, the set of possible sequences grows as N!, where N is the number of biomarkers (the data dimension), and quickly becomes too large for effective sampling via MCMC. This work proposes the "scaled EBM" (sEBM) to enable event based modeling on large biomarker sets (e.g. high-dimensional data). First, sEBM implicitly selects a subset of biomarkers useful for modeling disease progression and infers the event sequence only for that subset. Second, sEBM clusters biomarkers with similar positions in the event sequence and orders only the "clusters", with each successive cluster corresponding to the next stage in disease progression. These two modifications provably reduce the space of possible event sequences by multiple orders of magnitude. The modifications are supported by theory, and experiments on synthetic and real clinical data validate that sEBM works in higher-dimensional settings. Results on synthetic data with known ground truth show that sEBM outperforms previous EBM variants as the data dimension increases. sEBM was successfully implemented with up to 300 biomarkers, a 6-fold increase over previous EBM applications. A real-world clinical application of sEBM uses 119 neuroimaging markers from publicly available Alzheimer's Disease Neuroimaging Initiative (ADNI) data to stratify subjects into 6 stages of disease progression. Subjects included cognitively normal (CN), mild cognitive impairment (MCI), and Alzheimer's Disease (AD) groups. sEBM stage is differentiated for the 3 groups. Increased sEBM stage is a strong predictor of conversion risk to AD for MCI subjects, as verified with a Cox proportional-hazards model adjusted for age, sex, education and APOE4 status. Like EBM, sEBM does not rely on a priori defined diagnostic labels and uses only cross-sectional data. |
---|---|
Author | Tandon, Raghav; Mitchell, Cassie S; Kirkpatrick, Anna |
Author_xml | – sequence: 1 givenname: Raghav orcidid: 0000-0003-2603-4930 surname: Tandon fullname: Tandon, Raghav organization: Center for Machine Learning, Georgia Institute of Technology, Atlanta, GA 30332, USA – sequence: 2 givenname: Anna orcidid: 0000-0002-8737-1132 surname: Kirkpatrick fullname: Kirkpatrick, Anna organization: School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA – sequence: 3 givenname: Cassie S orcidid: 0000-0002-5472-6355 surname: Mitchell fullname: Mitchell, Cassie S organization: Center for Machine Learning, Georgia Institute of Technology, Atlanta, GA 30332, USA |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/38680427$$D View this record in MEDLINE/PubMed |
ContentType | Journal Article |
DBID | NPM |
DOI | 10.1007/978-3-031-34048-2_17 |
DatabaseName | PubMed |
DatabaseTitle | PubMed |
DatabaseTitleList | PubMed |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database |
DeliveryMethod | no_fulltext_linktorsrc |
ExternalDocumentID | 38680427 |
Genre | Journal Article |
GrantInformation_xml | – fundername: NIA NIH HHS grantid: R56 AG056169 – fundername: NIA NIH HHS grantid: R01 AG070937 – fundername: NIA NIH HHS grantid: R01 AG056169 – fundername: NIA NIH HHS grantid: U19 AG065169 |
GroupedDBID | --- F5P NPM |
ID | FETCH-LOGICAL-b478t-ded7cac089719bab955d38c3cc66b6c118f696f5442dc9221556c7b499e4a18b2 |
ISSN | 1011-2499 |
IngestDate | Thu Apr 03 06:55:54 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Keywords | biomarker clustering; bayesian learning; disease progression modeling; prognostic biomarker selection |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-b478t-ded7cac089719bab955d38c3cc66b6c118f696f5442dc9221556c7b499e4a18b2 |
ORCID | 0000-0003-2603-4930 0000-0002-8737-1132 0000-0002-5472-6355 |
OpenAccessLink | https://www.ncbi.nlm.nih.gov/pmc/articles/11056195 |
PMID | 38680427 |
ParticipantIDs | pubmed_primary_38680427 |
PublicationCentury | 2000 |
PublicationDate | 2023-01-01 |
PublicationDateYYYYMMDD | 2023-01-01 |
PublicationDate_xml | – month: 01 year: 2023 text: 2023-01-01 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | Germany |
PublicationPlace_xml | – name: Germany |
PublicationTitle | Information processing in medical imaging : proceedings of the ... conference |
PublicationTitleAlternate | Inf Process Med Imaging |
PublicationYear | 2023 |
SSID | ssib004238817 ssib006573086 |
Score | 2.1636026 |
SourceID | pubmed |
SourceType | Index Database |
StartPage | 208 |
Title | sEBM: Scaling Event Based Models to Predict Disease Progression via Implicit Biomarker Selection and Clustering |
URI | https://www.ncbi.nlm.nih.gov/pubmed/38680427 |
Volume | 13939 |
hasFullText | |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
linkProvider | National Library of Medicine |
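
The abstract's scaling argument can be illustrated with a small counting sketch. It compares the N! orderings of N individual biomarkers with the number of ways to place N biomarkers into a fixed number of ordered, non-empty clusters. This is only an illustrative assumption about how an ordered clustering might be counted; the record does not give sEBM's actual search space or its bound, and the function names below are hypothetical.

```python
from math import comb, factorial


def n_full_orderings(n_biomarkers: int) -> int:
    """Number of event sequences when every biomarker is ordered individually (N!)."""
    return factorial(n_biomarkers)


def n_cluster_orderings(n_biomarkers: int, n_clusters: int) -> int:
    """Number of ways to assign biomarkers to ordered, non-empty clusters.

    Assumption for illustration only: counted as surjections from biomarkers
    onto cluster positions, using inclusion-exclusion.
    """
    return sum(
        (-1) ** j * comb(n_clusters, j) * (n_clusters - j) ** n_biomarkers
        for j in range(n_clusters + 1)
    )


def order_of_magnitude(x: int) -> int:
    """Base-10 order of magnitude of a positive integer."""
    return len(str(x)) - 1


if __name__ == "__main__":
    # 119 markers and 6 stages are the figures quoted in the abstract.
    for n in (10, 50, 119):
        full = order_of_magnitude(n_full_orderings(n))
        clustered = order_of_magnitude(n_cluster_orderings(n, 6))
        print(f"N={n}: full ~1e{full}, 6 ordered clusters ~1e{clustered}")
```

Under this counting, the gap between the two quantities widens as N grows, which is consistent with the abstract's claim of a reduction by multiple orders of magnitude. Selecting a subset of biomarkers first would shrink both counts further.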