Early-Stage Event Prediction for Longitudinal Data
Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training dat...
Saved in:
Published in | Advances in Knowledge Discovery and Data Mining pp. 139 - 151 |
---|---|
Main Authors | , , |
Format | Book Chapter |
Language | English |
Published |
Cham
Springer International Publishing
2016
|
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training data in such longitudinal studies must be obtained only by waiting for the occurrence of sufficient number of events. The main objective of this work is to predict the event occurrence in the future for a particular subject in the study using the data collected at the initial stages of a longitudinal study. In this paper, we propose a novel Early Stage Prediction (ESP) framework for building event prediction models which are trained at early stages of longitudinal studies. More specifically, we develop two probabilistic algorithms based on Naive Bayes and Tree-Augmented Naive Bayes (TAN), called ESP-NB and ESP-TAN, respectively, for early stage event prediction by modifying the posterior probability of event occurrence using different extrapolations that are based on Weibull and Lognormal distributions. The proposed framework is evaluated using a wide range of synthetic and real-world benchmark datasets. Our extensive set of experiments show that the proposed ESP framework is able to more accurately predict future event occurrences using only a limited amount of training data compared to the other alternative approaches. |
---|---|
AbstractList | Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training data in such longitudinal studies must be obtained only by waiting for the occurrence of sufficient number of events. The main objective of this work is to predict the event occurrence in the future for a particular subject in the study using the data collected at the initial stages of a longitudinal study. In this paper, we propose a novel Early Stage Prediction (ESP) framework for building event prediction models which are trained at early stages of longitudinal studies. More specifically, we develop two probabilistic algorithms based on Naive Bayes and Tree-Augmented Naive Bayes (TAN), called ESP-NB and ESP-TAN, respectively, for early stage event prediction by modifying the posterior probability of event occurrence using different extrapolations that are based on Weibull and Lognormal distributions. The proposed framework is evaluated using a wide range of synthetic and real-world benchmark datasets. Our extensive set of experiments show that the proposed ESP framework is able to more accurately predict future event occurrences using only a limited amount of training data compared to the other alternative approaches. |
Author | Chawla, Sanjay Fard, Mahtab J. Reddy, Chandan K. |
Author_xml | – sequence: 1 givenname: Mahtab J. surname: Fard fullname: Fard, Mahtab J. email: mahtab.jahanbanifard@wayne.edu organization: Computer Science Department, Wayne State University, Detroit, USA – sequence: 2 givenname: Sanjay surname: Chawla fullname: Chawla, Sanjay email: schawla@qf.org.qa, sanjay.chawla@sydney.edu.au organization: University of Sydney, Sydney, Australia – sequence: 3 givenname: Chandan K. surname: Reddy fullname: Reddy, Chandan K. email: reddy@cs.wayne.edu organization: Computer Science Department, Wayne State University, Detroit, USA |
BookMark | eNpFkN1KxDAQhaOu4O66b-BFXyA6mWmS5lLW-gMLCu59SJq0VEsqbRV8e7ur4MXhHM6BgflWbJH6FBm7EnAtAPSN0QUnTsLM0nJOVuAJW9HcHAtzypZCCcGJcnP2PyAs2BIIkBud0wXbjOMbAAilpAGzZFi6ofvmr5NrYlZ-xTRlL0MMbTW1fcrqfsh2fWra6TO0yXXZnZvcJTuvXTfGzZ-v2f6-3G8f-e754Wl7u-MNSjFxI8DpIHWoCqURvVcehK5y7zEodEFKjaQKolDHqiqiKGqfS-UUSkRQtGb4e3b8GNrUxMH6vn8frQB7IGJnIpbs_KY9ArAHIvQDIbtPpw |
ContentType | Book Chapter |
Copyright | Springer International Publishing Switzerland 2016 |
Copyright_xml | – notice: Springer International Publishing Switzerland 2016 |
DOI | 10.1007/978-3-319-31753-3_12 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 3319317539 9783319317533 |
EISSN | 1611-3349 |
Editor | Wang, Ruili Dobbie, Gill Washio, Takashi Khan, Latifur Bailey, James Huang, Joshua Zhexue |
Editor_xml | – sequence: 1 givenname: James surname: Bailey fullname: Bailey, James email: baileyj@unimelb.edu.au – sequence: 2 givenname: Latifur surname: Khan fullname: Khan, Latifur email: lkhan@utdallas.edu – sequence: 3 givenname: Takashi surname: Washio fullname: Washio, Takashi email: washio@ar.sanken.osaka-u.ac.jp – sequence: 4 givenname: Gill surname: Dobbie fullname: Dobbie, Gill email: g.dobbie@auckland.ac.nz – sequence: 5 givenname: Joshua Zhexue surname: Huang fullname: Huang, Joshua Zhexue email: zx.huang@szu.edu.cn – sequence: 6 givenname: Ruili surname: Wang fullname: Wang, Ruili email: r.wang@massey.ac.nz |
EndPage | 151 |
GroupedDBID | -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE ALMA_UNASSIGNED_HOLDINGS EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-LOGICAL-g251t-910a7d57dc86722bb6b017c4bb2d62ad557236833dfecc8e18fb456a62522063 |
ISBN | 3319317520 9783319317526 |
ISSN | 0302-9743 |
IngestDate | Wed Nov 06 06:29:29 EST 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-g251t-910a7d57dc86722bb6b017c4bb2d62ad557236833dfecc8e18fb456a62522063 |
PageCount | 13 |
ParticipantIDs | springer_books_10_1007_978_3_319_31753_3_12 |
PublicationCentury | 2000 |
PublicationDate | 2016 |
PublicationDateYYYYMMDD | 2016-01-01 |
PublicationDate_xml | – year: 2016 text: 2016 |
PublicationDecade | 2010 |
PublicationPlace | Cham |
PublicationPlace_xml | – name: Cham |
PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | 20th Pacific-Asia Conference, PAKDD 2016, Auckland, New Zealand, April 19-22, 2016, Proceedings, Part I |
PublicationTitle | Advances in Knowledge Discovery and Data Mining |
PublicationYear | 2016 |
Publisher | Springer International Publishing |
Publisher_xml | – name: Springer International Publishing |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, United Kingdom – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, United Kingdom – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: CNB H 104.2, ETH Zürich, Zürich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology Madr, Chennai, India – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: Fakultät Informatik, TU Dortmund, Dortmund, Germany – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: Los Angeles, USA – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max Planck Institute for Informatic, Saarbrücken, Germany |
SSID | ssj0001665909 ssj0002792 |
Score | 2.2810807 |
Snippet | Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard... |
SourceID | springer |
SourceType | Publisher |
StartPage | 139 |
SubjectTerms | Longitudinal data Prediction Regression Survival analysis |
Title | Early-Stage Event Prediction for Longitudinal Data |
URI | http://link.springer.com/10.1007/978-3-319-31753-3_12 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELZKWRADb_FWBrYqVeokTjIwIFpUlZYBFcQW2bHLQ6iVaCoEv5672G7SlgWWKIqqOr7POZ_v8R0hF76kjNMRdYXy4YCSBMKNmS_dKAth92Ac1gxGdAd3rPsQ9J7Cp1rtq5K1NMtFM_v-ta7kP6jCM8AVq2T_gOz8T-EB3AO-cAWE4bpk_C66WXV6sY7eF_mst9YzhmyaGWZlalqlNs95Y1D0gJgDxXUy-4C_5Fw0es1KgP_znWsv8fitzK25V9Jk6KKTHdTBbbO6zgqGZBdsVhi8g8mTmNUhX7N5DmN_gi2RZrJov9XWpXB5IR81veybEMbdJNczsV0mrNKpeiVay14J65Vc8muWrrWFY6wPegDtGF07b8u5QFXDYUdrP6W1M0PORV9znBqN29JcSGbzbmn22pV9oZoKgmVbOBrcpdifei1KQDWuX3V6_cfSPcdYmCCzmdnUkWdRB6T0W2GZkH1rQx1WzqJSovnbkCtB98KWGW6TTaxvcbDwBIS2Q2pqvEu2rNwdI_c9QivIOgWyTomsA8g6VWQdRHafDG86w-uuazpsuM9g1-aw03k8kmEks5hFlArBBGjoLBCCSka5DMOI-gxmJkfwqceqFY8EWNwcDs2UgnF7QOrjyVgdEkd4MXbHEMhrGXDlJUHMPZpFgadEkPnJEWnYOaf4yUxTy5cNEkr9FCSUFhJKUULHf_r1Cdkol-ApqecfM3UGpmIuzg2sPyX3XHk |
link.rule.ids | 782,783,787,796,27937 |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Advances+in+Knowledge+Discovery+and+Data+Mining&rft.au=Fard%2C+Mahtab+J.&rft.au=Chawla%2C+Sanjay&rft.au=Reddy%2C+Chandan+K.&rft.atitle=Early-Stage+Event+Prediction+for+Longitudinal+Data&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2016-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319317526&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=139&rft.epage=151&rft_id=info:doi/10.1007%2F978-3-319-31753-3_12 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon |