Early-Stage Event Prediction for Longitudinal Data

Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training dat...

Full description

Saved in:
Bibliographic Details
Published inAdvances in Knowledge Discovery and Data Mining pp. 139 - 151
Main Authors Fard, Mahtab J., Chawla, Sanjay, Reddy, Chandan K.
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 2016
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training data in such longitudinal studies must be obtained only by waiting for the occurrence of sufficient number of events. The main objective of this work is to predict the event occurrence in the future for a particular subject in the study using the data collected at the initial stages of a longitudinal study. In this paper, we propose a novel Early Stage Prediction (ESP) framework for building event prediction models which are trained at early stages of longitudinal studies. More specifically, we develop two probabilistic algorithms based on Naive Bayes and Tree-Augmented Naive Bayes (TAN), called ESP-NB and ESP-TAN, respectively, for early stage event prediction by modifying the posterior probability of event occurrence using different extrapolations that are based on Weibull and Lognormal distributions. The proposed framework is evaluated using a wide range of synthetic and real-world benchmark datasets. Our extensive set of experiments show that the proposed ESP framework is able to more accurately predict future event occurrences using only a limited amount of training data compared to the other alternative approaches.
AbstractList Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard classification and regression problems where a domain expert can provide the labels for the data in a reasonably short period of time, training data in such longitudinal studies must be obtained only by waiting for the occurrence of sufficient number of events. The main objective of this work is to predict the event occurrence in the future for a particular subject in the study using the data collected at the initial stages of a longitudinal study. In this paper, we propose a novel Early Stage Prediction (ESP) framework for building event prediction models which are trained at early stages of longitudinal studies. More specifically, we develop two probabilistic algorithms based on Naive Bayes and Tree-Augmented Naive Bayes (TAN), called ESP-NB and ESP-TAN, respectively, for early stage event prediction by modifying the posterior probability of event occurrence using different extrapolations that are based on Weibull and Lognormal distributions. The proposed framework is evaluated using a wide range of synthetic and real-world benchmark datasets. Our extensive set of experiments show that the proposed ESP framework is able to more accurately predict future event occurrences using only a limited amount of training data compared to the other alternative approaches.
Author Chawla, Sanjay
Fard, Mahtab J.
Reddy, Chandan K.
Author_xml – sequence: 1
  givenname: Mahtab J.
  surname: Fard
  fullname: Fard, Mahtab J.
  email: mahtab.jahanbanifard@wayne.edu
  organization: Computer Science Department, Wayne State University, Detroit, USA
– sequence: 2
  givenname: Sanjay
  surname: Chawla
  fullname: Chawla, Sanjay
  email: schawla@qf.org.qa, sanjay.chawla@sydney.edu.au
  organization: University of Sydney, Sydney, Australia
– sequence: 3
  givenname: Chandan K.
  surname: Reddy
  fullname: Reddy, Chandan K.
  email: reddy@cs.wayne.edu
  organization: Computer Science Department, Wayne State University, Detroit, USA
BookMark eNpFkN1KxDAQhaOu4O66b-BFXyA6mWmS5lLW-gMLCu59SJq0VEsqbRV8e7ur4MXhHM6BgflWbJH6FBm7EnAtAPSN0QUnTsLM0nJOVuAJW9HcHAtzypZCCcGJcnP2PyAs2BIIkBud0wXbjOMbAAilpAGzZFi6ofvmr5NrYlZ-xTRlL0MMbTW1fcrqfsh2fWra6TO0yXXZnZvcJTuvXTfGzZ-v2f6-3G8f-e754Wl7u-MNSjFxI8DpIHWoCqURvVcehK5y7zEodEFKjaQKolDHqiqiKGqfS-UUSkRQtGb4e3b8GNrUxMH6vn8frQB7IGJnIpbs_KY9ArAHIvQDIbtPpw
ContentType Book Chapter
Copyright Springer International Publishing Switzerland 2016
Copyright_xml – notice: Springer International Publishing Switzerland 2016
DOI 10.1007/978-3-319-31753-3_12
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 3319317539
9783319317533
EISSN 1611-3349
Editor Wang, Ruili
Dobbie, Gill
Washio, Takashi
Khan, Latifur
Bailey, James
Huang, Joshua Zhexue
Editor_xml – sequence: 1
  givenname: James
  surname: Bailey
  fullname: Bailey, James
  email: baileyj@unimelb.edu.au
– sequence: 2
  givenname: Latifur
  surname: Khan
  fullname: Khan, Latifur
  email: lkhan@utdallas.edu
– sequence: 3
  givenname: Takashi
  surname: Washio
  fullname: Washio, Takashi
  email: washio@ar.sanken.osaka-u.ac.jp
– sequence: 4
  givenname: Gill
  surname: Dobbie
  fullname: Dobbie, Gill
  email: g.dobbie@auckland.ac.nz
– sequence: 5
  givenname: Joshua Zhexue
  surname: Huang
  fullname: Huang, Joshua Zhexue
  email: zx.huang@szu.edu.cn
– sequence: 6
  givenname: Ruili
  surname: Wang
  fullname: Wang, Ruili
  email: r.wang@massey.ac.nz
EndPage 151
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-g251t-910a7d57dc86722bb6b017c4bb2d62ad557236833dfecc8e18fb456a62522063
ISBN 3319317520
9783319317526
ISSN 0302-9743
IngestDate Wed Nov 06 06:29:29 EST 2024
IsPeerReviewed true
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-g251t-910a7d57dc86722bb6b017c4bb2d62ad557236833dfecc8e18fb456a62522063
PageCount 13
ParticipantIDs springer_books_10_1007_978_3_319_31753_3_12
PublicationCentury 2000
PublicationDate 2016
PublicationDateYYYYMMDD 2016-01-01
PublicationDate_xml – year: 2016
  text: 2016
PublicationDecade 2010
PublicationPlace Cham
PublicationPlace_xml – name: Cham
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 20th Pacific-Asia Conference, PAKDD 2016, Auckland, New Zealand, April 19-22, 2016, Proceedings, Part I
PublicationTitle Advances in Knowledge Discovery and Data Mining
PublicationYear 2016
Publisher Springer International Publishing
Publisher_xml – name: Springer International Publishing
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Steffen, Bernhard
Pandu Rangan, C.
Kanade, Takeo
Kittler, Josef
Weikum, Gerhard
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, United Kingdom
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, United Kingdom
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: CNB H 104.2, ETH Zürich, Zürich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology Madr, Chennai, India
– sequence: 9
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: Fakultät Informatik, TU Dortmund, Dortmund, Germany
– sequence: 10
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: Los Angeles, USA
– sequence: 11
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 12
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max Planck Institute for Informatic, Saarbrücken, Germany
SSID ssj0001665909
ssj0002792
Score 2.2810807
Snippet Predicting event occurrence at an early stage in longitudinal studies is an important problem which has high practical value. As opposed to the standard...
SourceID springer
SourceType Publisher
StartPage 139
SubjectTerms Longitudinal data
Prediction
Regression
Survival analysis
Title Early-Stage Event Prediction for Longitudinal Data
URI http://link.springer.com/10.1007/978-3-319-31753-3_12
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELZKWRADb_FWBrYqVeokTjIwIFpUlZYBFcQW2bHLQ6iVaCoEv5672G7SlgWWKIqqOr7POZ_v8R0hF76kjNMRdYXy4YCSBMKNmS_dKAth92Ac1gxGdAd3rPsQ9J7Cp1rtq5K1NMtFM_v-ta7kP6jCM8AVq2T_gOz8T-EB3AO-cAWE4bpk_C66WXV6sY7eF_mst9YzhmyaGWZlalqlNs95Y1D0gJgDxXUy-4C_5Fw0es1KgP_znWsv8fitzK25V9Jk6KKTHdTBbbO6zgqGZBdsVhi8g8mTmNUhX7N5DmN_gi2RZrJov9XWpXB5IR81veybEMbdJNczsV0mrNKpeiVay14J65Vc8muWrrWFY6wPegDtGF07b8u5QFXDYUdrP6W1M0PORV9znBqN29JcSGbzbmn22pV9oZoKgmVbOBrcpdifei1KQDWuX3V6_cfSPcdYmCCzmdnUkWdRB6T0W2GZkH1rQx1WzqJSovnbkCtB98KWGW6TTaxvcbDwBIS2Q2pqvEu2rNwdI_c9QivIOgWyTomsA8g6VWQdRHafDG86w-uuazpsuM9g1-aw03k8kmEks5hFlArBBGjoLBCCSka5DMOI-gxmJkfwqceqFY8EWNwcDs2UgnF7QOrjyVgdEkd4MXbHEMhrGXDlJUHMPZpFgadEkPnJEWnYOaf4yUxTy5cNEkr9FCSUFhJKUULHf_r1Cdkol-ApqecfM3UGpmIuzg2sPyX3XHk
link.rule.ids 782,783,787,796,27937
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Advances+in+Knowledge+Discovery+and+Data+Mining&rft.au=Fard%2C+Mahtab+J.&rft.au=Chawla%2C+Sanjay&rft.au=Reddy%2C+Chandan+K.&rft.atitle=Early-Stage+Event+Prediction+for+Longitudinal+Data&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2016-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319317526&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=139&rft.epage=151&rft_id=info:doi/10.1007%2F978-3-319-31753-3_12
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon