Using representation balancing to learn conditional-average dose responses from clustered data

Estimating a unit's responses to interventions with an associated dose, the "conditional average dose response" (CADR), is relevant in a variety of domains, from healthcare to business, economics, and beyond. Such a response typically needs to be estimated from observational data, whi...

Full description

Saved in:
Bibliographic Details
Main Authors Bockel-Rickermann, Christopher, Vanderschueren, Toon, Berrevoets, Jeroen, Verdonck, Tim, Verbeke, Wouter
Format Journal Article
LanguageEnglish
Published 07.09.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Estimating a unit's responses to interventions with an associated dose, the "conditional average dose response" (CADR), is relevant in a variety of domains, from healthcare to business, economics, and beyond. Such a response typically needs to be estimated from observational data, which introduces several challenges. That is why the machine learning (ML) community has proposed several tailored CADR estimators. Yet, the proposal of most of these methods requires strong assumptions on the distribution of data and the assignment of interventions, which go beyond the standard assumptions in causal inference. Whereas previous works have so far focused on smooth shifts in covariate distributions across doses, in this work, we will study estimating CADR from clustered data and where different doses are assigned to different segments of a population. On a novel benchmarking dataset, we show the impacts of clustered data on model performance and propose an estimator, CBRNet, that learns cluster-agnostic and hence dose-agnostic covariate representations through representation balancing for unbiased CADR inference. We run extensive experiments to illustrate the workings of our method and compare it with the state of the art in ML for CADR estimation.
AbstractList Estimating a unit's responses to interventions with an associated dose, the "conditional average dose response" (CADR), is relevant in a variety of domains, from healthcare to business, economics, and beyond. Such a response typically needs to be estimated from observational data, which introduces several challenges. That is why the machine learning (ML) community has proposed several tailored CADR estimators. Yet, the proposal of most of these methods requires strong assumptions on the distribution of data and the assignment of interventions, which go beyond the standard assumptions in causal inference. Whereas previous works have so far focused on smooth shifts in covariate distributions across doses, in this work, we will study estimating CADR from clustered data and where different doses are assigned to different segments of a population. On a novel benchmarking dataset, we show the impacts of clustered data on model performance and propose an estimator, CBRNet, that learns cluster-agnostic and hence dose-agnostic covariate representations through representation balancing for unbiased CADR inference. We run extensive experiments to illustrate the workings of our method and compare it with the state of the art in ML for CADR estimation.
Author Vanderschueren, Toon
Berrevoets, Jeroen
Bockel-Rickermann, Christopher
Verdonck, Tim
Verbeke, Wouter
Author_xml – sequence: 1
  givenname: Christopher
  surname: Bockel-Rickermann
  fullname: Bockel-Rickermann, Christopher
– sequence: 2
  givenname: Toon
  surname: Vanderschueren
  fullname: Vanderschueren, Toon
– sequence: 3
  givenname: Jeroen
  surname: Berrevoets
  fullname: Berrevoets, Jeroen
– sequence: 4
  givenname: Tim
  surname: Verdonck
  fullname: Verdonck, Tim
– sequence: 5
  givenname: Wouter
  surname: Verbeke
  fullname: Verbeke, Wouter
BackLink https://doi.org/10.48550/arXiv.2309.03731$$DView paper in arXiv
BookMark eNqFjjsOwjAQBV1Awe8AVPgCCQ4mAmoE4gDQEi3xJrLkrKO1ieD2kIie6hXzRpqpGJEnFGKZqXS7z3O1Bn7ZLt1odUiV3ulsIu63YKmWjC1jQIoQrSf5AAdU9iB66RCYZOnJ2B6CS6BDhhql8QG_amg9BQyyYt_I0j1DREYjDUSYi3EFLuDitzOxOp-ux0sylBQt2wb4XfRFxVCk_z8-6PlEsw
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
EPD
GOX
DOI 10.48550/arxiv.2309.03731
DatabaseName arXiv Computer Science
arXiv Statistics
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2309_03731
GroupedDBID AKY
EPD
GOX
ID FETCH-arxiv_primary_2309_037313
IEDL.DBID GOX
IngestDate Tue Jul 30 12:10:34 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2309_037313
OpenAccessLink https://arxiv.org/abs/2309.03731
ParticipantIDs arxiv_primary_2309_03731
PublicationCentury 2000
PublicationDate 2023-09-07
PublicationDateYYYYMMDD 2023-09-07
PublicationDate_xml – month: 09
  year: 2023
  text: 2023-09-07
  day: 07
PublicationDecade 2020
PublicationYear 2023
Score 3.7915976
SecondaryResourceType preprint
Snippet Estimating a unit's responses to interventions with an associated dose, the "conditional average dose response" (CADR), is relevant in a variety of domains,...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Statistics - Methodology
Title Using representation balancing to learn conditional-average dose responses from clustered data
URI https://arxiv.org/abs/2309.03731
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NSwMxEB1qT15EUanfc_Aa1G7STY4i1iKoF4U9uWyyWShIK92t9Oc7M2nRS6-ZJAxJhpkkb94AXFfurs5dYxS5G6N0EytlKcpVTmfeBu2DldqAL6-jyYd-LkzRA9zkwlSL1fQn8QP79obiY-YgzTlRemc4ZMjW01uRPieFimvd_68fxZjS9M9JjPdhbx3d4X3ajgPoxdkhfMq3PAp95CbVZ4aeMYWBBd0cpXYD0tW0nqa3OVXRESNTx3reRhoqQNbYImeDYPhaMr1BrJHxnUdwNX58f5go0aj8TvQRJStbirLZMfTpkh8HgFwkKtOhqa0faXPb2Cw6742LZJJO5_EEBttmOd0uOoNdLo8umKj8HPrdYhkvyIl2_lJW8hcSNXfq
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Using+representation+balancing+to+learn+conditional-average+dose+responses+from+clustered+data&rft.au=Bockel-Rickermann%2C+Christopher&rft.au=Vanderschueren%2C+Toon&rft.au=Berrevoets%2C+Jeroen&rft.au=Verdonck%2C+Tim&rft.date=2023-09-07&rft_id=info:doi/10.48550%2Farxiv.2309.03731&rft.externalDocID=2309_03731