Pre-Trained Nonresponse Prediction in Panel Surveys with Machine Learning

While predictive modeling for unit nonresponse in panel surveys has been explored in various contexts, it is still under-researched how practitioners can best adopt these techniques. Currently, practitioners need to wait until they accumulate enough data in their panel to train and evaluate their ow...

Full description

Saved in:
Bibliographic Details
Published inSurvey research methods Vol. 19; no. 2
Main Authors John Collins, Christoph Kern
Format Journal Article
LanguageEnglish
Published European Survey Research Association 08.08.2025
Subjects
Online AccessGet full text

Cover

Loading…
Abstract While predictive modeling for unit nonresponse in panel surveys has been explored in various contexts, it is still under-researched how practitioners can best adopt these techniques. Currently, practitioners need to wait until they accumulate enough data in their panel to train and evaluate their own modeling options. This paper presents a novel “cross-training” technique in which we show that the indicators of nonresponse are so ubiquitous across studies that it is viable to train a model on one panel study and apply it to a different one. The practical benefit of this approach is that newly commencing panels can potentially make better nonresponse predictions in the early waves because these pre-trained models make use of more data. We demonstrate this technique with five panel surveys which encompass a variety of survey designs: the Socio-Economic Panel (SOEP), the German Internet Usage Panel (GIP), the GESIS Panel, the Mannheim Corona Study (MCS), and the Family Demographic Panel (FREDA). We demonstrate that nonresponse history and demographics, paired with tree-based modeling methods, make highly accurate and generalizable predictions across studies, despite differences in panel design. We show how cross-training can effectively predict nonresponse in early panel waves where attrition is typically highest. 
AbstractList While predictive modeling for unit nonresponse in panel surveys has been explored in various contexts, it is still under-researched how practitioners can best adopt these techniques. Currently, practitioners need to wait until they accumulate enough data in their panel to train and evaluate their own modeling options. This paper presents a novel “cross-training” technique in which we show that the indicators of nonresponse are so ubiquitous across studies that it is viable to train a model on one panel study and apply it to a different one. The practical benefit of this approach is that newly commencing panels can potentially make better nonresponse predictions in the early waves because these pre-trained models make use of more data. We demonstrate this technique with five panel surveys which encompass a variety of survey designs: the Socio-Economic Panel (SOEP), the German Internet Usage Panel (GIP), the GESIS Panel, the Mannheim Corona Study (MCS), and the Family Demographic Panel (FREDA). We demonstrate that nonresponse history and demographics, paired with tree-based modeling methods, make highly accurate and generalizable predictions across studies, despite differences in panel design. We show how cross-training can effectively predict nonresponse in early panel waves where attrition is typically highest. 
Author Christoph Kern
John Collins
Author_xml – sequence: 1
  fullname: John Collins
  organization: University of Mannheim
– sequence: 2
  orcidid: 0000-0001-7363-4299
  fullname: Christoph Kern
  organization: Ludwig Maximilian University of Munich
BookMark eNotjstKAzEYRoMo2FbfwEVeYNrcJpMspXgpVC1Y10Mm-dOmtElJxkrfvoO6-uDAdzhjdB1TBIQeKJlSRYWalXyYMcLq6YnqwKZKNPwKjaiSouJc0ls0LmVHiJRKkRFarDJU62xCBIffU8xQjikWwAN3wfYhRRwiXpkIe_z5nU9wLvgn9Fv8Zux2eOElmBxD3NyhG2_2Be7_d4K-np_W89dq-fGymD8uK0e56iupu85zojlYJin1QIWVhEnhm44rQgRlWnnmTN0MtYY6AtIp6mQtDBCq-QQt_rwumV17zOFg8rlNJrS_IOVNa3If7B7aTgivLW8Ub4iwXWNA-Fp5qcVQILjhF5eTXDo
ContentType Journal Article
DBID DOA
DOI 10.18148/srm/2025.v19i2.8473
DatabaseName DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ: Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Statistics
EISSN 1864-3361
ExternalDocumentID oai_doaj_org_article_b44f9c3783704cb7ae4f58f694f3043a
GroupedDBID 123
2WC
5VS
ACHQT
ADBBV
AFMMW
ALMA_UNASSIGNED_HOLDINGS
BCNDV
E3Z
GROUPED_DOAJ
KQ8
M~E
OK1
OVT
P2P
TR2
ID FETCH-LOGICAL-d138t-69bbf3093ec2611fe14c60264f7b380041298f2da57361a1d0e6d81d654ae0193
IEDL.DBID DOA
IngestDate Wed Aug 27 01:29:23 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-d138t-69bbf3093ec2611fe14c60264f7b380041298f2da57361a1d0e6d81d654ae0193
ORCID 0000-0001-7363-4299
OpenAccessLink https://doaj.org/article/b44f9c3783704cb7ae4f58f694f3043a
ParticipantIDs doaj_primary_oai_doaj_org_article_b44f9c3783704cb7ae4f58f694f3043a
PublicationCentury 2000
PublicationDate 2025-08-08
PublicationDateYYYYMMDD 2025-08-08
PublicationDate_xml – month: 08
  year: 2025
  text: 2025-08-08
  day: 08
PublicationDecade 2020
PublicationTitle Survey research methods
PublicationYear 2025
Publisher European Survey Research Association
Publisher_xml – name: European Survey Research Association
SSID ssj0066880
Score 2.339459
Snippet While predictive modeling for unit nonresponse in panel surveys has been explored in various contexts, it is still under-researched how practitioners can best...
SourceID doaj
SourceType Open Website
SubjectTerms FReDA
German Internet Panel
GESIS Panel
Machine Learning
Nonresponse
SOEP
Title Pre-Trained Nonresponse Prediction in Panel Surveys with Machine Learning
URI https://doaj.org/article/b44f9c3783704cb7ae4f58f694f3043a
Volume 19
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrZ09T8MwEIYt1KkL4lN8ywNr2jq-OPYIiKogtapEK3WL4viMKkGKStvfj88JEkwsrB4SxW_ke88-P8fYrQYh7MBhol2mEqAKMQPWJGkV3DKYSnmkrYHxRI3m8LzIFj9afVFNWIMHbiaubwG8qWROkBaobF4i-Ex7ZcCHTFxGaxRi3ncy1azBSoXfsr0op4Ph73-u3ynNz3o7YZZpLyzJ8hekP0aT4QHbb20gv2tef8j2sD5iXXJ-DTj5mD1N15jMqIEDOj5ZEZWHilmRh3G3jLcR-LLm07LGN_6yXe-CJJx2Vfk41kcib9GprydsPnycPYyStu9B4oTUm0QZaz2dUGIV8hvhUUBFnaLA51bqSMgy2qeuzHKpRCncAJULvlNlUGKwbPKUdepVjWeM58JDaQbSyYxO6FT4cGMcRuSMSRHO2T1NQvHRoC0Kgk3HgSBB0UpQ_CXBxX885JJ1SaBYW6evWGez3uJ1iPcbexOl_QLzUabt
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Pre-Trained+Nonresponse+Prediction+in+Panel+Surveys+with+Machine+Learning&rft.jtitle=Survey+research+methods&rft.au=John+Collins&rft.au=Christoph+Kern&rft.date=2025-08-08&rft.pub=European+Survey+Research+Association&rft.eissn=1864-3361&rft.volume=19&rft.issue=2&rft_id=info:doi/10.18148%2Fsrm%2F2025.v19i2.8473&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_b44f9c3783704cb7ae4f58f694f3043a