Christoph Kern, Bernd Weiss, Jan-Philipp Kolb
Predicting Nonresponse in Future Waves of a Probability-Based Mixed-Mode Panel with Machine Learning

Journal of Survey Statistics and Methodology, 2023: 11, issue 1, pp. 100-123

ISSN: 2325-0984 (print), 2325-0992 (online)

DOI:

Abstract

Nonresponse in panel studies can lead to a substantial loss in data quality owing to its potential to introduce bias and distort survey estimates. Recent work investigates the use of machine learning to predict nonresponse in advance, such that predicted nonresponse propensities can be used to inform the data collection process. However, predicting nonresponse in panel studies requires accounting for the longitudinal data structure in terms of model building, tuning, and evaluation. This study proposes a longitudinal framework for predicting nonresponse with machine learning and multiple panel waves and illustrates its application. With respect to model building, this approach utilizes information from multiple waves by introducing features that aggregate previous (non)response patterns. Concerning model tuning and evaluation, temporal cross-validation is employed by iterating through pairs of panel waves such that the training and test sets move in time. Implementing this approach with data from a German probability-based mixed-mode panel shows that aggregating information over multiple panel waves can be used to build prediction models with competitive and robust performance over all test waves.
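The two ideas named in the abstract, features that aggregate previous (non)response patterns and temporal cross-validation over pairs of waves, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy data, the three feature names, and the helper functions `aggregate_features`, `build_set`, and `temporal_splits` are hypothetical and do not come from the paper, which may use different features, models, and split rules.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical toy panel: 1 = responded, 0 = did not respond, one entry per wave.
RESPONSES: Dict[str, List[int]] = {
    "r1": [1, 1, 1, 1, 1],
    "r2": [1, 0, 1, 0, 0],
    "r3": [1, 1, 0, 0, 1],
}


def aggregate_features(history: List[int]) -> Dict[str, float]:
    """Summarize a respondent's observed (non)response pattern (illustrative features)."""
    # Count consecutive nonresponses at the end of the observed history.
    streak = 0
    for responded in reversed(history):
        if responded == 1:
            break
        streak += 1
    return {
        "response_rate": sum(history) / len(history),
        "responded_last_wave": float(history[-1]),
        "nonresponse_streak": float(streak),
    }


def build_set(
    responses: Dict[str, List[int]], target_wave: int
) -> List[Tuple[str, Dict[str, float], int]]:
    """Features use only waves before target_wave; label is nonresponse in target_wave."""
    rows = []
    for respondent_id, pattern in responses.items():
        features = aggregate_features(pattern[:target_wave])
        label = 1 - pattern[target_wave]  # 1 = nonresponse in the target wave
        rows.append((respondent_id, features, label))
    return rows


def temporal_splits(n_waves: int, first_target: int = 1) -> Iterator[Tuple[int, int]]:
    """Yield (train_target, test_target) wave indices so both sets move forward in time."""
    for t in range(first_target, n_waves - 1):
        yield t, t + 1


if __name__ == "__main__":
    for train_wave, test_wave in temporal_splits(n_waves=5):
        train_rows = build_set(RESPONSES, train_wave)
        test_rows = build_set(RESPONSES, test_wave)
        # A model would be fit on train_rows and evaluated on test_rows here.
        print(train_wave, test_wave, len(train_rows), len(test_rows))
```

In this sketch the test wave always lies after the training wave, and features for a given target wave are computed only from earlier waves, so no information from the predicted wave leaks into its own features.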

Further information

Oxford Academic: PDF
