New Methods for Job and Occupation Classification

Research question/goal:

Currently, most surveys use open-ended questions to ask participants about their occupation. The verbatim responses are coded afterwards into a classification with hundreds of categories and thousands of jobs, which is an error-prone, time-consuming, and costly task. When textual answers have a low level of detail, accurate coding may be impossible.
The project aimed to improve the measurement process with a novel instrument: during the survey, directly after answering an initial open-ended question, respondents were asked a closed question about their occupation. A supervised machine learning algorithm was trained to suggest a short list of candidate job categories, from which respondents could select the most appropriate one. Careful design of the instrument’s layout, of the interaction between interviewers and respondents, and of the job descriptions used to communicate the categories helps ensure high usability.
The new instrument has been tested in different population surveys, and it has been shown that interviewers and respondents feel comfortable using the instrument. We argue that data quality improves when respondents can self-select the most appropriate occupational category. However, a detailed analysis of data quality turned out to be complex and is left for future research.
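
The suggestion step described above can be illustrated with a minimal sketch. It assumes a simple naive Bayes text classifier as a stand-in; the project description does not state which algorithm was used, and the training answers, category names, and function names below are hypothetical and not taken from the ALWA or NEPS data.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a verbatim answer and split it into word tokens."""
    return text.lower().split()


def train(examples):
    """Count word frequencies per category from (answer, category) pairs."""
    word_counts = defaultdict(Counter)
    category_counts = Counter()
    vocabulary = set()
    for answer, category in examples:
        tokens = tokenize(answer)
        word_counts[category].update(tokens)
        category_counts[category] += 1
        vocabulary.update(tokens)
    return word_counts, category_counts, vocabulary


def suggest(answer, model, k=3):
    """Return the k categories with the highest naive Bayes log score."""
    word_counts, category_counts, vocabulary = model
    total = sum(category_counts.values())
    scores = {}
    for category, n in category_counts.items():
        # Log prior plus Laplace-smoothed log likelihood of each token.
        score = math.log(n / total)
        denom = sum(word_counts[category].values()) + len(vocabulary)
        for token in tokenize(answer):
            score += math.log((word_counts[category][token] + 1) / denom)
        scores[category] = score
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical, previously coded answers used as training data.
TRAINING = [
    ("nurse in a hospital", "Nursing"),
    ("geriatric nurse", "Nursing"),
    ("primary school teacher", "Teaching"),
    ("teacher at a high school", "Teaching"),
    ("software developer", "Software development"),
    ("web developer", "Software development"),
]

model = train(TRAINING)
# The short list that would be shown to the respondent as a closed question.
suggestions = suggest("school teacher", model, k=2)
print(suggestions)
```

In an interview setting, the respondent would pick one entry from the returned short list, or none if no suggestion fits, in which case the verbatim answer could still be coded afterwards.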

Fact sheet

Funding:

DFG

Duration:

2014 to 2021

Status:

completed

Data Sources:

ALWA and NEPS survey data, additional sources

Geographic Space:

Germany

Publications

Books

Foster, Ian, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter and Julia Lane (Eds.) (2017): Big Data and Social Science: A Practical Guide to Methods and Tools. London: Chapman & Hall/CRC Press. [Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences]

Journal Articles

Amaya, Ashley, Ruben L. Bach, Florian Keusch and Frauke Kreuter (2021): New Data Sources in Social Science Research: Things to Know Before Working With Reddit Data. Social Science Computer Review, 39, issue 4, pp. 943–960.

Schierholz, Malte, Miriam Gensicke, Nikolai Tschersich and Frauke Kreuter (2018): Occupation coding during the interview. Journal of the Royal Statistical Society: Series A (Statistics in Society), 181, issue 2, pp. 379–407.

Schierholz, Malte (2018): Eine Hilfsklassifikation mit Tätigkeitsbeschreibungen für Zwecke der Berufskodierung. AStA Wirtschafts- und Sozialstatistisches Archiv: Eine Zeitschrift der Deutschen Statistischen Gesellschaft, 12, issue 3-4, pp. 285–298.

Book Chapters

Kreuter, Frauke (2017): Appendix A: Executive Summary from Innovations in Federal Statistics: Combining Data Sources While Protecting Privacy. Pp. 149–151 in: Robert M. Groves, Brian A. Harris-Kojetin (Eds.) Federal Statistics, Multiple Data Sources, and Privacy Protection: Next Steps. Washington, DC: The National Academies Press.

Other Publications

Schierholz, Malte, Lorraine Brenner, Lea Cohausz, Lisa Damminger, Lisa Fast, Ann-Kathrin Hörig, Anna-Lena Huber, Theresa Ludwig, Annabell Petry and Laura Tschischka (2018): Eine Hilfsklassifikation mit Tätigkeitsbeschreibungen für Zwecke der Berufskodierung: Leitgedanken und Dokumentation. Nürnberg [IAB-Discussion Paper; 13/2018]

Schierholz, Malte (2014): Automating survey coding for occupation. Nürnberg [FDZ-Methodenreport; 10/2014]

Conference Presentations

Schierholz, Malte (2018): A comparison of automatic algorithms for occupation coding. [BigSurv 2018, Barcelona, October 25th to October 27th, 2018]

Schierholz, Malte (2018): A comparison of automatic algorithms for occupation coding. [European Conference on Data Analysis, Paderborn, July 4th to July 6th, 2018]

Schierholz, Malte (2018): A comparison of automatic algorithms for occupation coding. [Joint Statistical Meetings, Vancouver, July 28th to August 2nd, 2018]

Schierholz, Malte (2018): A comparison of automatic algorithms for occupation coding. [Statistische Woche, Linz, September 11th to September 14th, 2018]

Schierholz, Malte (2017): A New Auxiliary Classification with Job Activities for Occupation Coding. [7th Conference of the European Survey Research Association, Lisbon, July 17th to July 21st, 2017]

Schierholz, Malte (2016): New Methods for the Measurement of Occupation. [Seminar at the U.S. Census Bureau, Washington, DC, July 26th, 2016]

Schierholz, Malte (2016): New Methods for the Measurement of Occupation. [Seminar at the Bureau of Labor Statistics, Washington, DC, July 28th, 2016]

Schierholz, Malte, Miriam Gensicke and Nikolai Tschersich (2016): Occupation Coding During the Interview. [Joint Statistical Meetings 2016, Chicago, IL, July 30th to August 4th, 2016]

Schierholz, Malte, Miriam Gensicke and Nikolai Tschersich (2016): Occupation Coding During the Interview. [Expert workshop 'Indicators for job quality, industrial relations, occupations, and new skills and tasks', Amsterdam, November 7th to November 8th, 2016]

Schierholz, Malte (2015): Asking for Occupation during the Interview: Experimental Results. [6th Conference of the European Survey Research Association (ESRA), Reykjavik, July 13th to July 17th, 2015]

Bethmann, Arne, Malte Schierholz, Knut Wenzig and Markus Zielonka (2014): Automatic Coding of Occupations: Using Machine Learning Algorithms for Occupation Coding in Several German Panel Surveys. [WAPOR 67th Annual Conference: Extensible Public Opinion, Nice, September 4th to September 6th, 2014]

Schierholz, Malte, Arne Bethmann, Knut Wenzig and Markus Zielonka (2014): Automatic Coding of Occupations: Using Machine Learning Algorithms for Occupation Coding in Several German Panel Surveys. [Statistische Woche, Hannover, September 16th to September 19th, 2014]

Bethmann, Arne, Malte Schierholz, Knut Wenzig and Markus Zielonka (2014): Automatic Coding of Occupations: Using Machine Learning Algorithms for Occupation Coding in Several German Panel Surveys. [VI European Congress of Methodology, Utrecht University, July 23rd to July 25th, 2014]

Schierholz, Malte and Arne Bethmann (2014): Automating Survey Coding for Occupation. [Joint Statistical Meetings 2014, Boston, Mass., August 2nd to August 7th, 2014]
