This paper outlines two studies on education bias in German probability-based surveys. Study 1 reviews data from 67 surveys across 19 survey programs conducted in Germany from 2000 to 2023. We found a consistent underrepresentation of individuals with a low level of formal education. We also found that the transition to self-administered modes due to rising survey costs may exacerbate this bias in ...
In:
International Journal of Social Research Methodology
(2026), im Ersch. [online first: 2025-06-11]
| Annika Stein, Tobias Gummer, Elias Naumann, Björn Rohr, Henning Silber, Roman Auriga, Michael Bergmann, Arne Bethmann, Michael Blohm, Carina Cornesse, Pablo Christmann, Mustafa Coban, Jean Philippe Décieux, Britta Gauly, Caroline Hahn, Susanne Helmschrott, Oshrat Hochman, Johannes Lemcke, Dörte Naber, Steffen Pötzschke, Joss Roßmann, Jan-Lucas Schanze, Tobias Schmidt, Silke L. Schneider, Heike Spangenberg, Tobias Rettig, Mark Trappmann, Michael Weinhardt, Bernd Weiß
Georeferenced data are often anonymized for data protection reasons. This is done either by aggregating the data into larger spatial units (e.g., higher-level administrative units or grids with larger cell sizes) or by using stochastic methods to deliberately overlay the original coordinates. These methods significantly distort the data and associated variables, making further modeling steps...
Der englische Originaltitel des Seminars lautet: “Measurement error models on anonymized georeferenced data”. Die Präsentation findet auf Englisch statt.
Eine kurze Zusammenfassung zum Vortrag ist nur auf der englischen Veranstaltungsseite verfügbar!
Multiple imputation of missing values in survey data analysis is a state-of-the-art technique. Typically, methods like multivariate imputation by chained equations (mice, van Buuren 2018) are employed, replacing missing values on a variable-by-variable basis. Generally, the information used for imputation comes from the survey dataset being analysed. Valid analysis results are achieved when the...
Der englische Originaltitel des Seminars lautet: “Filling in the Blanks: Augmenting Survey Data Imputation with an External Prior”. Die Präsentation findet auf Englisch statt.
Eine kurze Zusammenfassung zum Vortrag ist nur auf der englischen Veranstaltungsseite verfügbar!
FAIRness of research data, meaning that data are managed according to the principles of being Findable, Accessible, Interoperable, and Reusable, has become a ubiquitous requirement in research data policies as well as in general guidelines for research data management. Meeting this requirement largely depends on the availability of rich and standardized DDI metadata—based on the Data Documentation ...
In:
Data Science Journal
25 (2026), 13, S. 1-13
| Knut Wenzig, Andreas Daniel, Dominique Hansen, Tobias Koberg, Mihaela Tudose
Earnings are often top-coded (right-censored) in administrative registers. The censoring threshold in the case of Germany is the limit value for social security contributions, leading to a substantial fraction of censoring: For example, about 12%of male workers inWest Germany are affected, rising to above 30% for highly educated prime-aged workers. This missing right tail of the earnings distribution ...
In:
Jahrbücher für Nationalökonomie und Statistik
246 (2026), 1, S. 5–34
| Mattis Beckmannshagen, Johannes König, Isabella Retter, Christian Schluter, Carsten Schröder, Yogam Tchokni
Am 17. und 18. November organisieren wir einen zweitägigen Präsenz-Workshop zum Sozio-oekonomischen Panel (SOEP) am DIW Berlin. Neben Vorträgen zu Aufbau, Datenstruktur, Ziehungsdesign und Gewichtungsstrategie bieten hands-on Sessions eine praxisorientierte Annäherung an die Daten und ihre Potentiale. Der Workshop richtet sich an Wissenschaftlerinnen und Wissenschaftler auf allen Qualifikationsstufen, ...