Direkt zum Inhalt

Reanalyse-Archiv des SOEP

Replikation von Analysen mit SOEP-Daten

Das SOEP unterstützt die Bestrebungen in der Wissenschaft, Daten so bereitzustellen, dass sie einfach für Replikationen oder zur Nachnutzung zur Verfügung stehen. Gleichzeitig ist das SOEP jedoch den Befragten gegenüber verpflichtet, sicherzustellen, dass die Daten nur für wissenschafliche Zwecke genutzt werden. Das heißt, alle Datennutzerinnen und -nutzer müssen einen Datennutzungsvertrag mit dem DIW Berlin schließen und Datennutzende dürfen die SOEP-Daten oder auch nur Auszüge daraus nicht Anderen bereitstellen.

Wir möchten alle Forschenden dazu ermuntern, die Syntax der Datenaufbereitung und Analyse zu archivieren. Wenn in der Syntax gleichzeitig die Version des verwendete SOEP-Datensatzes mitgeteilt wird, der über die DOI eindeutig identifiziert werden kann, ist damit eine Replikation der Ergebnisse möglich.

Diese Art der Archivierung kann gern in einem freien Repository (z.B. beim Open Science Framework OSF: https://osf.io/) oder auf der Webseite des veröffentlichenden Journals geschehen. Darüber hinaus bietet auch das FDZ SOEP den Forschenden die Möglichkeit, die Syntaxfiles zu archivieren und auf dieser Webseite zum Download zur Verfügung zu stellen.

Teilweise verlangen Fachzeitschriften jedoch, dass auch der aus den Rohdaten aufbereitete Analysedatensatz bereitgestellt werden muss. Für solche Fälle bieten wir daher die Alternative an, dass auch der aufbereitete Datensatz beim SOEP archiviert und auf Nachfrage über das FDZ SOEP an weitere Nutzende bereitgestellt wird. Aber auch diese Nachnutzenden müssen vorher einen Vertrag mit dem DIW Berlin schließen.

Publikationen, die Informationen zur Re-Analyse bereitstellen

  • Health Care Reform and the Number of Doctor Visits - An Econometric Analysis (2004)
    Rainer Winkelmann
    Journal of Applied Econometrics 19, no. 4, 455-472
    (Pre-published: 2003: IZA DP No. 317. Bonn: Institute for the Study of Labor (IZA)) DOI: 10.1002/jae.764
    Dataset and description

  • Incentive Effects in the Demand for Health Care: A Bivariate Panel Count Data Estimation (2003)
    Regina T. Riphahn, Achim Wambach, and Andreas Million
    Journal of Applied Econometrics 18, no. 4, 387-405.
    (Also published 2003: IZA Reprint Series A - 201/2003. Bonn: Institute for the Study of Labor). DOI: 10.1002/jae.680
    Dataset and description

  • No evidence that economic inequality moderates the effect of income on generosity.
    Schukle, Stefan C., Martin Korndörfer and Boris Egloff (2019)
    Proceedings of the National Academy of Sciences (PNAS), Advance Online Publication.,
    DOI: 10.1073/pnas.1807942116.
    Description

  • Stated and Revealed Heterogeneous Risk Preferences in Educational Choice (2017)
    Frank M. Fossen and Daniela Glocker
    European Economic Review 91, August 2017, 1-25, DOI: 10.1016/j.euroecorev.2017.03.016
    Description

  • Fertility progression in Germany: An analysis using flexible nonparametric cure survival models (2016)
    Vincent Bremhorst, Michaela Kreyenfeld and Philippe Lambert
    Demographic Research 35 (18), 505-534. DOI:10.4054/DemRes.2016.35.18

  • Law and social capital: Evidence from the Code Napoleon in Germany (2016)
    Johannes C Buggle
    European Economic Review, 87, August 2016, 148-175. DOI: 10.1016/j.euroecorev.2016.05.003.

Dataset not completely anonymized. Please contact SOEPmail@diw.de to gain access.

  • The macroeconomic effects of social security contributions and benefits (2020)
    Gechert, Sebastian, Christoph Paetz, and Paloma Villanueva
    Journal of Monetary Economics (online first), DOI: 10.1016/j.jmoneco.2020.03.012
    Replication files

  • Being Unengaged at Work but Still Dedicating Time and Energy: A Longitudinal Study (2020)
    Hommelhoff, Sabine, David Richter, Cornelia Niessen, Denis Gerstorf and Jutta Heckhausen
    Motivation Science 6 (4), 368-373, doi: 10.1037/mot0000155
    Replication files

  • The impact of having children on domain-specific life satisfaction: A quasi-experimental longitudinal investigation using the Socio-Economic Panel (SOEP) data (2020)
    Krämer, Michael D., and Joseph Lee Rodgers
    Journal of Personality and Social Psychology 119 (6), 1497–1514, DOI: 10.1037/pspp0000279
    Syntax

  • Long-term effects of pregnancy and childbirth on sleep satisfaction and duration of first-time and experienced mothers and fathers (2019)
    Richter, David, Michael D. Krämer, Nicole K. Y. Tang, Hawley E. Montgomery-Downs and Sakari Lemola
    Sleep 42 (4), 1-10, doi: 10.1093/sleep/zsz015
    Syntax

  • Predictors of Refugee Adjustment: The Importance of Cognitive Skills and Personality (2019)
    Hahn, Elisabeth, David Richter, Jürgen Schupp and Mitja D. Back
    Collabra: Psychology 5 (1), Art. 23, doi: 10.1525/collabra.212
    Syntax

  • Wie beeinflussen Kinder das Eingehen neuer Partnerschaften? Ein Vergleich zwischen Alleinerziehenden und Kinderlosen (2019)
    Rapp, Ingmar, und Elif Sari
    Zeitschrift für Soziologie 48 (1), 23-41. DOI: 10.1515/zfsoz-2019-0003
    Syntax

  • Age Differences in Intertemporal Choice: U-Shaped Associations in a Probability Sample of German Households (2018)
    Richter, David and Rui Mata
    Psychology and Aging 33 (5), 782-788, doi: 10.1037/pag0000266
    Syntax 

  • Successfully Striving for Happiness: Socially Engaged Pursuits Predict Increases in Life Satisfaction (2018)
    Rohrer, Julia M., David Richter, Martin Brümmer, Gert G. Wagner and Stefan C. Schmukle
    Psychological Science 29 (8), 1291-1298, doi: 10.1177/0956797618761660
    Syntax

  • Personality Trait Effects on Green Household Installations (2018)
    Busic-Sontic, Ante and Cameron Brick
    Collabra: Psychology 4 (4), 8, doi: 10.1525/collabra.120
    Syntax

  • "What Else Are You Worried About?" – Integrating Textual Responses into Quantitative Social Science Research (2017)
    Rohrer, Julia M., Martin Brümmer, Stefan Schmukle, Jan Goebel and Gert G. Wagner
    PLOS ONE 12 (7), e0182156, doi: 10.1371/journal.pone.0182156
    Syntax

  • The Micro-Foundations of Party Competition and Issue Ownership: The Reciprocal Effects of Citizens’ Issue Salience and Party Attachments (2018)
    Neundorf, Anja and James Adams
    British Journal of Political Science 48 (2), 385-406, DOI: 10.1017/S0007123415000642
    Description

  • Individual Wealth and Subjective Financial Well-being in Marriage: Resource Integration or Separation? (2017)
    Philipp M. Lersch
    Journal of Marriage and Family 79 (5), 1211-1223, DOI: 10.1111/jomf.12406
    Replication files

  • Parents' Death and Adult Well-being: Gender, Age, and Adaptation to Filial Bereavement (2015)
    Thomas Leopold and Clemens M. Lechner Journal of Marriage and Family 77 (3), 747-760. DOI: 10.1111/jomf.12186
    Stata syntax (Do-File)

  • Religious Attendance Buffers the Impact of Unemployment on Life Satisfaction: Longitudinal Evidence from Germany (2015)
    Thomas Leopold and Clemens M. Lechner
    Journal for the Scientific Study of Religion 54 (1), 166-174. DOI: 10.1111/jssr.12171
    Stata syntax (Do-File)

  • Belastet, aber hochzufrieden? Arbeitsbelastung von Lehrkräften im Quer- und Längsschnitt (2014)
    Johannes Schult, Manuela Münzer-Schrobildgen and Jörn R. Sparfeldt
    Zeitschrift für Gesundheitspsychologie, 22, no. 2, 61-67. DOI: 10.1026/0943-8149/a000114
    Stata syntax (Do-File)

  • Prädiktoren des Berufserfolgs von Hochschulabsolventen: Befunde aus dem Sozio-Ökonomischen Panel (2012)
    Johannes Schult
    Wirtschaftspsychologie, 14, no. 4, 82-91.
    Stata syntax (Do-File)

Detailed instructions for researchers

In the interests of improving the statistical infrastructure for reanalysis and replication studies using SOEP data, the SOEP group now offers users a variety of options for making their SOEP working dataset available to other researchers. These options apply to all data formats associated with SOEP, including CNEF, EU-SILC-Clone, LIS, and LWS. If your working dataset includes any SOEP microdata (or data derived from SOEP), you as a SOEP user may not transfer the data to the journal’s editorial office, but may instead take advantage of the following alternatives:

  • In our view, the most transparent approach is to publish the relevant syntax for the article along with the data processing method. The syntax published on the journal webpage should clearly identify the SOEP data version used by the authors (more information here). Other researchers who are interested in the data can obtain the identical version after signing a user contract with the SOEP. Users can include a paragraph in their article like the following to describe this process:

    Data Availability: Data are available from the German Socio-economic Panel Study (SOEP) due to third party restrictions (for requests, please contact soepmail@diw.de). The scientific use file of the SOEP with anonymous microdata is made available free of charge to universities and research institutes for research and teaching purposes. The direct use of SOEP data is subject to the strict provisions of German data protection law. Therefore, signing a data distribution contract is a precondition for working with SOEP data.

    The data distribution contract can be requested with a form, available at: http://www.diw.de/soepforms. For further information, contact the SOEPhotline at either soepmail@diw.de or +49-30-89789-292.

  • If the journal to which you intend to submit a paper does not agree with the approach above, we offer a second option: storing your working dataset in a special archive at DIW Berlin. From our experience, journals generally accept this arrangement. Researchers who want to reanalyze the dataset must apply for a standard SOEP user contract to be granted access to the archived data. Of course, such a contract includes access to all other SOEP scientific use files as well. If the dataset is exceptionally sensitive due to the inclusion of detailed geo-coded data, users will probably be required to visit the RDC SOEP to obtain data access.
  • SOEP data can only be made available for free download under two conditions. First, we (the SOEP group, together with the DIW Berlin data protection officer) have to check whether the relevant working dataset can be treated as a “completely anonymized dataset” (absolut anonymisierter Mikrodatensatz). This may be the case when the number of observations as well as variables is small and all original IDs have been removed. Second, measures have to be undertaken to change the data, for instance, by adding random error to metric variables and randomly interchanging categories in the case of non-metric variables. In such cases, journals may be permitted to archive the data, but only with DIW Berlin’s official approval.

Whenever a journal editor asks for your working data set, please contact us at  soepmail@diw.de. We would be happy to deposit the data in a special archive and notify the journal editor about the access procedure.

If you want to send us data for our replication service, please use the cryptshare server hosted by us at https://cs-soep.diw.de/. After briefly registering, you can distribute the data using a secure, encrypted server.

In order to improve the infrastructure for the re-analysis of published findings based on SOEP data we also provide information of the following types:

  • references to publications using completely anonymized and partly artificial (SOEP) microdata (including links to the dataset)
  • references to publications using a working dataset deposited at our archive  (as offered above),  available for licenced SOEP users, and
  • references to publications using the SOEP dataset and providing the generated syntax files.

Bei Fragen hilft Ihnen gerne die SOEPhotline

SOEPhotline

Philipp Kaminsky und Antonia Meier
Nutzerberatung und Vertragsangelegenheiten des FDZ SOEP in der Infrastruktureinrichtung Sozio-oekonomisches Panel

keyboard_arrow_up