
SOEP-IS 2019 (Data 1998-2019)

Dataset Information

Title: SOEP Innovation Sample (SOEP-IS), data from 1998-2019

DOI info: 10.5684/
Collection period: 1998-2019
Publication date: 2021-05-06
Principal investigators: Jan Goebel, Stefan Liebig, David Richter, Carsten Schröder, Jürgen Schupp, Knut Wenzig

Data collector: Kantar Public Deutschland GmbH

Population: Persons living in private households in Germany.

Selection method: All SOEP samples are regionally clustered, multi-stage random samples. The respondents (households) are selected by random walk.

Collection mode: The SOEP-IS is conducted exclusively using CAPI (computer-assisted personal interviewing). In principle, an interviewer tries to obtain face-to-face interviews with all members of a given survey household aged 16 and over. Additionally, one person (the head of household) is asked to complete a household questionnaire covering information on housing, housing costs, and different sources of income. This questionnaire also includes some questions about children up to 16 years of age in the household (e.g., kindergarten attendance or elementary school attendance).

Citation of the Data Set: SOEP Innovation Sample (SOEP-IS), data from 1998-2019. 2020. DOI: 10.5684/

New Modules

Innovative modules of survey year 2018

Changes in datasets and individual variables


An error in the 2017 release affected several activity-related variables of the p-dataset (pli0090, pli0091, pli0092, pli0096, pli0097, pli0098). In the p-dataset, these variables have five possible positive responses:

"[1] Daily", "[2] At least once a week", "[3] At least once a month", "[4] Seldom" and "[5] Never".

This reflects how the questions were asked in several specific survey years. In other survey years, however, the "Daily" option was not included in the questionnaire, and only the following responses were possible:

"[1] At least once a week", "[2] At least once a month", "[3] Seldom" and "[4] Never".

To account for this difference in the final dataset, in previous years we recoded the latter version to match the former (i.e., "[1] At least once a week" to "[2] At least once a week", etc.). Unfortunately, this recoding was not performed for the 2017 data. As a result, original "[1] At least once a week" responses were reported as "[1] Daily" in the final dataset, "[2] At least once a month" responses as "[2] At least once a week", and so forth.

This error only concerned the 2017 responses for the variables mentioned above and has been fixed in the new 2019 release version. It should be noted that in the 2019 interviews, the full scale with 5 possible responses has been used again. To highlight that two different response scales have been used over the years (1-4 and 1-5), we have created additional copies of these variables with a suffix ("*_v1" and "*_v2"). These copies only include the respective original scales without any recoding.
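The recoding described above can be illustrated with a minimal Python sketch. It is not the code used to produce the release. The function name `harmonize_four_point` and the two label dictionaries are hypothetical; only the codes and labels are taken from the text. The sketch also assumes that non-positive codes are missing-value codes and should pass through unchanged.

```python
# Response scales as quoted in the text (codes and labels only).
FIVE_POINT_LABELS = {
    1: "Daily",
    2: "At least once a week",
    3: "At least once a month",
    4: "Seldom",
    5: "Never",
}

FOUR_POINT_LABELS = {
    1: "At least once a week",
    2: "At least once a month",
    3: "Seldom",
    4: "Never",
}


def harmonize_four_point(code):
    """Map a code from the 4-point scale onto the 5-point scale.

    Assumption: codes <= 0 are missing-value codes and are left unchanged.
    """
    if code <= 0:
        return code
    if code not in FOUR_POINT_LABELS:
        raise ValueError(f"not a valid 4-point code: {code}")
    # Shift by one so that each label keeps its meaning on the 5-point scale.
    return code + 1


# With the recoding, every label is preserved.
for code, label in FOUR_POINT_LABELS.items():
    assert FIVE_POINT_LABELS[harmonize_four_point(code)] == label

# Without the recoding (the 2017 situation), a raw 4-point code is read
# against the 5-point labels and the meaning shifts.
raw_code = 1  # "At least once a week" on the 4-point scale
misread_label = FIVE_POINT_LABELS[raw_code]
```

In this sketch, `misread_label` is "Daily" although the respondent answered "At least once a week", which is the mislabeling described for the 2017 data.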


Please find all sample-specific questionnaires of this year and all questionnaires of previous years on this site.

1) SOEP-IS 2019 – PPFAD: Person-related Meta-dataset

2) SOEP-IS 2019 – PBRUTTO: Person-related Gross File

3) SOEP-IS 2019 – HBRUTTO: Household-related Gross File

4) SOEP-IS 2019 – PGEN: Person-related Status and Generated Variables

5) SOEP-IS 2019 – HGEN: Household-related Status and Generated Variables

6) SOEP-IS 2019 – PHRF: Weights for Persons

7) SOEP-IS 2019 – HHRF: Weights for Households

8) SOEP-IS 2019 – P: Variables from the Individual Question Module

9) SOEP-IS 2019 – H: Variables from the Household Question Module

10) SOEP-IS 2019 – BIO: Variables from the Life Course Question Module

11) SOEP-IS 2019 – BIOPAREN: Biography Information on the Parents

12) SOEP-IS 2019 – BIOAGE: Variables from the Modules of Questions on Children

13) SOEP-IS 2019 – BIOBIRTH: Birth Biography of Female and Male Respondents

14) SOEP-IS 2019 – KID: Pooled Dataset on Children

15) SOEP-IS 2019 – INNO: Variables from the Innovation Modules

16) SOEP-IS 2019 – INNO_H: Household-Variables from the Innovation Modules

17) SOEP-IS 2019 – IBIP_PARENT: Variables from Bonn Intervention Panel (parents)

18) SOEP-IS 2019 – IBIP_PUPIL: Variables from Bonn Intervention Panel (children)

19) SOEP-IS 2019 – ILANGUAGE: Variables from Innovative Language Modules

20) SOEP-IS 2019 – ILOTTERY: Variables from an Innovative Lottery Experiment in 2016

21) SOEP-IS 2019 – IDRM: Person-related Data from Innovative DRM Module

22) SOEP-IS 2019 – IDRM_ESM: Person-related DRM Data from Innovative ESM Module

23) SOEP-IS 2019 – IESM: Person-related ESM Data from Innovative ESM Module

24) SOEP-IS 2019 – IRISK: Decision from Description vs. Decision from Experience

25) SOEP-IS 2019 – COGNIT: Cognitive Achievement Potentials

26) SOEP-IS 2019 – INTV: Variables about the interviewers