General
Important note: The weighting for the 2022 survey year is based on the previous year's marginal distribution. The updated marginal distributions of the Federal Statistical Office of Germany for 2022 were not yet available to us at the time of the data release. We therefore ask you to apply the weighting for 2022 with caution. As soon as we have all the necessary data, we will update the weighting and make it available.
In recent years, the data set structure in the SOEP-IS has differed in many respects from that in the SOEP-Core study. In order to ensure high comparability of the data across studies, some datasets were newly added to the SOEP-IS, others were renamed or removed (whereby the variables were moved to other datasets). This resulted in the following specific changes:
Renamed datasets:
- bio -> biol
- h -> hl
- ppfad -> ppathl
- kid -> kidlong
- p -> pl
New datasets:
- hpath
- hpathl
- ppath
Deleted datasets:
- hhrf -> variables moved to hpathl
- phhrf -> variables moved to ppathl
To make it easier to work with SOEP-IS data together with Core data, fundamental changes were also made to the variable names. Where possible, these are now based on the variable names used in Core. In addition, versioning and harmonization of variables that have changed over the years were implemented. This improvement allows a better understanding of adjustments to the questionnaire and their effects on the data.
bioparen
The “bioparen” dataset contains generated biographical information about the respondents' parents. Until 2020, the dataset was generated every year; from 2021, generation was suspended indefinitely. The information on the respondents' parents can still be found in the biographical dataset, but in the form of simple survey data.