The Socio-Economic Panel (SOEP) is a representative, multi-cohort survey that has been running since 1984. Every year, individuals in households throughout Germany are surveyed by our survey institute on behalf of DIW Berlin. These respondents provide information on topics such as their income, employment history, education, and health. Because the same people are surveyed every year, it is possible to track long-term psychological, economic, societal, and social developments. To keep pace with changes in society, random samples are added regularly and the survey is adapted accordingly.
Title: Socio-Economic Panel (SOEP), data from 1984-2021, EU Edition
DOI: 10.5684/soep.core.v38eu
Collection period: 1984-2021
Principal investigators: Jan Goebel, Markus M. Grabka, Carsten Schröder, Sabine Zinn, Charlotte Bartels, Mattis Beckmannshagen, Andreas Franken, Martin Gerike, Florian Griese, Christoph Halbmeier, Selin Kara, Peter Krause, Elisabeth Liebau, Jana Nebelin, Marvin Petrenz, Sarah Satilmis, Rainer Siegers, Hans Walter Steinhauer, Felix Süttmann, Knut Wenzig, Stefan Zimmermann
Contributor: infas Institut für angewandte Sozialwissenschaft GmbH (Data Collector)
Population: Persons living in private households in Germany
Number of households: 19,032
Number of individuals: 32,050 + 3,476 children
Special samples: Citizens of the GDR (1990), Immigration/Migration (1994/95, 2013, 2015, 2020), Refugees (since 2016). See the chapter SOEP-Samples in Detail on the SOEPcompanion for a description of all our samples.
Selection method: All SOEP samples are regionally clustered, multi-stage random samples. The respondents (households) are selected by random walk or from a register sample.
Collection Mode: The interview methodology of the SOEP is based on a set of pre-tested questionnaires for households and individuals. In principle, an interviewer tries to obtain face-to-face interviews with all members of a given survey household aged 12 years and over. In addition, one person (the head of household) is asked to answer a household questionnaire covering housing, housing costs, and different sources of income. This questionnaire also includes some questions on children in the household up to 17 years of age, mainly concerning attendance at institutions (kindergarten, elementary school, etc.).
Citation of the Data Set: Socio-Economic Panel (SOEP), data for years 1984-2021, SOEP-Core v38, EU Edition, 2023, doi:10.5684/soep.core.v38eu
If you don't exclude observations from the Migration Samples in your analysis, please also cite as follows:
IAB-SOEP Migration Samples (M1, M2), data of the years 2013-2021, DOI: 10.5684/soep.iab-soep-mig.2021
If you don't exclude observations from the Refugee Samples in your analysis, please also cite as follows:
IAB-BAMF-SOEP Survey of Refugees (M3-M5), data of the years 2016-2021, DOI: 10.5684/soep.iab-bamf-soep-mig.2021
Publications using this file should refer to the above DOI and cite the following reference. Find an explanation on the usage of DOI here.
For the SOEP-Core data 1984-2021 (v38) - waves A to BL - we provide the following editions:
soep.core.v38eu (EU Edition, 100%)
soep.core.v38i (International Scientific Use Version, 95%)
soep.core.v38t (Teaching Edition, 50%)
soep.core.v38at (Add-on: Area types)
soep.core.v38pr (Add-on: Planning regions)
soep.core.v38r (Remote Edition)
soep.core.v38o (Onsite Edition)
For detailed information on the different data editions, see SOEPcompanion.
These datasets are included in SOEP v38, but are also available as individual data sets upon request:
soep.iab-soep-mig.2021 (Migration Sample)
soep.iab-bamf-soep-mig.2021 (Refugee Sample)
Datasets gkal and lkal
For large datasets like pl, we recommend using Stata/MP or Stata/SE on a computer with 16 GB of memory (RAM).
Users can still work with the data in Stata/IC or on less powerful computers; to make this practical, SOEP offers alternative data formats for pl.
If your system does not meet these requirements and you wish to order an alternative format for pl (e.g., pl as separate year or decade datasets), please submit your request via the
[order form](https://www.diw.de/de/diw_01.c.357906.de/soep_bestellformular_mod.html) or contact the SOEP hotline by phone or e-mail.
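For users who have already exported pl to a row-based format, splitting it by decade can be sketched roughly as below. This is only an illustration of the idea, not the format SOEP delivers on request: the function name `split_by_decade`, the sample records, and the assumption that the survey year is stored in a column called `syear` are all hypothetical here.

```python
from collections import defaultdict


def split_by_decade(rows, year_key="syear"):
    """Group rows into buckets keyed by the first year of their decade.

    `year_key` is an assumed column name for the survey year.
    """
    buckets = defaultdict(list)
    for row in rows:
        decade = (int(row[year_key]) // 10) * 10
        buckets[decade].append(row)
    return dict(buckets)


# Hypothetical sample records standing in for rows of pl.
sample = [
    {"pid": 1, "syear": 1984},
    {"pid": 1, "syear": 1991},
    {"pid": 2, "syear": 1999},
    {"pid": 2, "syear": 2021},
]
by_decade = split_by_decade(sample)
```

Each bucket can then be written to its own file and loaded separately, which keeps the memory needed at any one time well below that of the full dataset.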
In the time use variables, both -2 and 0 could occur in the data, and both values meant "does not apply". All -2 values were therefore recoded to 0, since the questionnaire design expects a 0 for "does not apply".
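The recode described above can be illustrated with a small sketch. It is not the SOEP team's actual procedure: the variable names `hours_housework` and `hours_childcare`, the helper `recode_does_not_apply`, and the sample records are hypothetical, and the real variable names should be taken from the dataset documentation.

```python
# Hypothetical time-use variable names, used only for illustration.
TIME_USE_VARS = ("hours_housework", "hours_childcare")


def recode_does_not_apply(record, variables=TIME_USE_VARS):
    """Return a copy of `record` with -2 replaced by 0 in the given variables.

    Other values, including other negative codes, are left unchanged.
    """
    cleaned = dict(record)
    for name in variables:
        if cleaned.get(name) == -2:
            cleaned[name] = 0
    return cleaned


# Hypothetical sample records.
rows = [
    {"pid": 1, "hours_housework": -2, "hours_childcare": 3},
    {"pid": 2, "hours_housework": 0, "hours_childcare": -2},
    {"pid": 3, "hours_housework": 2, "hours_childcare": -1},
]
cleaned_rows = [recode_does_not_apply(r) for r in rows]
```

The sketch touches only the listed variables, so a -2 in any other variable keeps its original value.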
imonth, iday, ihour and iminute for 2021 were moved to the INSTRUMENTATION dataset, where they can be found from this version on.
The ZIP files now contain a folder soepdata, which in turn contains the folders eu-silc-like-panel and raw. This makes it easier to refer to the folders in the documentation. We previously sometimes called the soepdata folder the "toplevel folder" or "./", which was less informative for our users.
More than 40 datasets were previously stored both in the former toplevel folder (see above) and in the raw folder. You will now find them exclusively in the new `soepdata` folder.
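Scripts that reference the old layout may need their paths adjusted. A minimal sketch of building paths for the new layout follows; the helper `dataset_path`, the extraction folder `v38`, the dataset name `example`, and the `.dta` file suffix are assumptions made for illustration, and only the folder names `soepdata`, `raw`, and `eu-silc-like-panel` come from the description above.

```python
from pathlib import PurePosixPath


def dataset_path(extract_root, name, subfolder=None, suffix=".dta"):
    """Build the path to a dataset inside the soepdata folder.

    `subfolder` may be None (dataset sits directly in soepdata),
    "raw", or "eu-silc-like-panel". The suffix is an assumption.
    """
    base = PurePosixPath(extract_root) / "soepdata"
    if subfolder is not None:
        base = base / subfolder
    return base / f"{name}{suffix}"


# "v38" is a hypothetical folder the ZIP file was extracted into.
pl_file = dataset_path("v38", "pl")
raw_file = dataset_path("v38", "example", subfolder="raw")
```

Keeping the folder logic in one helper means only a single place needs to change if the layout is reorganized again.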
The following data sets are still at the V37 level and have not been updated. We will update them as far as possible with the next release of the data:
|PEQUIV||CNEF Equivalent File|
The following data sets are still at the V36 level and have not been updated. We will update them as far as possible with the next release of the data:
|REFUGSPELL||Migration History for Refugees|
|BIOJOB||First and last Job|
|BIOPAREN||SES of Parents|
Individual (PAPI) 2021: -de -en
Household (PAPI) 2021: -de -en
Biography (PAPI) 2021: -de -en
Catch-up Individual (PAPI) 2021: -de -en
Youth (16-17-year-olds, PAPI) 2021: -de -en
Early Youth (13-14-year-olds, PAPI) 2021: -de -en
Pre-teen (11-12-year-olds, PAPI) 2021: -de -en
Mother and Child (Newborns, PAPI) 2021: -de -en
Mother and Child (2-3-year-olds, PAPI) 2021: -de -en
Mother and Child (5-6-year-olds, PAPI) 2021: -de -en
Parents and Child (7-8-year-olds, PAPI) 2021: -de -en
Mother and Child (9-10-year-olds, PAPI) 2021: -de -en
Deceased Individual (PAPI) 2021: -de -en
Please find all sample-specific questionnaires of this year and all questionnaires of previous years on this site.
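15) Coding of open-ended responses on occupational activity according to the International Standard Classification of Occupations 2008 (ISCO08) - direct coding - procedure and decision rules for ambiguous responses (German title: Die Vercodung der offenen Angaben zur beruflichen Tätigkeit nach der International Standard Classification of Occupations 2008 (ISCO08) - Direktvercodung - Vorgehensweise und Entscheidungsregeln bei nicht eindeutigen Angaben)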
All documentation for filtering can be found on this page