Das Sozio-oekonomische Panel (SOEP) ist eine repräsentative Wiederholungsbefragung, die bereits seit 1984 läuft. Im Auftrag des DIW Berlin werden jedes Jahr Personen aus Haushalten in ganz Deutschland von unserem Erhebungsinstitut befragt. Die Daten geben Auskunft zu Fragen über Einkommen, Erwerbstätigkeit, Bildung oder Gesundheit. Weil jedes Jahr die gleichen Personen befragt werden, können langfristige soziale und gesellschaftliche Trends besonders gut verfolgt werden. Zur adäquaten Erfassung des gesellschaftlichen Wandels werden immer wieder Stichproben implementiert, sowie eine Anpassung des Erhebungsprogramms vorgenommen.
Titel: Sozio-oekonomisches Panel, Daten der Jahre 1984-2019 (SOEP-Core, v36, Add-on: Area types)
DOI Zur Erklärung von DOI und dessen Verwendung gibt es hier Informationen . : 10.5684/soep.core.v36at
Erhebungszeitraum: 1984-2019
Veröffentlichungsdatum: 31.03.2021
PrimärforscherInnen: Stefan Liebig, Jan Goebel, Markus Grabka, Carsten Schröder, Sabine Zinn, Charlotte Bartels, Alexandra Fedorets, Andreas Franken, Martin Gerike, Florian Griese, Jannes Jacobsen, Selin Kara, Johannes König, Peter Krause, Hannes Kröger, Elisabeth Liebau, Maria Metzing, Jana Nebelin, Marvin Petrenz, David Richter, Paul Schmelzer, Christian Schmitt, Jürgen Schupp, Daniel Schnitzlein, Rainer Siegers, Hans Walter Steinhauer, Knut Wenzig, Stefan Zimmermann
Datenerhebung: Kantar Deutschland GmbH
Population: Personen in Privathaushalten in der Bundesrepulik Deutschland.
Besondere Stichproben: BürgerIn der DDR (1990), Zuwanderung/Migration (1994/95, 2013, 2015), Geflüchtete (seit 2016). Eine ausführliche Beschreibung aller Stichproben können Sie im SOEPcompanion unter SOEP-Samples in Detail nachlesen.
Auswahlverfahren: Alle Samples des SOEP werden mittels mehrstufiger Stichprobenziehungen, die regional gebündelt sind, gezogen. Die Befragten (Haushalte) werden per random-walk oder per Registerstichprobe ausgesucht.
Erhebungsverfahren: Die Methode der Datenerhebung des SOEP basiert auf einem Set von Fragebögen sowohl für die Haushalte als auch für die Individuen. Prinzipiell versucht die interviewende Person face-to-face-Interviews mit allen Haushaltsmitgliedern durchzuführen, die im Befragungsjahr 12 Jahre alt werden oder älter sind. Zusätzlich wird eine Person (Haushaltsvorstand) gebeten, einen Haushaltsfragebogen zu beantworten. Dort werden Fragen zu Wohnsituation und -kosten, verschiedenen Einkommensquellen sowie Fragen zu im Haushalt lebenden Kindern unter 12 Jahren (z.B. Besuch des Kindergartens, der Grundschule etc.) gestellt.
Zitation der Daten: Sozio-oekonomisches Panel, Daten der Jahre 1984-2019 (SOEP-Core, v36, Add-on: Area types). 2021. DOI: 10.5684/soep.core.v36at
Wenn Sie bei Ihrer Analyse nicht die Fälle der Migrations-Stichproben ausschliessen, dann zitieren Sie bitte auch:
IAB-SOEP-Migrationsstichproben (M1, M2), Daten der Jahre 2013-2019, DOI: 10.5684/soep.iab-soep-mig.2019
Wenn Sie bei Ihrer Analyse nicht die Fälle der Geflüchteten-Stichproben ausschliessen, dann zitieren Sie auch bitte auch: IAB-BAMF-SOEP-Befragung Geflüchteter (M3-M5), Daten der Jahre 2016-2019, DOI: 10.5684/soep.iab-bamf-soep-mig.2019
Zusammenfassung: Das SOEP bietet zusätzliche Datensätze an, die die EU Edition um Gemeindegrößenklassen erweitern können. Der Zugriff auf diese Dateien ist eingeschränkter, da sie den Nutzern mehr sensible Informationen über die Befragten liefern. Weitere Informationen liefert das Kapitel Editions im SOEPcompanion.
In Publikationen, die diese Datei verwenden, soll auf die oben genannte DOI Zur Erklärung von DOI und dessen Verwendung gibt es hier Informationen . verwiesen und folgende Referenzen zitiert werden:
Wenn Sie bei Ihrer Analyse nicht die Fälle der Migrations-Stichproben ausschliessen, dann zitieren Sie bitte auch:
Wenn Sie bei Ihrer Analyse nicht die Fälle der Geflüchteten-Stichproben ausschliessen, dann zitieren Sie bitte auch: IAB-BAMF-SOEP-Befragung Geflüchteter (M3-M5), Daten der Jahre 2016-2021,
Wenn Sie bei Ihrer Analyse die Daten der SOEP-LEE2 Befragungen nutzen, dann zitieren Sie bitte auch:
Wenn Sie bei Ihrer Analyse noch fachspezifischer referieren möchten, dann zitieren Sie bitte auch:
Für die SOEP-Core-Daten 1984-2019 (v36) - Wellen A bis BJ - stehen folgende Datensätze zur Verfügung:
soep.core.v36eu (EU Edition, 100%)
soep.core.v36i (International Scientific Use Version, 95%)
soep.core.v36t (Teaching Edition, 50%)
soep.core.v36at (Add-on: Area types)
soep.core.v36pr (Add-on: Planning regions)
soep,core.v36r (Remote Edition)
soep.core.v36o (Onsite Edition)
Ausführliche Informationen zu allen Editionen sind auf dem SOEPcompanion zu finden.
In der aktuellen Datenweitergabe komplett enthalten, auf spezielle Anfrage auch als Einzeldatensatz erhältlich:
soep.iab-soep-mig.2019 (Migrationsstichproben)
soep.iab-bamf-soep-mig.2019 (Geflüchtetenstichproben)
New Sample P
“Top Shareholder Sample”: Sample P was conceptualized as a sample of highly affluent households in Germany. Against the backdrop of the increasing income and wealth inequality in Germany over recent decades, despite economic growth, there has been a growing need for data on wealthy populations in the social sciences. Sample P was created to improve the empirical base for the German government’s poverty and wealth report and to lay the foundation for medium- and long-term cross-sectional and longitudinal analysis. The gross sample consisted of 23,259 households.
New Sample Q
“LGB* Sample”: Sample Q is a boost sample of a hard-to-survey population: lesbians, gays, bisexuals, transgender people, and those who identify as non-binary. While the actual percentage of LGBTQ+ people in the general population is unknown, this group was too scarcely represented in the SOEP to allow for meaningful analysis. 835 households were recruited through an approximately 9-month-long telephone screening process. Of these households, 477 participated in the survey between April and November.
Dataset BIOL - Variables on recognition of occupational qualifications in samples M3-M5 and the CAMCES module (identifiable in the variable label by the abbreviation AA/AAC) were corrected. The slightly different biographical questionnaire for samples M1-M2 is no longer used, and variables on migration history have been added to the SOEP-Core biographical questionnaire, which is now used for all samples. The variables have been integrated, versioned, and harmonized in biol accordingly. The religious affiliation of the father and mother has been reversioned and harmonized to include the response option “konfessionslos” (no religious affiliation). Additional variables with occupational codes have been added. Some variables at the federal state level were included as East-West variables with the suffix _ew. Since bioresid and biosoc will no longer be part of the data distribution, some data processing steps for the variables in these datasets have been included in the versioning and harmonization routines for biol.
Dataset PL - Variables on balance of assets (identifiable in variable label by abbreviation VB) have been corrected, re-sorted, and labeled. Inheritance variables have been re-versioned. Religious background has been re-versioned and harmonized to include the response option “konfessionslos” (no religious affiliation).
Dataset HBRUTTO - Some regional variables at the federal state level have been included as East-West variables with the suffix _ew. New variables on incentive type, incentive model, and variables describing screening process for the LGB sample have been added. Residential environment variables (wum) will no longer be included in the survey starting in 2019.
Dataset PBRUTTO - Some regional variables at the federal state level have been included as East-West variables with the suffix _ew. New variables on DRV record linkage and IAB record linkage have been added. Variables have been added indicating which questionnaire was used.
Dataset JUGENDL - Since bioage17 will no longer be part of the data distribution, some of the data processing steps for bioage17 variables have been included in the versioning and harmonization routines for jugendl.
Dataset KIDLONG - k_nrkid has been corrected to count only 16-year-old children in the household. Households with children without a stated birth year have been assigned a missing value of -1. - bgk93_r/kd_cty_r included incorrect values and has been corrected.
Dataset PLUECKEL - lpid has been removed.
Dataset HBRUTT - Some regional variables at the federal state level have been included as East-West variables with the suffix _ew.
Due to changes in data protection and privacy law, variables containing information on Germany’s federal states (Bundesländer) may not be transmitted to recipients outside the European Union. We have developed a new concept with different editions for the different data access procedures resulting from the change in law (listed in ascending order by the amount of information contained in each edition):
The default edition that we transmit to European users by sending them a personalized download link is the EU Edition. Some datasets may not be available in more restricted editions. If variables are not available in a more restricted edition, they are recoded to -7, a new missing value labeled “only available in less restricted edition”.
Dataset BIOREGION
Dataset BIORESIDREFING
Dataset MORE_DOCU
Dataset MORE_LOCAL
1. REGIONL
2. BIOIMMIG
Originally, BIOIMMIG was generated by appending each new wave of data to the data from the previous years. This practice bears potential for errors since the SOEP includes a large number of variables that need to be comparable over time. In order to minimize this potential for error, v36 of BIOIMMIG is the first version of the dataset that has been generated using longitudinal data.
Variable bireason “Main reason for moving to Germany”
Variable biscger “Attended school in Germany”
There has been a significant increase in cases due to the addition of further variables and the new migration questions from the biographical questionnaire. The variable “country of last school attendance” was only included as an indicator from wave bd (2013) onwards to generate the variable biscger. The corresponding long variable, lb0186_v1, also has values for the years 2001-2012.
Variable bicamp “Refugee residence Y, N”
Due to two newly added variables (lr3440, plm0679), there are significantly more cases with a value of "2 No".
Variables birelh[p|gp|c|sb|sh|dr|fr] “Family in Country of Origin”
Previously, in the generation process, it was not defined whether these variables should represent country of origin or country of residence or both. In the previous versions, the two were arbitrarily merged. The decision to include only the country of origin leads to a significant reduction of cases.
Variable birelhc2 "Underage children not in Germany”
Due to two newly added years (1997, 1999), there are significantly more cases with the value "2 No".
Variable biwfam "Already had family in country”
The variable biwfam was changed from a category (y/n) to a binary variable, since otherwise there would be a distortion of the content. The data generated previously were too imprecise, since the recoding to value “2 No” does not unambiguously exclude cases in which respondents have no family members in Germany.
For a closer look at the changes and variables used, see the bioimmig documentation.
3. HGEN
Variable hgtyp2hh in hgen
Datasets BIOSOC, BIORESID, BIOAGE17
The datasets biosoc, bioage17, and bioresid will no longer be provided. Most of the information from biosoc and bioresid will be maintained in the biol dataset with different variable names. In jugendl, the variables from bioage17 are retained. In order to reduce the number of datasets and to avoid redundant information, we decided to include the variables from biosoc and bioresid in biol and bioage17 in jugendl. The generated data from biosoc, bioage17, and bioresid are reproduced in the best possible way in biol and jugendl by applying versioning and harmonization. Users who have used biosoc, bioage17, or bioresid should use this table to facilitate transition.
The datasets with the suffixes mig and refugees—for instance, bep_mig and bgp_refugees—are no longer available. This information from the migration and refugee samples is fully integrated into the associated “raw” and “long” files.
Individual (PAPI) 2019: Field-de Field-en
Individual (CAPI) 2019: Var-de
Household (PAPI) 2019: Field-de Field-en
Household (CAPI) 2019: Var-de
Biography (PAPI) 2019: Field-de Field-en
Biography (CAPI) 2019: Var-de
Catch-up Individual (PAPI) 2019: Field-de
Catch-up Individual (CAPI) 2019: Var-de
Youth (16-17-year-olds, PAPI) 2019: Field-de
Youth (16-17-year-olds, CAPI) 2019: Var-de
Early Youth (13-14-year-olds, PAPI) 2019: Field-de
Early Youth (13-14-year-olds, CAPI) 2019: Var-de
Pre-teen (11-12-year-olds, PAPI) 2019: Field-de
Pre-teen (11-12-year-olds, CAPI) 2019: Var-de
Mother and Child (Newborns, PAPI) 2019: Field-de
Mother and Child (Newborns, CAPI) 2019: Var-de
Mother and Child (2-3-year-olds, PAPI) 2019: Field-de
Mother and Child (2-3-year-olds, CAPI) 2019: Var-de
Mother and Child (5-6-year-olds, PAPI) 2019: Field-de
Mother and Child (5-6-year-olds, CAPI) 2019: Var-de
Parents and Child (7-8-year-olds, PAPI) 2019: Field-de
Parents and Child (7-8-year-olds, CAPI) 2019: Var-de
Mother and Child (9-10-year-olds, PAPI) 2019: Field-de
Mother and Child (9-10-year-olds, CAPI) 2019: Var-de
Deceased Individual (PAPI) 2019: Field-de
Deceased Individual (CAPI) 2019: Var-de
Alle Sample-spezifischen Fragebögen dieses Jahres und alle Fragebögen der vorherigen Befragungsjahre finden Sie auf dieser Seite
3) SOEP-Core – 2019: Design, Nonresponse, and Weighting in the Sample Q (Queer)
5) SOEP-Core – 2019: Sampling, Nonresponse, and Weighting in the Sample P
6) SOEP-Core v36 – PPATHL: Person-Related Meta-Dataset
8) SOEP-Core v36 – HPATHL: Household-Related Meta-Dataset
9) SOEP-Core v36 – INTERVIEWER
10) SOEP-Core v36 – PBRUTTO: Person-Related Gross File
11) SOEP-Core v36 – HBRUTTO: Household-Related Gross File
12) SOEP-Core v36 – PGEN: Person-Related Status and Generated Variables
13) SOEP-Core v36 – HGEN: Household-Related Status and Generated Variables
17) SOEP-Core v36 – BIOPAREN: Biography Information for the Parents of SOEP-Respondents
19) SOEP-Core v36 – BIOSIB: Information on Siblings in the SOEP
21) SOEP-Core v36 – PFLEGE: Documentation of Generated Person-level Long-term Care Variables
22) SOEP-Core v36 – LIFESPELL: Information on the Pre- and Post-Survey History of SOEP-Respondents
1) Handgreifkraftmessung im Sozio-oekonomischen Panel (SOEP) 2006 und 2008
2) The new IAB-SOEP Migration Sample: an introduction into the methodology and the contents
3) The Request for Record Linkage in the IAB-SOEP Migration Sample
5) The Measurement of Labor Market Entries with SOEP Data: Introduction to the Variable EINSTIEG_ARTK
6) Job submission instructions for the SOEPremote System at DIW Berlin – Update 2014
7) SOEP 2015 – Informationen zu den SOEP-Geocodes in SOEP v32
9) Die Vercodung der offenen Angaben zu den Ausbildungsberufen im Sozio-Oekonomischen Panel
10) Das Studiendesign der IAB-BAMF-SOEP Befragung von Geflüchteten
11) Scales Manual IAB-BAMF-SOEP Survey of Refugees in Germany – revised version
12) SOEP 2010 – Preparation of data from the new SOEP consumption module: Editing, imputation, and smoothing
13) SOEP Scales Manual (updated for SOEP-Core v32.1)
17) Multi-Itemskalen im SOEP Jugendfragebogen
20) SOEP-CoV: Project and Data Documentation
22) SOEP 2013 – Documentation of Generated Person-Level Long-Term Care Variables in PFLEGE
23) SOEP-Core v34 – PFLEGE: Documentation of Generated Person-level Long-term Care Variables
26) SOEP-Core v36: Codebook for the EU-SILC-like panel for Germany based on the SOEP
Alle Dokumentationen zum Filtern finden Sie auf dieser Seite