The German Socio-Economic Panel Study (SOEP) is a wide-ranging, representative longitudinal study of private households, located at the German Institute for Economic Research, DIW Berlin. Every year, nearly 11,000 households and more than 20,000 persons are sampled by the fieldwork organization TNS Infratest Sozialforschung. The data provide information on all household members, consisting of Germans living in the old and new German states, foreigners, and recent immigrants to Germany. The panel was started in 1984. Topics include household composition, occupational biographies, employment, earnings, health, and satisfaction indicators.
As early as June 1990, even before the Economic, Social and Monetary Union, SOEP expanded to include the states of the former German Democratic Republic (GDR), thus seizing the rare opportunity to observe the transformation of an entire society. An immigrant sample was added as well to account for the changes that took place in German society in 1994/95. Further new samples were added in 1998, 2000, 2002, and 2006. The survey is constantly being adapted and developed in response to current social developments.
Title: Sozio-oekonomisches Panel (SOEP), data for the years 1984 – 2007
Primary investigators: Gert G. Wagner, Joachim R. Frick, Jürgen Schupp, Silke Anger, Jan Goebel, Markus M. Grabka, Olaf Groh-Samberg, Elke Holst, Peter Krause, Martin Kroh, Henning Lohmann, Rainer Pischner, Christian Schmitt, C. Katharina Spieß, Martin Spieß
Data collection: TNS Infratest Sozialforschung GmbH
Population: Persons in private households in the Federal Republic of Germany
Selection method: All SOEP samples are drawn using multi-stage, regionally clustered sampling. The respondents (households) are selected by random walk.
Survey method: SOEP data collection is based on a set of questionnaires for both households and individuals. In principle, an interviewer attempts to conduct face-to-face interviews with all household members aged 16 or older. In addition, one person (the head of household) is asked to answer a household questionnaire, which includes questions on housing, costs, and various sources of income, as well as questions about children under 16 living in the household (e.g. attendance at kindergarten, primary school, etc.).
|Number of units||61,544|
|Number of variables||39,550 in 297 datasets|
|Data formats||STATA, SPSS, SAS, CSV|
Publications using this file should refer to the above DOI and cite one of the following references.
The 2008 data distribution (1984-2007) provides, for the year 2007, the usual wave-specific data XPBRUTTO, XP, XPKAL, XPGEN, XHBRUTTO, XH, XHGEN, XKIND and WPLUECKE as well as the updated files with a longitudinal component (PFAD files, biography files, spell data and weighting factors).
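The wave-specific file names above carry a one-letter wave prefix: the distribution covers 1984-2007 as waves A-X, and the 2007 files start with X. The following minimal sketch assumes that the letter simply advances by one per survey year and that the `$` in names such as `$PEQUIV` stands for that letter. The function names `wave_letter` and `expand_dataset_name` are hypothetical helpers, not part of any SOEP tooling.

```python
FIRST_YEAR = 1984  # wave A (assumption: one letter per survey year)
LAST_YEAR = 2007   # wave X, the latest wave in this distribution


def wave_letter(year: int) -> str:
    """Return the assumed one-letter wave prefix for a survey year."""
    if not FIRST_YEAR <= year <= LAST_YEAR:
        raise ValueError(f"year {year} is outside {FIRST_YEAR}-{LAST_YEAR}")
    return chr(ord("A") + year - FIRST_YEAR)


def expand_dataset_name(template: str, year: int) -> str:
    """Replace the '$' placeholder in a dataset name with the wave letter."""
    return template.replace("$", wave_letter(year))


print(wave_letter(2007))                     # X
print(expand_dataset_name("$PEQUIV", 2007))  # XPEQUIV
```

Not every dataset whose name begins with a letter follows this pattern (longitudinal files such as the PFAD or biography files do not), so the helper is only meant for names written with the `$` placeholder.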
In the survey year 2006, a representative supplementary sample for all of Germany was added: refreshment sample H. Biographical background information was collected from respondents in sample H for the first time in 2007. These data have been fully integrated into all relevant biography files (BIOxxxx).
As part of the SOEP innovations projects, TNS Infratest Sozialforschung conducted a postal survey in December 2006 among former SOEP panel members from households that had been classified as final refusals in 2001-2004. As a byproduct, we could change the information on year of birth from missing to a valid value for 21 of these persons (more information can be found in the executive summary of the TNS Infratest Methodenbericht (PDF, 36.18 KB)).
Furthermore the following additions and modifications have been made:
A. New and Renamed Datasets
In the 2006 survey year, for the first time, short cognitive tests were carried out with a subsample of the SOEP. The goal was to employ a robust set of instruments that could be administered easily by trained interviewers in just a few minutes. Close to 80% of all persons chosen for participation in the cognitive test provided valid answers. Thus, for the first time, the SOEP now contains indicators of cognitive potential for more than 5,500 persons, along with diverse educational information based on degrees and certifications. The first repeat of the test is planned for the 2010 survey year. Detailed documentation and selection analyses can be found in Schupp et al. (2008), Erfassung kognitiver Leistungspotentiale Erwachsener im Sozio-oekonomischen Panel (SOEP), DIW Berlin, Data Documentation 32 (PDF, 447.63 KB).
PBR_EXIT and PBR_HHCH:
These two datasets replace the former dataset YPBRUTTO; this year, however, both variants are available.
Multiply imputed dataset on monthly net household income for the years 1996 to 2007. The dataset is stored in long format (identified by hhnrakt, svyyear, and mj; also called mim format in Stata). Each item non-response on net household income was imputed 10 times. More information can be found in HGEN.pdf (PDF, 0.64 MB).
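An analysis of multiply imputed data usually estimates the quantity of interest once per imputation and then combines the results with Rubin's rules. The sketch below does this for a mean of net household income in one survey year. It assumes that `mj` numbers the imputations and that each row is a tuple `(hhnrakt, svyyear, mj, income)`. The `income` field and the function name `pool_mean_income` are hypothetical. Survey weights and the clustered sample design are ignored here, so the variance is only illustrative.

```python
from collections import defaultdict
from statistics import mean, variance


def pool_mean_income(rows, svyyear):
    """Pool the mean income across imputations with Rubin's rules.

    rows: iterable of (hhnrakt, svyyear, mj, income) tuples in long format.
    Returns (pooled_mean, total_variance).
    """
    # Group income values by imputation number for the requested year.
    by_imputation = defaultdict(list)
    for _hhnrakt, year, mj, income in rows:
        if year == svyyear:
            by_imputation[mj].append(float(income))

    estimates = []  # one mean per imputation
    within = []     # sampling variance of each mean (simple random sampling)
    for mj in sorted(by_imputation):
        values = by_imputation[mj]
        estimates.append(mean(values))
        within.append(variance(values) / len(values))

    m = len(estimates)
    pooled = mean(estimates)
    w = mean(within)                               # within-imputation variance
    b = variance(estimates) if m > 1 else 0.0      # between-imputation variance
    total_variance = w + (1 + 1 / m) * b           # Rubin's total variance
    return pooled, total_variance
```

With the 10 imputations described above, `m` would be 10 for every survey year between 1996 and 2007.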
B. New Variables
B.1 Dataset XPBRUTTO
B.2 Dataset $PEQUIV
B.3 Dataset $HGEN
C. Revised Variables
C.1 In the Dataset $PKAL
C.2 In the Dataset HHRF/PHRF
However, the weighting factors for the year 2007 are also based on the newest available microcensus benchmark data, from 2006; they are therefore only provisional with regard to the figures given for households and individuals in Germany.
C.3 In the Dataset $PGEN
D. Error Updates
D.1 In the Dataset VH and WH
Variable Label: Owner Of The Dwelling
|Value||Wrong||Correct|
|-2||Does not apply||Does not apply|
|-1||No answer||No answer|
|1||Self Owned Res. Property||Local Govt. Apt.|
|2||Local Govt. Apt.||Co-Operative Apt.|
|3||Co-Operative Apt.||Company Apt.|
|4||Company Apt.||Private Owner|
|5||Private Owner||Do Not Know|
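Analyses that stored the label text rather than the numeric code can be repaired by going back through the code. The sketch below copies both label sets from the table above and translates a wrongly decoded label into the correct one. The function name `fix_label` is a hypothetical helper, and the variable's name in VH and WH is not given here, so none is assumed.

```python
# Labels as shipped in the faulty distribution (column "Wrong" above).
WRONG_LABELS = {
    -2: "Does not apply",
    -1: "No answer",
    1: "Self Owned Res. Property",
    2: "Local Govt. Apt.",
    3: "Co-Operative Apt.",
    4: "Company Apt.",
    5: "Private Owner",
}

# Labels after the correction (column "Correct" above).
CORRECT_LABELS = {
    -2: "Does not apply",
    -1: "No answer",
    1: "Local Govt. Apt.",
    2: "Co-Operative Apt.",
    3: "Company Apt.",
    4: "Private Owner",
    5: "Do Not Know",
}

# Reverse lookup: wrong label text -> numeric code.
_CODE_BY_WRONG_LABEL = {label: code for code, label in WRONG_LABELS.items()}


def fix_label(wrong_label: str) -> str:
    """Map a label decoded with the faulty labels to the corrected label."""
    code = _CODE_BY_WRONG_LABEL[wrong_label]  # KeyError for unknown labels
    return CORRECT_LABELS[code]


print(fix_label("Self Owned Res. Property"))  # Local Govt. Apt.
```

The numeric codes themselves are unchanged by the correction; only the text attached to codes 1 to 5 shifts.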
D.2 In the dataset $PGEN
|Dec. 04, 2008||In the process of extensive checking, several problems were identified in the 1984-2007 data distribution currently available on DVD (waves A-X).
The corrected datasets are now available to be downloaded as a password-protected ZIP file from our homepage. To obtain download access to the corrected datasets, please send an e-mail to firstname.lastname@example.org or call the SOEPhotline at +49 30 89789 292.
To unzip the files, you will need the password for the current 1984-2007 data distribution, or the password used to access the expanded regional data in the GGKBOU dataset. If you do not have the current data distribution, please contact our hotline (email@example.com).
The fixed files are:
15) Coding of open-ended responses on occupational activity according to the International Standard Classification of Occupations 2008 (ISCO08) - direct coding - procedure and decision rules for ambiguous responses
All documentation for filtering can be found on this page