The 2008 data distribution (1984-2007) provides, for the year 2007, the usual wave-specific data XPBRUTTO, XP, XPKAL, XPGEN, XHBRUTTO, XH, XHGEN, XKIND and WPLUECKE as well as the updated files with a longitudinal component (PFAD files, biography files, spell data and weighting factors).
In the survey year 2006, a representative supplementary sample for all of Germany was added: refreshment sample H. Biographical background information has been collected from respondents in sample H for the first time in 2007. This data has been fully integrated into alle relevant biography files (BIOxxxx).
As part of the SOEP innovations projects TNS Infratest Sozialforschung conducted in December 2006 a postal survey among former SOEP panel members from households which had been classified as final refusals in 2001-2004. As a byproduct we could change the information on year of birth from missing to a valid value for 21 of these persons (more information can be found in the executive summaryexecutive summary of the TNS Infratest Methodenbericht).
Furthermore the following additions and modifications have been made:
A. New and Renamed Datasets
COGNIT06:
In the 2006 survey year, for the first time, short cognitive tests were carried out with a subsample of the SOEP. The goal was to employ a robust set of instruments that could be administered easily by trained interviewers in just a few minutes. Close to 80% of all persons chosen for participation in the cognitive test provided valid answers. Thus, for the first time, the SOEP now contains indicators of cognitive potentials for more than 5,500 persons, along with diverse educational information based on degrees and certifications. It is planned that the first repeat of the test will take place in the 2010 survey year. A detailed documentation and selection analyses can be found in Schupp et al. (2008) Erfassung kognitiver Leistungspotentiale Erwachsener im Sozio-oekonomischen Panel (SOEP), DIW Berlin, Data Documentation 32.
PBR_EXIT and PBR_HHCH:
These two datasets replace the former dataset YPBRUTTO, however this year both variants are available
MIHINC:
Multiple imputed dataset on monthly net household income for the years 1996 to 2007. The dataset is stored in long format (long format: hhnrakt, svyyear, mj, also called mim format within stata). Each item non-response on net household income was imputed 10 times. More information can be found in HGEN.pdf
B. New Variables
B.1 Dataset XPBRUTTO
B.2 Dataset $PEQUIV
B.3 Dataset $HGEN
C. Revised Variables
C.1 In the Dataset $PKAL
C.2 In the Dataset HHRF/PHRF
However, the weighting factors for the year 2007are also based on (newest available) microcensus benchmark data from 2006; they are therefore only provisional with regard to the figures given for households and individuals in Germany.
C.3 In the Dataset $PGEN
D. Error Updates
D.1 In the Dataset VH and WH
Variable Label: Owner Of The Dwelling
ValueWrongCorrect-2 | Does not apply | Does not apply |
-1 | No answer | No answer |
1 | Self Owned Res. Property | Local Govt. Apt. |
2 | Local Govt. Apt. | Co-Operative Apt. |
3 | Co-Operative Apt. | Company Apt. |
4 | Company Apt. | Private Owner |
5 | Private Owner | Do Not Know |
D.2 In the dataset $PGEN