1984-2017 (Wave BH)
Overview (May 2019):
Values for the variables plb0186_v2 and plb0186_h for the East sample in 1990 are too small by a factor of 10.
The names assigned to the raw variables bhh_37_01 “electricity included in rent” and bhh_37_02 “assessed burden of housing expenses (rent and additional expenses)” do not correspond to the standard SOEP concept for naming variables. Both variables will be renamed in the new version.
The previous version from the migspell dataset was delivered.
The new identifiers were not filled in and have to be filled in from the old identifiers.
Details:
1. Dataset: pl
Variables: plb0186_v2, plb0186_h
Values for the variables plb0186_v2 “Actual working time with overtime (1990-2017)” and plb0186_h “Actual working time with overtime (harmonized)” have the wrong values for the East sample in 1990.
The variable plb0186_h is made up of the variables plb0186_v1 (1984-1989) and plb0186_v2 (1990-2017). We included all of the values for plb0186_v1 as they were, and divided all of the valid values for plb0186_v2 by 10. The process of harmonization is necessary due to the fact that the two raw variables for 1990 were provided in different formats:
gpost: gp3601e (two-digit, no comma)
gp: gp39 (three-digit, no comma)
The raw variable gp3601e from gpost was assigned to the variable plb0186_v2 although it does not have to be divided by 10. As a result, all values for the East German population for the year 1990 were mistakenly divided by 10. The simplest way of solving this problem is to multiply the valid values for the East German population by 10.
|
cd "Datenpfad" |
Detailed information on the general process used to harmonize variables can be found here:
Versioning and harmonization of variables
Working with harmonized Variables
2. Dataset: bhh
Variables: bhh_37_01, bhh_37_02
The names assigned to the raw variables bhh_37_01 “electricity included in rent” and bhh_37_02 “assessed burden of housing expenses (rent and additional expenses)” do not correspond to the standard SOEP concept for naming variables. Both variables had to be renamed:
bhh_37_01 “Electricity included in rent” → bhh_33
bhh_37_02 “Assessed burden of housing expenses (rent and additional costs)” → bhh_37
To find out more about how raw variables are named in the SOEP, see the SOEPcompanion:
Naming conventions of Variables and Datasets
3. Dataset: migspell
Unfortunately the previous version of the migspell dataset was delivered. For the current version, please contact the SOEPhotline or write an email to soepmail.
4. Dataset: biobirth, bioimmig, biojob, bioparen, bioresid, biosib, biosoc, biotwin, pflege
Variables: pid, cid, hid
In the process of “merging” SOEP-Long and SOEP-Core, all of the SOEP-Long ID variables (pid, hid, cid) were also included in the raw datasets to make merging easier for users. In some datasets, only the ID variables were created but not filled in with the corresponding IDs.
Empty pid: biobirth, bioimmig, biojob, bioparen, bioresid, biosib, biosoc, biotwin, pflege
Empty hid: bioimmig, bioresid, biosoc
Empty cid: biobirth, bioimmig, biojob, bioparen, bioresid, biosib, biosoc, biotwin, pflege
With these datasets, please continue to use persnr, hhnrakt, hhnr, or copy the content into the corresponding new ID variable.
|
clonevar pid = persnr |
Further information on SOEP identifiers can be found here:
Dataset Identifier