Changes in the Dataset

Known Bugs/Fixes


SOEP v31

1984-2014 (Wave BE)

June 6, 2016

In the file with generated longitudinal data on children (KIDLONG) in SOEP-Core v31.1, another correction had to be made: a small amount of data that had been collected only in the FiD study was missing.

This applies to the variables KA06$$ (Activities for children below the age of 6) and KA16$$ (Activities for children aged 6 to 16).

If you want to analyze these variables, there are three ways to obtain the corrected data:

  1. Use the original data from SOEP-Core (in the files $$KIND).
  2. Use the data set KIDL in SOEPlong (where the data were implemented correctly).
  3. Request the corrected data set KIDLONG from our hotline (). We can provide you with an individualized download link.

March 18, 2016

Various updates required us to distribute a new version. Please see the DOI landing page soep.v31.1 for documentation of the changes.

SOEP v29

1984-2012 (Wave BC)

Mar. 27, 2014

HGEN

Errors in the imputation of electricity, heating, and additional expenses for tenants in the current data distribution resulted in values that were too high. These errors also affected the generation of rent including maintenance but excluding heating. The variables affected are: electr$$, heat$$, util$$, rent$$, and frent$$ for the years 2008 to 2012. The variables typ1hh12 and typ2hh12 changed for two households.
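
To see which concrete variables this covers, the placeholder names can be expanded into year-specific names. The following is a minimal Python sketch, assuming that `$$` stands for the two-digit survey year (as the name typ1hh12 suggests); the helper `expand_placeholder` is hypothetical and not part of any SOEP tool.

```python
def expand_placeholder(pattern, first_year, last_year):
    """Expand a name such as 'electr$$' into year-specific variable names.

    Assumption: '$$' is replaced by the two-digit survey year.
    """
    return [
        pattern.replace("$$", f"{year % 100:02d}")
        for year in range(first_year, last_year + 1)
    ]


# Variables named in the HGEN note, for survey years 2008 to 2012.
affected = {
    stem: expand_placeholder(stem, 2008, 2012)
    for stem in ["electr$$", "heat$$", "util$$", "rent$$", "frent$$"]
}

for stem, names in affected.items():
    print(stem, "->", ", ".join(names))
```

The resulting list can serve as a checklist when comparing an existing analysis against the updated HGEN files.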

BCPKAL

Also in the 2012 survey year, after the suspension of compulsory military service in Germany, the related calendar information in the individual questionnaire was revised. This revision was made in the original individual data for 2012 but not in the corresponding calendar data—these have now been updated retrospectively for the data distribution v29.

Both errors were corrected and an update is now available for downloading upon request (). If you would like to use this updated version in your work, please cite the version number, SOEP v29.1 (or better, doi: 10.5684/soep.v29.1) in publications using these data.

SOEP v28

1984-2011 (Wave BB)

Dec. 19, 2012

BIOCOUPLM, BIOCOUPLY, BIOMARSM, BIOMARSY
In some cases, reports of a past divorce were not taken into account in the data generation process. In addition, the reported year of death of a former partner was in some cases overwritten by the respective current year of the interview. This affected not only the start and end date of some spells but also missing information and validation checks.

$FAMSTD
The mistakenly overwritten information in the generation of BIOCOUPL$ affected validation checks. A majority of the formerly missing information is now available. However, the number of implausible answers has also risen in the process.

An update for all corrected files can be downloaded by means of a personalized link. Please contact us to obtain your link.

Please note: If you use one of the provided bugfixes in your analyses we recommend citing it as follows:
English:
Socio-Economic Panel (SOEP), data for years 1984-2011, version 28.1, SOEP, 2012.
German:
Sozio-oekonomisches Panel (SOEP), Daten für die Jahre 1984-2011, Version 28.1, SOEP, 2012.
Short Version:
SOEP v28.1 
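
The recommended citations follow a fixed pattern, so they can be generated rather than typed by hand. This is a minimal Python sketch of that pattern; the function name `soep_citations` and its parameters are illustrative assumptions, not an official SOEP utility.

```python
def soep_citations(first_year, last_year, version, release_year):
    """Build the English, German, and short citation strings.

    The wording is copied from the recommendation above; only the years,
    the version number, and the release year are filled in.
    """
    years = f"{first_year}-{last_year}"
    return {
        "english": (
            f"Socio-Economic Panel (SOEP), data for years {years}, "
            f"version {version}, SOEP, {release_year}."
        ),
        "german": (
            f"Sozio-oekonomisches Panel (SOEP), Daten für die Jahre {years}, "
            f"Version {version}, SOEP, {release_year}."
        ),
        "short": f"SOEP v{version}",
    }


citations = soep_citations(1984, 2011, "28.1", 2012)
for style, text in citations.items():
    print(f"{style}: {text}")
```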

SOEP v27

1984-2010 (Wave BA)

March 30, 2012

BIOAGE03
The age of the children was not correct and had to be recalculated. In addition, some missing values for children’s weight and height had the wrong value “0” and had to be recoded. Finally, the number of doctor visits for the survey years 2005 and 2006 was set to “0” by mistake and had to be recoded.

BIOAGE06
For 14 children, the birth month was missing although this information was available in recent waves. This information was added. In addition, one incorrect person number and one incorrect household number were changed.

BIOAGE08
The age of the children was not correct and had to be recalculated.

LIFESPELL
In the update of the dataset LIFESPELL, approximately 100 cases of emigration were recoded for the time period 2006-2010. In the original version of v27 they were incorrectly specified as living in Germany. The revised LIFESPELL file also contains new information about the year of death for a small number of individuals. For more information please contact Hannes Neiss (). 

An update for all corrected files can be downloaded, but only by means of a personalized link. Please contact us to obtain such a link.

Please note: If you use one of the provided bugfixes in your analyses we recommend citing it as follows:
English:
Socio-Economic Panel (SOEP), data for years 1984-2010, version 27.2, SOEP, 2012.
German:
Sozio-oekonomisches Panel (SOEP), Daten für die Jahre 1984-2010, Version 27.2, SOEP, 2012.
Short Version:
SOEP v27.2

Jan 2, 2012

COGDJ
In the file COGDJ, the 2010 data had not yet been updated in the released version. To obtain a bugfix for download, please contact us.

 English labels
In the data sets ZHBRUTTO and BAHBRUTTO, some of the English labels shifted position and had to be redefined. This applies to the following variables:

ZHBRUTTO    BAHBRUTTO
SAMPLE1
ZBULA
ZDATUMMO
ZHAND
ZHERGS
ZHTYP       BAHTYP
ZSAMPREG

Also, in the $PGEN data sets, no English value labels were generated for the new variables on educational degrees and training qualifications prior to joining the panel. This applies to the English labels for the following variables:
FIELD$$, DEGREE$$, and TRAINA$$–TRAIND$$.

If you use one of those variables, please contact us to obtain a download link for the bugfixes.

PPFADL in SOEPlong
In the SOEPlong data version distributed earlier this year, the following two variables in the file PPFADL had missing values in 2010:

HID — key indicator for households and
NETT1 — the short version of the tracking variable NETTO.

An update for PPFADL can be downloaded, but only by means of a personalized link. Please contact us to obtain such a link.

Please note: If you use one of the provided bugfixes in your analyses we recommend citing it as follows:
English:
Socio-Economic Panel (SOEP), data for years 1984-2010, version 27.1, SOEP, 2012.
German:
Sozio-oekonomisches Panel (SOEP), Daten für die Jahre 1984-2010, Version 27.1, SOEP, 2012.
Short Version:
SOEP v27.1.

SOEP v26

1984-2009 (Wave Z)

Jan. 6, 2011

There was a problem in the assignment of the correct current household number for 3% of the children in the generated longitudinal dataset KIDLONG. The variable HHNRAKT has been corrected accordingly.
In addition, the data in the variable K_NRKID for survey year 1987 have changed for child 397403.
Here, the "number of children in the HH below the age of 16" went from 1 to 2.

Please contact us if you use the KIDLONG dataset. We will provide an individualized method of downloading the corrected version for both the 100% dataset for the EEA countries and the 95% version available for use worldwide.


Please note: If you use the corrected dataset KIDLONG we recommend citing it as follows:
English:
Socio-Economic Panel (SOEP), data for years 1984-2009, version 26.1, SOEP, 2011.
German:
Sozio-oekonomisches Panel (SOEP), Daten für die Jahre 1984-2009, Version 26.1, SOEP, 2011.
Short Version:
SOEP v26.1.

   

1984-2008 (wave Y)

Feb. 10, 2010

Downloadable bug-fix for children's weighting factors of wave Y (2008)

Individuals born in 2002 (thus being 6 years of age in wave Y, 2008) whose parents completed the newly introduced child questionnaire for this particular cohort did not receive a valid score on the wave-specific cross-sectional weighting variable (this population can be identified by YNETTO=23). This affects the variable YPHRF in the file PPHRF and the variable W1110108 in the file YPEQUIV. This inaccuracy applies only to these 237 children aged 6 in this particular wave and affects only the individual, but not the household weights. Moreover, any weighted analysis based only on adult respondents using, for instance, the YP and YPGEN files is virtually unaffected by this error. Users who wish to include the six-year-olds in a weighted analysis are asked to download updated versions of the datasets YPHRF and YPEQUIV.
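
Before deciding whether the update is needed, it can help to check how many records in a working sample carry the affected code. The following is a minimal Python sketch of that check; the records, the person IDs, and the lower-case column names are invented for illustration, and only the code YNETTO=23 comes from the note above.

```python
# Toy records; IDs and weights are invented for illustration only.
records = [
    {"persnr": 101, "ynetto": 10, "yphrf": 1520.4},
    {"persnr": 102, "ynetto": 23, "yphrf": None},
    {"persnr": 103, "ynetto": 23, "yphrf": None},
    {"persnr": 104, "ynetto": 20, "yphrf": 980.1},
]

AFFECTED_NETTO_CODE = 23  # YNETTO value that identifies the affected children


def affected_children(rows):
    """Return the person IDs of rows that carry the affected YNETTO code."""
    return [row["persnr"] for row in rows if row["ynetto"] == AFFECTED_NETTO_CODE]


needs_update = affected_children(records)
print(f"{len(needs_update)} affected record(s): {needs_update}")
```

If the list is empty, for instance in an adults-only sample, the weighted analysis does not depend on the corrected files.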

Please send us an email to request a personalized URL and further details.

Dec. 5, 2009

In the dataset BIOIMMIG an incorrect assignment to the variable BIGOBACK (the variable on the probability to return home) was made for the categories -2 (“does not apply”) and 2 (“Yes, probably”) in some cases since 2001.

To correct this error, please download the appropriate script for your statistical program (SAS, SPSS or Stata) and run it after adjusting the script to the path of your local settings.

Script for Stata | TXT, 320.45 KB

Script for SPSS | TXT, 289.2 KB

Script for SAS | TXT, 309.72 KB

 

Nov. 9, 2009

Shortly after completing the DVD, an error in data generation was identified in the file BIOPAREN.
The error is in the categories of parental religious affiliation (MRELI, VRELI). The codes for the categories "other Christian affiliation", "Islamic affiliation" as well as "other religious affiliation" require correction. The other categories of the variable are not affected.

To correct this error, please download the appropriate script for your statistical program (SAS, SPSS or Stata) and run it after adjusting the script to the path of your local settings.

Script for Stata | TXT, 75.48 KB

Script for SPSS | TXT, 64.96 KB

Script for SAS | TXT, 75.55 KB

If you need an update for another statistical program, please contact our hotline.


1984-2007 (wave X)

Dec. 04, 2008

In the process of extensive checking, several problems were identified in the 1984-2007 data distribution currently available on DVD (waves A-X).

The corrected datasets are now available to be downloaded as a password-protected ZIP file from our homepage. To obtain download access to the corrected datasets, please e-mail or call the SOEPhotline at +49 30 89789 292.

To unzip the files, you will need the password for the current 1984-2007 data distribution, or the password used to access the expanded regional data in the GGKBOU dataset. If you do not have the current data distribution, please contact our hotline (soepmail@diw.de).

The fixed files are:

  • HHRF (weighting factors for households): in preparing the weighting factors for households, an older version was mistakenly distributed for the variables WHHRFALL and XHHRFALL. These have now been replaced with the revised version.
  • PBIOSPE: Due to a problem in data storage, some of the earnings biographies collected from wave U onward, whether for the first time or in subsequent interviews, were not recorded correctly. PBIOSPE was therefore revised retroactively from wave U on.
  • XHBRUTTO: Here, an erroneous code for the East German federal states was corrected in the variable XBULA.
  • WP: WKLAS, WIS88, WIS88N and WKLASN were updated. This was necessary since some data had been overwritten with missings.
  • WPGEN/XPGEN: Because of the corrections to WKLAS and WIS88 in WP, it was necessary to update some generated variables which are derived from $KLAS and $IS88: This includes the variables IS8806, ISEI06, MPS06, SIOPS06, EGP06 and KLAS06. Furthermore, due to the revision of PBIOSPE (see above) EXPFT$$, EXPPT$$ and EXPUE$$ were also updated.
  • HBRUTT00: Because of a conflict in household IDs for the expanded original gross sample F, the household IDs had to be changed in some cases. This only applies to households that did not provide valid SOEP interviews.
  • GGKBOU: As a result of the change in HBRUTT00, the identifier HHNR was adapted in some cases in this dataset as well.


1984-2006 (wave W)

Apr. 03, 2008

We found incorrect labels for the variables indicating the owner of the dwelling (VH27 and WH27). Please note the corrections in the table below.

This will be fixed with the next data release.

Variable Label: Owner Of The Dwelling

Value   Wrong                       Correct
-2      Does not apply              Does not apply
-1      No answer                   No answer
1       Self Owned Res. Property    Local Govt. Apt.
2       Local Govt. Apt.            Co-Operative Apt.
3       Co-Operative Apt.           Company Apt.
4       Company Apt.                Private Owner
5       Private Owner               Do Not Know
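
Until the labels are fixed in the data, the correction can be applied at analysis time by looking up the corrected label for each code. This is a minimal Python sketch of that lookup; the dictionary and function names are illustrative assumptions, and the label texts are taken from the table above.

```python
# Labels as distributed (wrong) and as they should read (correct).
WRONG_LABELS = {
    -2: "Does not apply",
    -1: "No answer",
    1: "Self Owned Res. Property",
    2: "Local Govt. Apt.",
    3: "Co-Operative Apt.",
    4: "Company Apt.",
    5: "Private Owner",
}
CORRECT_LABELS = {
    -2: "Does not apply",
    -1: "No answer",
    1: "Local Govt. Apt.",
    2: "Co-Operative Apt.",
    3: "Company Apt.",
    4: "Private Owner",
    5: "Do Not Know",
}


def corrected_label(value):
    """Return the corrected label for a VH27/WH27 code."""
    return CORRECT_LABELS[value]


# Codes whose label text changes with the correction.
changed = sorted(v for v in CORRECT_LABELS if CORRECT_LABELS[v] != WRONG_LABELS[v])
print(changed)
print(corrected_label(1))
```

The numeric codes in the data stay as they are; only the text attached to codes 1 to 5 changes.
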

Mar. 31, 2008

In the information on school and occupational training, the data on graduations and completed training since 2005 contained errors (variables PSBIL and PBBIL01-03 in VPGEN and WPGEN). Further information is available on request.

Sept. 28, 2007

In the process of reading in the revised ERWZEIT variables, the VEBZEIT variables in columns of the same name from previous years were overwritten. Both variables have now been corrected in the PGEN files for the years 1984-1997 (Waves A-N). Users who need data from before 1998 for their analyses should use the new PGEN files.
The updated data are provided in the various formats for downloading. Please request the password from us.


1984-2005 (wave V)

Jul. 14, 2006

In BIOPAREN, the values of the following variables contain errors:
  • VAORTAKT: 'Current residence of Father'
  • MAORTAKT: 'Current residence of Mother'
  • VAORTUP: 'Year of Update of VAORTAKT'
  • MAORTUP: 'Year of Update of MAORTAKT'
For an update, please contact us.

Jul. 13, 2006

In BIOAGE01 the labels for the variable BCKSTOER are missing.

value labels
(-1)'N.A.'
(0)'None of These Disorders '
(1)'Sensory'
(2)'Motor Functions'
(3)'Neurological'
(4)'Speech'
(5)'Regulatory'
(6)'Chronic Illness'
(7)'Physical Disability'
(8)'Mental Disability'
(11)'Motor Functions + Regulatory'
(12)'Sensory + Motor Functions + Speech'
(13)'Sensory + Motor Functions + Chronic Illness'
(14)'Sensory + Motor Functions'
(15)'Sensory + Motor Functions + Chronic Illness + Neurological + Speech + Physical Disability'  

Jul. 12, 2006

In Microsoft Windows, the links on CD 3 to document names containing "-en" (for example, links to documentation on the generated variables in English) are incorrect. If you receive an error message when attempting to access a particular document, change the "-en" to "_en" in your browser's address bar. With Linux and Unix, you should not have any problems.
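
If many links are affected, the substitution can be scripted. The following is a minimal Python sketch, assuming the "-en" marker sits directly before the file extension or at the end of the name; the file name `pgen-en.pdf` is a hypothetical example, not a confirmed document on the CD.

```python
import re


def fix_document_link(link):
    """Replace a trailing '-en' marker in a document name with '_en'.

    Assumption: the marker appears right before the extension (or at the
    very end), so names such as 'a-entry.pdf' are left untouched.
    """
    return re.sub(r"-en(?=\.|$)", "_en", link)


example = "docs/pgen-en.pdf"  # hypothetical document name
print(fix_document_link(example))
```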


1984-2004 (wave U)

Aug. 24, 2005

In 2005, the SOEP group, together with our fieldwork agency TNS Infratest Sozialforschung, carried out extensive checks on all regional identifiers in the SOEP data, such as administrative districts and federal states. First, this enabled us to replace missing values of regional identifiers, even in past years, with valid information. Second, in some cases the regional identifiers $BULA and $SAMPREG have been corrected for former waves. Based on these changes, all information concerning regional identifiers in the SOEP should be consistent.

The checks mentioned above were finalized after the data production of our most recent CD-ROM (up to wave U, 2004). If you are interested in using the corrected information, you may apply the following statements (TXT, 9.92 KB).

1984-2003 (wave T)

Feb. 18, 2005

Some variables are displayed oddly, probably only when Stata is used under Windows 2000.
More information in German.

Dec. 10, 2004

Since the distribution of the SOEP data 1984-2003, some variables have been corrected or modified.

Distributions before wave 20 (T)

 19.12.2003 POP - Variables in the data distribution within Germany

Provisional values for the generated variables for population membership (SPOP and SHPOP) have inadvertently been distributed. We will provide an update at the beginning of next year. The POP variables which rely on extrapolation factors have been calculated using the correct data and are therefore not affected.

 18.12.2003 Data distribution within Germany

Due to an error in the setup program for the 1984-2002 data distribution for Stata and SPSS, the file "BIOJOB" is not installed automatically. SAS users are not affected. To access the "BIOJOB" file in Stata or SPSS, it has to be installed manually using a program-specific command.

The following steps are required:

  • Open the 'Work' directory for the SOEP data.
  • Insert the SOEP19 CD#1 (in this example in drive D:; please change this if your drive has a different letter).
  • Use the command

d:\data\gsoep\sta_100.exe -pass=******** biojob.* (Stata-Files),
d:\data\gsoep\por_100.exe -pass=******** biojob.* (SPSS-Portable-Files) or
d:\data\gsoep\sps_100.exe -pass=******** biojob.* (SPSS-SAV-Files)

to install the file for the respective statistical package.
(******** is the password)

If you have any further problems or questions please ask Rainer Pischner.

 04.11.2003 Data distribution within Germany

A LABEL bug in the file BIOPAREN on the German CD 1984-2002: we discovered a small value label bug in the variables VNAT and MNAT.

The label for value 2 has to be "andere Staatsangehörigkeit als deutsch" and not "türkisch".

 03.05.2002 THE FOLLOWING ONLY AFFECTS THE ENGLISH LANGUAGE VERSION of the GSOEP. THE GERMAN VERSION IS NOT AFFECTED!

Unfortunately, we have found a few more LABEL bugs in the English distribution of the Person Files. The data are correct but incorrectly labeled.

You can download code in STATA, SPSS and SAS which can be copied and run. Simply edit the pathname of where you installed the data, at the top of the code chunk.

That will patch things up quickly. Sorry for any hassles caused.

John Haisken-DeNew

 28.02.2002 THE FOLLOWING ONLY AFFECTS THE ENGLISH LANGUAGE VERSION. THE GERMAN VERSION IS NOT AFFECTED !!!!!

Unfortunately, we have found a few more VAR LABEL bugs in the English distribution of QP (Person File 2000). The data are correct but incorrectly labeled (var labels).

Attached is code in STATA, SPSS and SAS which can be copied and run. Simply edit the pathname of where you installed the data, at the top of the code chunk (AND at the bottom for SPSS only).

That will patch things up quickly. Sorry for any hassles caused.

===================== STATA ====================

* Adjust the path below to where you installed the data.
use "c:\gsoep17\qp", clear
label variable qp03 "Maternity, Paternity Leave"
label variable qp04 "Registered As Unemployed"
label variable qp6301 "Second Job, Earnings"
label variable qp6302 "Gross Amt Second Job Monthly Income"
label variable qp6303 "Old-Age,Invalid Pension"
label variable qp6304 "Gross Amt. Of Old-Age,Invalid Pension,Mo"
label variable qp6305 "Widow-Er,Orphan Benefit"
label variable qp6306 "Gross Amt Of Widow-Er,Orphan Benefit,Mo"
label variable qp6307 "Unemployment Benefit"
label variable qp6308 "Gross Amt.Of Unemployment Benefit,Mo"
label variable qp6309 "Unemployment Relief"
label variable qp6310 "Gross Amt.Of Unemployment Relief, Mo"
label variable qp6311 "Subsistence Allowance"
label variable qp6312 "Gross Amt. Of Subsistence Allowance,Mo"
label variable qp6313 "Transition Money, etc."
label variable qp6314 "Gross Amt. Of Transition Money, etc."
label variable qp6315 "Early Retirement Benefits"
label variable qp6316 "Gross Amt. Of Early Rtiremnt Benefits,Mo"
label variable qp6317 "Maternity Benefit"
label variable qp6318 "Gross Amount Of Maternity Benefit"
label variable qp6319 "Student Grant"
label variable qp6320 "Gross Amount Of Student Grant,Mo"
label variable qp6321 "Military,Civilian Payments"
label variable qp6322 "Gross Amt. Military,Civilian Pay,Mo"
label variable qp6323 "Income From Persons Not In Household"
label variable qp6324 "Gross Amt. Income-Persons Not In HH,Mo"
label variable qp6325 "No Other Income Besides Earned Income"
save, replace

===================== SPSS =====================

get file='c:\gsoep17\qp.sav'.
var label qp03 "Maternity, Paternity Leave".
var label qp04 "Registered As Unemployed".
var label qp6301 "Second Job, Earnings".
var label qp6302 "Gross Amt Second Job Monthly Income".
var label qp6303 "Old-Age,Invalid Pension".
var label qp6304 "Gross Amt. Of Old-Age,Invalid Pension,Mo".
var label qp6305 "Widow-Er,Orphan Benefit".
var label qp6306 "Gross Amt Of Widow-Er,Orphan Benefit,Mo".
var label qp6307 "Unemployment Benefit".
var label qp6308 "Gross Amt.Of Unemployment Benefit,Mo".
var label qp6309 "Unemployment Relief".
var label qp6310 "Gross Amt.Of Unemployment Relief, Mo".
var label qp6311 "Subsistence Allowance".
var label qp6312 "Gross Amt. Of Subsistence Allowance,Mo".
var label qp6313 "Transition Money, etc.".
var label qp6314 "Gross Amt. Of Transition Money, etc.".
var label qp6315 "Early Retirement Benefits".
var label qp6316 "Gross Amt. Of Early Rtiremnt Benefits,Mo".
var label qp6317 "Maternity Benefit".
var label qp6318 "Gross Amount Of Maternity Benefit".
var label qp6319 "Student Grant".
var label qp6320 "Gross Amount Of Student Grant,Mo".
var label qp6321 "Military,Civilian Payments".
var label qp6322 "Gross Amt. Military,Civilian Pay,Mo".
var label qp6323 "Income From Persons Not In Household".
var label qp6324 "Gross Amt. Income-Persons Not In HH,Mo".
var label qp6325 "No Other Income Besides Earned Income".
save outfile='c:\gsoep17\qp.sav'.

===================== SAS ======================

libname soep 'c:\gsoep17';
libname library 'c:\gsoep17';
options compress=no ls=80 errors=1 nofmterr nodate nocenter;
data soep.qp;
set soep.qp;
label
QP03 = "Maternity, Paternity Leave"
QP04 = "Registered As Unemployed"
QP6301 = "Second Job, Earnings"
QP6302 = "Gross Amt Second Job Monthly Income"
QP6303 = "Old-Age,Invalid Pension"
QP6304 = "Gross Amt. Of Old-Age,Invalid Pension,Mo"
QP6305 = "Widow-Er,Orphan Benefit"
QP6306 = "Gross Amt Of Widow-Er,Orphan Benefit,Mo"
QP6307 = "Unemployment Benefit"
QP6308 = "Gross Amt.Of Unemployment Benefit,Mo"
QP6309 = "Unemployment Relief"
QP6310 = "Gross Amt.Of Unemployment Relief, Mo"
QP6311 = "Subsistence Allowance"
QP6312 = "Gross Amt. Of Subsistence Allowance,Mo"
QP6313 = "Transition Money, etc."
QP6314 = "Gross Amt. Of Transition Money, etc."
QP6315 = "Early Retirement Benefits"
QP6316 = "Gross Amt. Of Early Rtiremnt Benefits,Mo"
QP6317 = "Maternity Benefit"
QP6318 = "Gross Amount Of Maternity Benefit"
QP6319 = "Student Grant"
QP6320 = "Gross Amount Of Student Grant,Mo"
QP6321 = "Military,Civilian Payments"
QP6322 = "Gross Amt. Military,Civilian Pay,Mo"
QP6323 = "Income From Persons Not In Household"
QP6324 = "Gross Amt. Income-Persons Not In HH,Mo"
QP6325 = "No Other Income Besides Earned Income";
run;
=============================================
John Haisken-DeNew 
