Integrated Studies

Along with the main studies, SOEP-Core and SOEP-IS, the SOEP Research Data Center distributes a series of other datasets. These include research projects carried out with specific samples that have been integrated into SOEP-Core but are also available as independent datasets.

Families in Germany (FID)

Titel: Familien in Deutschland (FiD), Data from 2010-2013

DOI: 10.5684/soep.fid.v4.0
Collection Period: 2010-2013
Release Date: 28.02.2014
Primary Investigators: Jürgen Schupp, C. Katharina Spieß, Mathis Fräßdorf, Rainer Siegers, Juliana Werneburg


Families in Germany (FiD) was the first systematic, large-scale evaluation of the entire spectrum of public benefits to married people and families in Germany. The study was commissioned by the Federal Ministry for Family Affairs, Senior Citizens, Women, and Youth (BMFSFJ) and the Federal Ministry of Finance (BMF).

The FiD study was designed to provide an improved source of nationally representative data for the analysis of several subgroups of the population that are important for family policy but too small in size relative to the overall population to produce statistically reliable and generalizable results. These include single-parent families, low-income families, and families with more than two children. To address the need for more observations of these groups, the FiD study was conducted from 2010-2013 by SOEP at DIW Berlin in partnership with TNS Infratest Sozialforschung (now Kantar Public). It was modelled on the SOEP survey and provides data on more than 4,500 households.

In FiD, the following four supplementary samples were surveyed annually from 2010 to 2013, thus forming a panel study:

  1. A sample of families in "critical income brackets"
  2. A sample of single parents
  3. A sample of families with more than two children
  4. A "cohort sample" of the 2007, 2008, 2009, and 2010 (first quarter) birth cohorts

The data were first released to the scientific community in April 2012. They can be obtained from the SOEP Research Data Center with a SOEP data distribution contract (forms). Although the FiD survey instruments were adapted to the specific research focus of the evaluation, they were based on those used in the SOEP. By using the FiD and SOEP data together, researchers can obtain a representative picture of the German population. To facilitate data analysis, the SOEP team has included the majority of the SOEP datasets in FiD along with joint weighting factors. More information in greater detail can be found in the article:

Schröder, Mathis, Rainer Siegers, and C. Katharina Spieß. 2013. "Familien in Deutschland" - FiD. Schmollers Jahrbuch 133 (4), 595-606 (DOI: 10.3790/schm.133.4.595).

The Federal Ministry of Family Affairs, Senior Citizens, Women, and Youth has published a compilation of studies produced as part of the evaluation here.

Selection mode:

Cohort: clustered sample of the local registries at the domicil ('Einwohnermeldeamt'), oversampling of migrant households via nationality and onomastics.

Large families, single parents, low income families: people drawn from a representative random sample who gave their allowance for more surveys and screened accordingly to the criteria via telephone interview.

Collection mode:

In FiD only Computer Assisted Personal Interviews (CAPI) were conducted. Principally an interviewer tries to obtain face-to-face interviews with all members of a given survey household aged 16 years and over. Additionally one person (head of household) is asked to answer a household related questionnaire covering information on housing, housing costs, and different sources of income. This covers also some questions on children in the household up to 16 years of age, mainly concerning attendance at institutions (kindergarten, elementary school, etc.).

Download check:

Packaged as zip-files with MD5 fingerprints
SPSS und Stata mit englisch/deutschen Labels

c338363aeb7738e3d5e0ed1ba694eed3

Citation:

The FiD data are now completely integrated into SOEP-Core (in some cases after slight adjustments), Version 31ff. (10.5684/soep.v31). The FiD households are being surveyed further with the SOEP-Core questionnaires as samples L1-L3.

The FiD data are also available as an independent dataset. The conditions and procedures for data access are the same as for the SOEP-Core data; for more information, click here.

Only in German:

2013:

2012:

2011:

2010:

Kohortenstichprobe

Screening-Stichprobe

IAB-SOEP-Migration Sample

The IAB-SOEP Migration Sample is a joint project of the Institute for Employment Research (IAB) and the Socio-Economic Panel (SOEP) at the German Institute for Economic Research (DIW Berlin). The project tempts to overcome limitations of previous datasets by drawing a sample that takes into account changes in the structure of migration to Germany since 1995. The dataset is an additional sample for the SOEP-Core study and therefore completely harmonized with the SOEP and integrated into SOEP v33.1 (identical questionnaire with additional questions on the respondent's migration situation). The study opens up new perspectives for migration research and gives insights on the living situations of new immigrants to Germany.

Sampling

The sampling frame of the IAB-SOEP Migration Sample M1 is based on the administrative records in the Integrated Employment Biographies (IEB) of the IAB and the households have been surveyed initially in 2013. The sample has been drawn from 250 regional units with a multi-step procedure that permits a random selection of individuals in the target population using an equal probability selection method. In each of the 250 regional units, 80 addresses were drawn randomly, where some countries of origin have been given a higher probability to ensure a sufficient number of observations. These groups consist of immigrants from the EU-New Member States and Southern European countries.

The households from the second IAB-SOEP Migration Sample (M2) surveyed in 2015 are now also included in this release. The target population of the second IAB-SOEP Migration Sample consists of immigrants to Germany who have arrived between 2010 and 2013. Migrants from the new EU member states in Eastern Europe dominate this group. This focus will make it possible to better describe the dynamic recent evolution of immigration to Germany. The sample M2 consists of 1,096 households, and was, like sample M1, drawn from register data from the Federal Employment Agency.

Interviews

The sampled individuals have been interviewed with a personal questionnaire including questions on their (migration) biography and a household questionnaire. All family members aged 16 years or older were also asked the questions of the personal questionnaire. From the second survey year on all other SOEP questionnaires are in the field, especially the age specific Mother and Child Questionnaries. 

The IAB-SOEP Migration Sample provides a database which permits to gain new insights by addressing various aspects of immigration regarding activation and attraction of skilled immigrants and to provide clear guidance regarding immigration, integration and labour market policies in Germany. The database contains the following information:

  • Migration history: Year of immigration, migration history, search behaviour and information channels, social networks.
  • Education history: highest obtained schooling and vocational degrees, years of schooling, education acquired at home and abroad, acknowledgement procedure of foreign credentials, language proficiency.
  • Employment history: Employment, self-employment, unemployment in Germany and in foreign countries.
  • Labour market background: Earnings, full- and part-time employment, working hours, benefit assistance, reservation wage, participation in active labour market policies.
  • Return migration: Return migration intensions, return migration, limited survey of returned immigrants.
  • Miscellaneous: Remittances to home countries, life satisfaction, risk preferences, social integration and acceptance.

The study opens up new perspectives for migration research and gives insights on the living situations of new immigrants to Germany.

The questionnaire covers the complete migration, education, and labor market histories of respondents in both their country of origin and in all countries to which they have lived in. Additionally the questionnaire includes several new batteries of questions that have not previously been considered in the SOEP or other household surveys in Germany, or not in the necessary depth. Examples include questions on earnings and labor market integration and occupational status before migration, migration decisions in the family and partnership context, and purposes and transfer channels of remittances.

The IAB-SOEP Migration Sample substantially increases the available sample size in the SOEP-Core study for research on migration and the lives of immigrants in Germany. 4,964 persons residing in 2,723 households participated in the first wave of the survey.

Table 1: Development of case numbers in the IAB-SOEP Migration Sample

2013 2014 2015 2016 2017
Adults 4.964 3.752 4.770 3.832 3.408
Children 2.481 1.950 2.480 1.972 1.768

Since the survey is also included in the regular SOEP, migrants from the other SOEP-Core samples can be included in analyses, increasing the number of observations further.

Record Linkage

Please note that data from both samples can be linked with administrative employment and income data: Survey respondents are asked to provide explicit consent to record linkage. But since this linked dataset contains social data, these weakly anonymized data are only accessible on site at the Research Data Center of the German Federal Employment Agency at the IAB (FDZ IAB). Researchers can access FDZ IAB data through a guest visit to the IAB or through remote data processing, also arranged with the IAB. The linked data will soon be available to external researchers. Requests for data access should be directed to FDZ IAB, since a contract with IAB for data use is required. (more information).

Title: IAB-SOEP-Migration Sample (M1, M2), Data from years 2013-2017

DOI:
10.5684/soep.iab-soep-mig.2017

Publication date: 2019-03-06

Principal investigators: Herbert Brücker, Stefan Liebig, Jürgen Schupp

Co-PIs: Martin Kroh, Jan Goebel, Parvati Trübswetter

Affiliated staff members for providing the Scientific Use File: Charlotte Bartels, Alexandra Fedorets, Andreas Franken, Markus Grabka, Jannes Jacobsen, Selin Kara, Peter Krause, Hannes Kröger, Elisabeth Liebau, Maria Metzing, Janine Napieraj, Jana Nebelin, David Richter, Diana Schacht, Paul Schmelzer, Christian Schmitt, Daniel Schnitzlein, Carsten Schröder, Rainer Siegers, Knut Wenzig, Stefan Zimmermann

Datenerhebung: Kantar Public Deutschland

Erhebungszeitraum:
2013-2017

Citation:

If you publish using this data, it is mandatory to quote the following reference:

Herbert Brücker, Martin Kroh, Simone Bartsch, Jan Goebel, Simon Kühne, Elisabeth Liebau, Parvati Trübswetter, Ingrid Tucci & Jürgen Schupp (2014): The new IAB-SOEP Migration Sample: an introduction into the methodology and the contents. SOEP Survey Paper 216, Series C. Berlin, Nürnberg: DIW Berlin.

Data Access

Via RDC SOEP:
The data from the IAB-SOEP Migration Sample are at the same time additional samples for the SOEP-Core data set, the data are integrated in SOEP v34 as samples M1 and M2 in SOEP v34 (identical questionnaire with additional questions on the respondent's migration situation). All SOEP users with a valid contract already received the data with SOEP v34 or can order it as SOEP v.34 or as a separated data set  in the usual ways (SOEPhotline, website) without needing to sign any additional data distribution contracts. The dataset is available upon request for free via personalized secure download. New users find information on the SOEP application process here.

Via Research Data Centre (FDZ) of the Federal Employment Agency at the Institute for Employment Research:
The application process of the FDZ IAB.

Data structure

The data structure is very similar to the structure used in SOEP-Core. For a comprehensive description of SOEP-Core including M1 and M2 please refer to the new SOEPcompanion.

Each wave is identified by letters of the alphabet: the first wave in 1984 is wave “A”, 1985 is wave “B”, and so on, up to BG in 2016. To simplify notation, the “$” sign is used, when all waves of one group of datasets are referred to. For example, $H refers to all household level datasets AH to BFH. For each year of SOEP data there are single data files for households (e.g. $H) as well as for individual respondents (e.g. $P) and children (e.g. $KIND) based on interview information. These observations make up the “net” population, with each of these files containing as many records as interviews could be conducted. Additional data files with a limited number of variables based on the “address log” constitute the “gross” number of households and persons, i.e. all households and their members which were eligible for an interview in any given year. For an overview, please see the table below

Data set label Survey years Subject of analysis
ppfad Individual Tracking File P
hpfad Household Tracking File H
$$p_mig Integrated personal and biographical questionnaire (Sample M specific) 2013/14/15/16 P
migspell Migration biography in spell format P
$$p Personal questionnaire 2013/14/15/16 P
$$h Household questionnaire 2013/14/15/16 H
$$kind Data on children (from HH-Questionnaire) 2013/14/15/16 P
$$pgen Generated Individual Data 2013/14/15/16 P
$$pkal Individual Calendar 2013/14/15/16 P
$$hgen Generated Household Data 2013/14/15/16 H
mihinc Multiple imputed data on monthly household income H
pflege Persons needing care within the household P
health Health indicators P
$$hbrutto Gross Household Data 2013/14/15/16 H
$$pbrutto Gross Individual Data 2013/14/15/16 P
hhrf Weighting and staying probabilities H
phrf Weighting and staying probabilities P
biobirth Generated biographical information: Birth Biography of Female and Male Respondents P
bioedu Generated biographical information: educational participation and transition P
biocouplm Generated biographical information: couple history, monthly P
biocouply Generated biographical information: couple history, annual P
bioimmig Generated biographical information: Generated and Status Variables for Foreigners P
biojob Generated biographical information: First and last job P
bioresid Generated biographical information: Occupancy and Second Residence P
biomarsm Generated biographical information: marital history files, monthly P
biomarsy Generated biographical information: marital history files, annual P
bioparen Generated biographical information: Biography Information for the Parents of SOEP-Respondents P
biosib Generated biographical information: Information on siblings P
biosoc Generated biographical information: Retrospective Data on Youth and Socialization P
biotwin Generated biographical information: Twins in the SOEP P
bioage17 Generated biographical information: Data from the Youth Questionnaire 2014/15/16 P
bioagel Generated biographical information: Data from the Mother & Child Questionnaires 2014/15/16 P
pbiospe Generated biographical information: Activity Biography P
cirdef Random Groups H
design Survey Design H
kidlong Data on children (from HH-Questionnaire, in long format) P
lifespell Spell Information on the Pre- and Post-Survey History of SOEP-Respondents P
artkalen Spell data from the activity calendar P
$$pequiv Cross-national Equivalent File 2013/14/15/16 P
$$page17 Questions from the Youth questionnaire not included in BIOAGE17 2014/15/16 P
bepluecke Short questionnaire of the year before (if missing) 2015/16 P
$$school Data from the Pre-teen questionnaire (11-12 years old) 2013/14/15/16 P
$$vp Data on the deceased person 2014/15/16 P
cogdj Data on cognitive tests (Youth) P
hbrutt$$ Original gross population of the sample specific first wave 2013/15/16
pbr_exit Cumulated Exit P

Missing conventions

Survey variables might be missing, i.e. without a valid code or value for different reasons. In the SOEP, negative values are not valid for any variable, but are used instead to code different reasons for missing information. There are two distinctions for missing values: they may originate in the respondent’s answer or in the survey design. The respondent may refuse or not know an answer or she may report invalid values on the one hand, and the interview design may exclude respondents with certain characteristics from some questions on the other (e.g. men will never be asked if they are pregnant). The following codes apply:

Code Meaning

-1 no answer / don’t know
-2 does not apply
-3 implausible value
-4 inadmissable multiple response
-5 not included in this version of the questionnaire
-6 version of questionnaire with modified filtering

With the extension of the SOEP in recent years, entirely new samples have been added to the core study. In these samples, sometimes questions are left out completely, e.g. to shorten the questionnaire or because the focus of the sample is different as in some of the related studies. In such a case, the variable will be set to “-5 Not included in this version of the questionnaire” for an entire subsample.

With the use of CAPI, recent developments include an “integrated” person questionnaire, i.e. the biography part and the “regular” part of the questionnaire are asked as one. Some of the questions in the biography part are repeated in the regular part. While in the PAPI mode, the respondent will answer the same question twice, the CAPI allows to filter the respondent around the question if it has already been asked. These cases are very rare - if they occur, they receive a code “-6 Version of questionnaire with modified filtering”.

Questionnaires and additional fieldwork instruments available as SOEP Survey Papers:

More documentation will follow.

Comprehensive documentation on SOEP-Core, the integration of the samples and how to work with the SOEP data on http://companion.soep.de

Additional documents:

  • The new IAB-SOEP Migration Sample: an introduction into the methodology and the contents (SOEP Survey Paper 216)
  • Flowcharts for the Integrated Individual-Biography Questionnaire of the IAB-SOEP Migration Sample 2013 (SOEP Survey Paper 261)
  • How to Generate Spell Data from Data in "Wide" Format based on the migration biographies of the IAB-SOEP Migration Sample (SOEP Survey Paper 228)
  • The 2013 IAB-SOEP Migration Sample (M1): Sampling Design and Weighting Adjustment (SOEP Survey Paper 271)

IAB-BAMF-SOEP Survey of Refugees

The Institute for Employment Research (IAB), the Socio-Economic Panel (SOEP) at the German Institute for Economic Research (DIW Berlin), and the Research Centre on Migration, Integration, and Asylum of the Federal Office of Migration and Refugees (BAMF-FZ) are working together on a project to survey people seeking protection in Germany from political persecution and violent conflicts. The influx of refugees to Germany, which rose sharply in the second half of 2015, poses challenges for public policy, government administration, and the population at large. To effectively coordinate lodging and assistance for these refugees and to foster their long-term integration into the labor market, educational system, and other societal spheres, robust and reliable information is needed that can be generalized and applied usefully to the challenges at hand. It is up to the empirical social research to provide a data base for these purposes.

To meet this need, the Institute for Employment Research (IAB) of the Federal Employment Agency, Research Centre on Migration, Integration, and Asylum of the Federal Office of Migration and Refugees (BAMF-FZ), and the Socio-Economic Panel (SOEP) at DIW Berlin are conducting a comprehensive standardized survey of refugees in Germany based on a qualitative preliminary study. Participation in the survey is voluntary, and the survey is being conducted by specially trained interviewers from the survey institute commissioned to do the fieldwork, KANTAR Public. Conceptualized as a longitudinal study, the survey was carried out in 2016, 2017, and 2018, asking adult refugees who arrived in Germany since 2013 seeking asylum. Data are available on more than 4000 adults and more than 5000 children in the respective survey years.

Table 1: Evaluable data sets on adults from the IAB-BAMF-SOEP Survey of Refugees

Wave 1 (2016)   Wave 1 (2016)  Wave 2 (2017) Wave 2 (2017)
Individuals Households Individuals Households
M3 (BA) 2.213 1.678 1.489 1.068
M4 (GeFam) 2.252 1.611 1.875 1.251
M5 (BA/GeFam) - - 2.252 1.519
Total 4.465 3.289 5.616 3.838

Table 2: Evaluable data sets on children of the IAB-BAMF-SOEP Study of Refugees

Amount of children in subsample

2016 2017
M3 (BA) 1.744 1.316
M4 (GeFam) 3.641 2.779
M5 (BA/GeFam) - 1.847
Total 5.385 5.942

The financing for the continuation in the following years (2019-2021) is applied for.

The anonymised data is made available to the entire scientific community for research purposes in spring of the following year.

Questions about refugees’ living situations and the overall framework conditions

The study looks at the living situations of refugees as well as their schooling, vocational training, and current occupational situations. It also collects information on refugees’ language use, living situations, family situations, social participation, and contact with both Germans and people of their own ethnic backgrounds. Other focal points of the study, which is conducted using standardized survey instruments, are the impacts of the legal and institutional frameworks for refugees and the effectiveness of various support programs.

Connection with the IAB-SOEP Migration Sample, the Socio-Economic Panel, and the IAB’s Integrated Employment Biographies Sample

The study and selection of questions in the survey closely follows that of the IAB-SOEP Migration Sample, which is a cooperation project between IAB and the SOEP group at DIW Berlin. Launched in 2013, the IAB-SOEP Migration Sample was also designed as a longitudinal study. As is the case with the IAB-SOEP Migration Sample, the data from the IAB-BAMF-SOEP Refugee Survey will become part of the data provided by the Socio-Economic Panel (SOEP). Every year since 1984, several thousand people have taken part in the SOEP study. Currently, the SOEP study has around 30,000 respondents in around 15,000 households. These data provide a basis for comparison with other groups of immigrants to Germany as well as with individuals without a background of immigration. Data from the IAB-BAMF-SOEP Refugee Survey will ultimately be linked to data from the IAB’s Integrated Employment Biographies Sample conditional on respondents’ consent. This will make it possible to trace the occupational biographies of refugees in Germany precisely.

Table 3: Approval of IEB register linking

2016 20171
M3 77% (1.693) 79% (212)
M4 80% (1.795) 67% (293)
M5 - 78% (1.766)
Appoval total 78% (3.488) 77% (2.271)
Respondents total
100% (4.465) 100% (2.956)

1Individuals in M3 and M4 were only interviewed a second time, who had not clearly denied this question in the first survey year.

This enables the employment biographies of the refugees in Germany to be traced exactly.

Title: IAB-BAMF-SOEP Survey of Refugees 2017
DOI: 10.5684/soep.iab-bamf-soep-mig.2017
Publication date: 2019-03-06
Principal investigators: Herbert Brücker, Stefan Liebig
Co-PI's: Martin Kroh, Renate Leistner-Rocca, Jan Goebel, Nina Rother, Jürgen Schupp

Affiliated staff members for providing the Scientific Use File: Charlotte Bartels, Alexandra Fedorets, Andreas Franken, Marco Giesselmann, Markus Grabka, Jannes Jacobsen, Selin Kara, Peter Krause, Hannes Kröger, Elisabeth Liebau, Maria Metzing, Jana Nebelin, Lisa Pagel, David Richter, Diana Schacht, Paul Schmelzer, Christian Schmitt, Daniel Schnitzlein, Carsten Schröder, Rainer Siegers, Manuel Siegert, Pavarti Trübswetter, Ehsan Vallizadeh, Knut Wenzig, Stefan Zimermann

Data collector: Kantar Public Deutschland

Collection period: 2016-2017

Citation:

If you publish using this data, it is mandatory to quote the following reference:

Brücker, Herbert, Martin Kroh, Simon Kühne, Elisabeth Liebau, Jürgen Schupp, Manuel Siegert, and Parvati Trübswetter. 2016. Das Studiendesign der IAB-BAMF-SOEP Befragung von Geflüchteten. SOEP Survey Papers 365 (PDF, 355.71 KB): Series C - Data Documentations (Datendokumentationen). Berlin: DIW Berlin.

Data access

Via RDC (Research Data Centre) SOEP:
The data set is also additional samples for the SOEP core study; it is integrated as sample M3, M4 and M5 in SOEP v34 (identical questionnaires, enriched with questions about the respondents' flight and migration situation). All SOEP users with a valid data transfer contract have either received the data with SOEP v34 or can order them either as SOEP v34 or separately as IAB-BAMF-SOEP Survey of Refugees data set by the usual means (SOEPhotline, website). As usual, the data record is made available free of charge via a personalised, secure download link. New users find information on the SOEP application process here .

Via Research Data Centre (FDZ) of the Federal Employment Agency at the Institute for Employment Research:
The application process of the FDZ IAB
.

Comprehensive documentation is available soon.

Please see the documentation of variables for 2016 at paneldata.org.

Preliminary questionnaires of the 2018 survey in German.

 

Additional documents:

SOEP Survey Paper (2019)
Supplementary of the IAB-BAMF-SOEP Survey of Refugees in Germany (M5) 2017Supplementary of the IAB-BAMF-SOEP Survey of Refugees in Germany (M5) 2017 (PDF, 2.27 MB)
Jannes Jacobsen, Martin Kroh, Simon Kühne, Jana A. Scheible, Rainer Siegers, and Manuel Siegert

SOEP Survey Paper (2019)
Dokumentation der Kompetenztestung im Rahmen der IAB-BAMF-SOEP-Befragung von Geflüchteten 2017
Stefan Schipolowski, Aileen Edele

SOEP Survey Papers (2017)
Sampling, Nonresponse, and Integrated Weighting of the 2016 IAB-BAMF-SOEP Survey of Refugees (M3/M4) - Revised Version
Martin Kroh, Simon Kühne, Jannes Jacobsen, Manuel Siegert, Rainer Siegers

SOEP Survey Papers (2017)
Scales Manual IAB-BAMF-SOEP Survey of Refugees in Germany - Revised Version
Jannes Jacobsen, Julius Klikar, Jürgen Schupp

SOEP Survey Papers (2016)
Das Studiendesign der IAB-BAMF-SOEP Befragung von Geflüchteten
Martin Kroh, Herbert Brücker, Simon Kühne, Elisabeth Liebau, Jürgen Schupp, Manuel Siegert, Parvati Trübswetter