Skip to content!

Regional Data

SOEP offers diverse possibilities for regional and spatial analysis. With the anonymized regional information on the residences of SOEP respondents (households and individuals), it is possible to link numerous regional indicators on the levels of the states (Bundesländer), spatial planning regions, districts, and postal codes with the SOEP data on these households. However, specific security provisions must be observed due to the sensitivity of the data under data protection law (see overview). Accordingly, you are not allowed to make statements on, e.g., place of residence or administrative district in your analyses, but the data does provide valuable background information.

The variable $BULA (= Bundesland) is contained in the standard data set. If you need more detailed regional information for your research work, e.g., the municipal size classes, you need an expanded data distribution contract, which consists mainly of a data protection concept to be developed by you.

To use the spatial planning regions (geocodes) you need both an expanded data distribution contract and an expanded data protection concept. After signing your contract, you will receive this data.

On research stays at DIW Berlin or using our SOEPremote system of remote computer access, you can conduct analyses on the level of small-scale official county codes (KKZ), which are considered highly sensitive data under data protection regulations.

The precondition for using SOEPremote is, again, an expanded data distribution contract and an application for the use of SOEPremote, which also constitutes an addendum to your contract. After activation of access, you can transfer your analysis syntax (currently only in STATA format) through the remote access system by e-mail to our server. This processes the task automatically-after verifying data protection requirements-and sends you the results by email in a log file.

Postal code data can only be used on site at DIW Berlin in order to prevent misuse.

Regional Analyses with SOEP Overview:

Data Editions of SOEP-Core

The highly populous states, e.g., Baden-Wuerrttemberg, Bavaria, and North Rhine-Westphalia, can be used for analysis given the large sample size. In general, the danger exists that for more detailed structural analyses, the case numbers on the specific states are too low to allow for statistically significant conclusions. The data can be evaluated, however, for "pools" of individual smaller federal states (e.g., state types).

In your data protection concept for use of the municipal size classes, the following points should be taken into account:

  • Who will work with the data?
  • Procedures for changing passwords on a regular basis
  • Description of location/ type of computer where the data are being stored
  • Procedure to prevent use of the data on other computers (also not on home PC)

This must be signed by the data protection officer of your institution.

For the use of spatial planning regions (geocodes) you must also take the following points into account:

  • isolated computer (not linked to a network)
  • at least two-stage access control for files
  • authorized user must be able to determine whether unauthorized access by others has occurred or been attempted (through a protocol)
  • procedure to prevent use of the data on other computers (also not on a home PC)
  • limitation of access to central IT facilities
  • regular checks by data protection officer

This data protection concept must be signed by the data protection officer of your institution. An example of such a data protection concept, which of course would have to be adapted to your institution, can be found below the download module.

You are welcome to send us a draft of your data protection concept by e-mail.

You can find a good overview of the possibilities for regional analyses with SOEP data in the following text, which was published in a DIW Berlin series:


Gundi Knies und C. Katharina Spieß:
Regional Data in the German Socio-Economic Panel Study (SOEP)

Peter Hintze, Tobia Lakes:
Data Documentation 46: Geographically Referenced Data in Social Science: A Service Paper for SOEP Data Users

Jan Goebel:
Job submission instructions for the SOEPremote System at DIW Berlin

Contact person

keyboard_arrow_up