Introducing Open Data Format: A Platform-Independent, Non-Proprietary, Metadata-Enriched, Multilingual Data Format and its Implementation in R and Stata

Diskussionspapiere extern

Xiaoyao Han, Tom Hartl, Knut Wenzig

Berlin: KonsortSWD, 2024, 24 S.
(Working Paper / KonsortSWD ; 10)

Abstract

This paper introduces the Open Data Format (ODF), a new, non-proprietary, multilingual, metadata enriched, and zip-compressed data format that meets the FAIR Guiding Principles for scientific data management and stewardship. The data format is specified as a CSV file with the raw data and an XML file containing the metadata both compressed into a zip file with the .zip extension. Data files can be enriched with multilingual metadata following the forthcoming DDI Codebook 2.6 standard. The paper also introduces software packages for R (opendataformat) and Stata (opendf) that provide import and export filters and enable data users to work with ODF data files in the respective environment.

Xiaoyao Han

Wissenschaftliche Mitarbeiterin in der Infrastruktureinrichtung Sozio-oekonomisches Panel

Knut Wenzig

Mitarbeiter der Infrastruktureinrichtung in der Infrastruktureinrichtung Sozio-oekonomisches Panel



Keywords: ODF, Open Data Format, Metadata, DDI Codebook, Multilingual, opendataformat, opendf, R-Package, Stata Package
DOI:
https://doi.org/10.5281/zenodo.14215268

keyboard_arrow_up