3.1 Data Formats and Data Files - TwinLife Documentation

The TwinLife data is available free of charge and can be accessed via GESIS after filling in the ➔ Data Use Agreement. The data delivery consists of a certain number of data files in SPSS and Stata format:

Master data (ZA6701_master_v$): Includes information on the gross sample, such as consistency checked variables that are stable over time (e.g., sex, year of birth, relation to the twins, zygosity, migration background) and wave-specific variables (e.g., person type, response status, family composition) about all individuals included in TwinLife in each wave. The master file includes all persons who have ever been recorded in TwinLife. This includes the target persons of the twin families, but also household members not relevant for the survey.
Survey data in person format (ZA6701_person_wid$_v$): There is one data set for each data collection. Each surveyed person has one data row. The data collection identifier is the variable wid. The survey file of a data collection includes all persons that belong to a household of which at least one person participated in that data collection. Families that (temporarily or permanently) refused to participate as a whole, are not included.
Data of covid supplemental surveys (ZA6701_person_cov$_v$): There is one data set for each covid supplemental survey. Each surveyed person has one data row. The data collection identifier is the variable cov.
Survey data in family format (ZA6701_family_wide_wid$_v$): There is one data set for each data collection. Each family has one data row with information of each participating person in the family being stored in separate variables/columns). See chapter 3.3 for more detailed information. Person format and family format data sets contain the same data using different structures.
Survey weights (ZA6701_weights_v$): A data file containing the survey weights (design, nonresponse and panel weights).
Children of twins (ZA6701_cot_v$): A data file containing information about the children of the twins, collected from the twins during the F2F4 data collection (wid 7)
Twin zygosity assessment (ZA6701_zygosity_v$): A data file with information of the twin zygosity assessment in F2F 1.
Survey mode (ZA6701_mode_wid1_v$): A data file with information on the survey mode for each variable in F2F1.
Unadjusted data of all variables collected in the PAPI survey mode (ZA6701_person_unadj_wid$_v$): One data file for each data collection with data unadjusted for filter errors for all constructs/variables that were at least partly surveyed in the PAPI mode. See chapter 4.2 for further details.

All data is provided with English and German variable descriptions. In Stata, these languages are included in one data set while in SPSS, these are separate data files. The data are checked for inconsistencies and adjusted for filter errors (for details see chapter 4).