TwinLife Documentation TwinLife Documentation TwinLife Documentation
  • Overview and getting started
    • Overview (Table of content)
    • Getting started
    • Data access & documentation
  • 1. About TwinLife
    • 1.1 Basic Concept
    • 1.2 Study Design and Sample Structure
    • 1.3 Where and how to get the Data
  • 2. Documentation of the study
    • 2.1 Data Documentation Website and ShortGuide
    • 2.2 Documentation within the Data Sets
    • 2.3 paneldata.org
    • 2.4 Codebooks
    • 2.5 Technical Report Series, Methodology Reports, and Working Paper Series
  • 3. Data Structure
    • 3.1 Data Formats and Data Files
    • 3.2 Person Types
    • 3.3 System of Variable Names
    • 3.4 ID Variables, Wave and Data Collection Identifiers
    • 3.5 Missing Types and their Meanings
    • 3.6 Delivered Para Data
    • 3.7 Weights
    • 3.8 Pecularities of Data
    • 3.9 How to match the Data Files
    • 3.10 Matching information from the parent-about-child questionnaire to the child's data set
  • 4. Check Routines
    • 4.1 Check routines
    • 4.2 Data Adjustment
  • 5. Generated Variables and Scales
    • 5.1 Generated Variables
    • 5.2 Generated Scales
  • 6. Publications and Citation
    • 6.1 Publications and Literature Database
    • 6.2 Citation
  • 7. Useful Links
  • Terms and Privacy
  • Downloads
  • Overview and getting started
    • Overview (Table of content)
    • Getting started
    • Data access & documentation
  • 1. About TwinLife
    • 1.1 Basic Concept
    • 1.2 Study Design and Sample Structure
    • 1.3 Where and how to get the Data
  • 2. Documentation of the study
    • 2.1 Data Documentation Website and ShortGuide
    • 2.2 Documentation within the Data Sets
    • 2.3 paneldata.org
    • 2.4 Codebooks
    • 2.5 Technical Report Series, Methodology Reports, and Working Paper Series
  • 3. Data Structure
    • 3.1 Data Formats and Data Files
    • 3.2 Person Types
    • 3.3 System of Variable Names
    • 3.4 ID Variables, Wave and Data Collection Identifiers
    • 3.5 Missing Types and their Meanings
    • 3.6 Delivered Para Data
    • 3.7 Weights
    • 3.8 Pecularities of Data
    • 3.9 How to match the Data Files
    • 3.10 Matching information from the parent-about-child questionnaire to the child's data set
  • 4. Check Routines
    • 4.1 Check routines
    • 4.2 Data Adjustment
  • 5. Generated Variables and Scales
    • 5.1 Generated Variables
    • 5.2 Generated Scales
  • 6. Publications and Citation
    • 6.1 Publications and Literature Database
    • 6.2 Citation
  • 7. Useful Links

3.1 Data Formats and Data Files

  • Print
  • Email
  • The TwinLife data is available free of charge and can be accessed via GESIS after filling in the ➔ Data Use Agreement. The data delivery consists of a certain number of data files in SPSS and Stata format:

    • Master data (ZA6701_master_v$): Includes information on the gross sample, such as consistency checked variables that are stable over time (sex, year of birth, relation to the twins, zygosity, migration background) and wave-specific variables (person type, response status, family composition) about all individuals included in TwinLife in each wave.

    • Survey data in person format (ZA6701_person_wid$_v$): There is one data set for each data collection (F2F 1, CATI 1, F2F 2). Each surveyed person has one data row. The data collection identifier is the variable wid.

    • Survey data in family format (ZA6701_family_wide_wid$_v$.dta): There is one data set for each data collection (F2F 1, CATI 1, F2F 2). Each family has one data row with information of each participating person in the family being stored in separate variables/columns). See chapter 3.3 for more detailed information. Person format and family format data sets contain the same data using different structures.

    • Twin zygosity assessment (ZA6701_zygosity_v$): A data file with the information of the twin zygosity assessment in F2F 1.

    • Unadjusted data of all variables collected in the PAPI survey mode (ZA6701_person_unadj_wid1_v4-0-0.dta): One data file for each data collection with data unadjusted for filter errors for all constructs/variables that were at least partly surveyed in the PAPI mode (as of data release v4-1-0 in autumn 2020). See chapter 4.2 for further details.

    All data is provided with English and German variable descriptions. In Stata, these languages are included in one data set while in SPSS, these are separate data files. The data are checked for inconsistencies and adjusted for filter errors (for details see chapter 4).

    • Prev
    • Next
    • Terms and Privacy
    • Downloads