TwinLife Documentation TwinLife Documentation TwinLife Documentation
  • Overview and getting started
    • Table of content
    • Getting started
    • Data access & documentation
  • 1. About TwinLife
    • 1.1 Basic Concept
    • 1.2 Study Design and Sample Structure
    • 1.3 Where and how to get the Data
  • 2. Documentation of the study
    • 2.1 Data Documentation Website and ShortGuide
    • 2.2 Documentation within the Data Sets
    • 2.3 paneldata.org
    • 2.4 Codebooks
    • 2.5 Technical Report Series, Methodology Reports, and Working Paper Series
  • 3. Data Structure
    • 3.1 Data Formats and Data Files
    • 3.2 Person Types
    • 3.3 System of Variable Names
    • 3.4 ID Variables, Wave and Data Collection Identifiers
    • 3.5 Missing Types and their Meanings
    • 3.6 Delivered Para Data
    • 3.7 Weights
    • 3.8 Pecularities of Data
    • 3.9 How to match the Data Files
    • 3.10 Matching information from the parent-about-child questionnaire to the child's data set
  • 4. Check Routines
    • 4.1 Check routines
    • 4.2 Data Adjustment
  • 5. Generated Variables and Scales
    • 5.1 Generated Variables
    • 5.2 Generated Scales
  • 6. Publications and Citation
    • 6.1 Publications and Literature Database
    • 6.2 Citation
  • 7. Useful Links
  • Terms and Privacy
  • Downloads
  • Overview and getting started
    • Table of content
    • Getting started
    • Data access & documentation
  • 1. About TwinLife
    • 1.1 Basic Concept
    • 1.2 Study Design and Sample Structure
    • 1.3 Where and how to get the Data
  • 2. Documentation of the study
    • 2.1 Data Documentation Website and ShortGuide
    • 2.2 Documentation within the Data Sets
    • 2.3 paneldata.org
    • 2.4 Codebooks
    • 2.5 Technical Report Series, Methodology Reports, and Working Paper Series
  • 3. Data Structure
    • 3.1 Data Formats and Data Files
    • 3.2 Person Types
    • 3.3 System of Variable Names
    • 3.4 ID Variables, Wave and Data Collection Identifiers
    • 3.5 Missing Types and their Meanings
    • 3.6 Delivered Para Data
    • 3.7 Weights
    • 3.8 Pecularities of Data
    • 3.9 How to match the Data Files
    • 3.10 Matching information from the parent-about-child questionnaire to the child's data set
  • 4. Check Routines
    • 4.1 Check routines
    • 4.2 Data Adjustment
  • 5. Generated Variables and Scales
    • 5.1 Generated Variables
    • 5.2 Generated Scales
  • 6. Publications and Citation
    • 6.1 Publications and Literature Database
    • 6.2 Citation
  • 7. Useful Links

3.4 ID Variables, Wave and Data Collection Identifiers

  • Print
  • Email
  • In TwinLife, various ID variables/identifiers are available. Each person belongs to a family that has a unique family ID (fid) and to a household that has a unique household ID which is wave-specific (hid; a composite of the family ID and an indicator for the household).1  Additionally, each person also has a unique person ID (pid, which is a composite of the family ID and the person type). Although the person types except for the twins can change (i.e. ‘700 - other person’ might change to ‘110 - partner of twin’, or ‘200 - surveyed sibling’ might change to ‘201 - non-surveyed sibling’), the person ID is invariable over time.

    The family ID consists of six digits: the first digit indicates the twin birth cohort (e.g., 1 for the first cohort; note that information about birth cohort is also coded in variable cgr); the other five digits are assigned randomly. ID variables are particularly important when different data files have to be combined. To match data of different survey waves in the family-wide-format, the variable fid needs to be used; to match the master data set with the person-format, the variable pid has to be used. Please note that time variable information in the master data set need to be reshaped into the long format in order to match the data with the person-format of the survey data. Before matching the master data set with the family format, the master data set has to be restructured to family format.

    Furthermore, the variables with variable stem wav describe exactly in which survey wave (wav0200, wav0300) and subsample (wav0100) the data was assessed. The variable wid is the data collection identifier (wid == 1 stands for the first face-to-face household survey (F2F1), wid == 2 for the first telephone survey (CATI1), and so on).

     

    1 The indicator for the household and therefore the household ID itself are linked to the information whether the twins live in this household or not. It is possible that in two consecutive years two households with different household compositions have the same household ID. Therefore, hid should not be used for longitudinal analyses.

    • Prev
    • Next
    • Terms and Privacy
    • Downloads