![]() |
Data transfer format: Transfer of case and subcohort data from MDC
|
|
© National Institute for Health and Welfare
and the MORGAM Project investigators Last updated: 2013-11-20 For more information, please contact Kari Kuulasmaa (firstname.lastname@thl.fi) |
The purpose of this form is to provide a format for the transfer of data from the selection of cases and subcohort from the MDC to MORGAM Participating Centres (MPCs) and Laboratories. The data on this form are derived from Form 65: Case and subcohort selection data and Form 21: Additional Baseline Data, and the specific instructions have been targeted for those analyzing the MORGAM data.
Data in the format specified here is provided for every member of the MORGAM cohort, even though each of them are not selected to the case-cohort set.
Case-cohort selection for MORGAM's Biomarker SubStudy is described in the sections " Case-cohort sampling in MORGAM" and " Enlargement of subsample" of the Manual.
For genetic studies, the format specified here accompanies the data on genotypes (see Form 64: Transfer of SNP genotype data from MDC). The MORGAM case-cohort design for genetic studies is described in sections "Selection of cases and cohort subsample" and "Subsample selection after extension of follow-up" of the Manual.
This format should not be used for transferring data from the MORGAM Participating Centres to the MORGAM Data Centre (MDC), because these data are generated by the MDC.
| ITEM NAME | SPECIFICATION AND CODES | FORMAT OR VALUE | |
|---|---|---|---|
Form identification: |
|||
| FORM | Form identification | I2 | |6|6| |
| VERSN | Form version | I1 | |2| |
KEY1R: |
|||
| CENTRE | MORGAM Participating Centre | C2 | |_|_| |
| RUNIT | MORGAM Reporting Unit | C2 | |_|_| |
| COHORT | Cohort identification within the RUNIT 01 = MONICA baseline survey 02 = MONICA middle survey 03 = MONICA final survey 21, 22, ... other cohorts |
C2 | |_|_| |
| SERIAL | Serial number | C6 | |_|_|_|_|_|_| |
| ROUNDS | Measurement round of the cohort | C2 | |_|_| |
| KEY1R | Identification of individual in MORGAM study CENTRE + RUNIT + COHORT + SERIAL+ROUNDS |
C14 | |_|_|_|_|_|_|_|_|_|_|_|_|_|_| |
General information on case and subcohort selection for the CENTRE/RUNIT/COHORT combination |
|||
| SELECTION | Case and subcohort selection id number | C3 | |_|_|_| |
| PHASE | Phase of selection of cases and subsample
in the cohort 01 = first 02 = second etc. |
C2 | |_|_| |
| DATE | Date when the selection of cases and cohort subsample was done (date ANSI) | C8 | |_|_|_|_||_|_||_|_| |
Case and subcohort selection data: |
|||
| ELIGSC | Eligibility of the person to the subcohort 1 = eligible 2 = ineligible |
I1 | |_| |
| PROB | Selection probability of the person to the
cohort subsample (Sampling weights are derived from Item PROB and the case status, which depends on the definition of the end-point for each analysis.) 8 if ELIGSC = 2 |
R | |
| SUBCOH | Was the person selected to the subcohort? 1 = yes 2 = no 8 if ELIGSC = 2 |
I1 | |_| |
| CASE | Was the person selected as a case to the case-cohort
set? 1 = yes 2 = no, because the the general selection criteria are not met 3 = no, because of a separate decision, although the selection criteria were met |
I1 | |_| |
Data on consent for the use of DNA (not relevant for biomarker studies): |
|||
| CONSCHD | Consent for the use of DNA to study CHD 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSSTR | Consent for the use of DNA to study stroke 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSTED | Consent for the use of DNA to study thromboembolic
disease 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSANY | Consent for the use of DNA to study any cause of death 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| FORMAT | Type | Format | Example | Comments |
|---|---|---|---|---|
| C | Character | C7 | SWE-NSWa | RUA abbreviation used in MORGAM. |
| C2 | 03 | Cohort identification | ||
| F | Float | F5.2 | 13.1 | Variable includes a decimal point (.). |
| R | Real number | R | 0.24927345 | Selection probability. |
| I | Integer | I5 | 221 10323 |
The definitions of each variable and instructions for their use are given below.
| FORM | Form identification | I2 | |6|6| |
Number 66 indicates the "Data transfer format: Transfer of case and subcohort data from MDC".
| VERSN | Version of this form | I1 | |2| |
This indicates the version number of this data transfer format entitled "Data transfer format: Transfer of case and subcohort data from MDC".
| CENTRE | MORGAM Participating Centre | C2 | |_|_| |
| RUNIT | MORGAM Reporting Unit | C2 | |_|_| |
| COHORT | Cohort identification within the RUNIT 01 = MONICA baseline survey 02 = MONICA middle survey 03 = MONICA final survey 21, 22, ... other cohorts |
C2 | |_|_| |
| SERIAL | Serial number | C6 | |_|_|_|_|_|_| |
| ROUNDS | Measurement round of the cohort 01 = baseline measurement 02 = second measurement etc. |
C2 | |_|_| |
These are key items used for identifying the record and merging it with other records of the same individual.
CENTRE is the official MORGAM Participating Centre code number, RUNIT the official MORGAM Reporting Unit code number and COHORT the official MORGAM Cohort code number as they appear in section "MORGAM Participating Centres and cohorts" of the MORGAM Manual.
SERIAL is the identification for the individual. It is unique within the combination of CENTRE, RUNIT and COHORT. ROUNDS identifies the measurement round of the cohort (repeat measurements).
| KEY1R | Identification of individual in MORGAM study CENTRE + RUNIT + COHORT + SERIAL+ROUNDS |
C14 | |_|_|_|_|_|_|_|_|_|_|_|_|_|_| |
KEY1R is constructed from items CENTRE, RUNIT, COHORT, SERIAL and ROUNDS by placing them one after the other. The purpose of this item is to facilitate the merging of different records of the same individual.
| SELECTION | Case and subcohort selection id number | C3 | |_|_|_| |
This is a unique sequence number which identifies the selection of cases and subcohorts from which these data were derived. For each selection, a report with the same SELECTION number is available in the internal MORGAM web site.
| PHASE | Phase of selection of cases and subsample
in the cohort 01 = first 02 = second etc. |
C2 | |_|_| |
The selection of cases and subcohort may have been conducted in several phases, where earlier selections have been supplemented later, for example due to extension of the follow-up period of a MORGAM cohort.
This item indicates the latest phase of the case-cohort selection for the CENTRE/RUNIT/COHORT/ROUNDS combination. The selection data in the data transfer set corresponds to combination of the latest and all earlier case and sub-cohort selections. General information on the selections is available in the internal MORGAM web site.
| DATE | Date when the selection of cases and cohort subsample was done (date ANSI) | C8 | |_|_|_|_||_|_||_|_| |
The first four numbers indicate the year, the next two the month and the last two the day of month of the latest phase of the selection.
| ELIGSC | Eligibility of the person to the subcohort 1 = eligible 2 = ineligible |
I1 | |_| |
This is the same as item ELIGSC of Form 65: Case and subcohort selection data. For more information, see
| PROB | Selection probability of the person to the
cohort subsample 8 if ELIGSC = 2 |
R |
This is the selection probability (for genetic study see Sampling within the strata) of the individual to the subcohort, i.e. the marginal probability on which the person is in the cohort subsample. The sampling weights for data analysis in the case-cohort design are derived from item PROB and the case status, which depends on the definition of the end-point for each analysis.
| SUBCOH | Was the person selected to the subcohort? 1=yes 2=no 8 if ELIGSC = 2 |
I1 | |_| |
This item indicates whether the individual belongs to the subcohort or not (for genetic study see Subcohort sampling).
| CASE | Was the person selected as a case to the case-cohort
set? 1 = yes 2 = no, because the the general selection criteria are not met 3 = no, because of a separate decision, although the selection criteria were met |
I1 | |_| |
This item indicates whether the individual was selected to the case-cohort set as a case or not (for genetic study see Identification of cases).
Note that here the term case covers the cases of all potential study endpoints, but only one or some of them are usually used for any single data analysis. It is possible that DNA are not available for a person selected as a case (see item ELIGSC).
Item CASE is derived from items CASEDTH, CASECHD, CASESTR, CASETED, CASEAP, CASEHF, CASEAF, CASEDIAB, CASEBCHD and CASEBSTR of Form 65: Case and subcohort selection data:
These items are not relevant for biomarker studies.
| CONSCHD | Consent for the use of DNA to study CHD 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSSTR | Consent for the use of DNA to study stroke 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSTED | Consent for the use of DNA to study thromboembolic
disease 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
| CONSANY | Consent for the use of DNA to study any cause of death 1 = yes, written 2 = Ethics Committee approval for anonymized analysis 3 = no consent or approval for using DNA 8 = irrelevant, DNA not available |
I1 | |_| |
These are the same as items CONSCHD, CONSSTR, CONSTED and CONSANY of Form 21: Additional baseline data.
The case-cohort set may include individuals for whom there is no consent to use the DNA for the study of some of the MORGAM end-points. Therefore, a person's data cannot be used to study:
| Date | Update |
|---|---|
| 2007-11-27 | First published version |
| 2013-11-20 | Version 2. Biomarker SubStudies, item ROUNDS |