MORGAM logo

Data transfer format: Transfer of case and subcohort data from MDC

  • Form: 66
  • Version: 1
  • Date: 2005-03-10

Valid HTML 4.01!
© National Institute for Health and Welfare and the MORGAM Project investigators
Last updated: 2007-11-27
For more information, please contact Olli Saarela (firstname.lastname@thl.fi) or Kari Kuulasmaa (firstname.lastname@thl.fi)

 The purpose of this form is to provide a format for the transfer of data from the selection of cases and subcohort from the MDC to MORGAM Participating Centres (MPC) and Laboratories. Data in the format specified here accompanies the data on genotypes (see Form 64: Transfer of SNP genotype data from MDC). The data on this form are derived from Form 65: Case and subcohort selection data and Form 21: Additional Baseline Data, and the specific instructions have been targeted for those analyzing the MORGAM data.

The MORGAM case-cohort design is described in sections "Selection of cases and cohort subsample" and "Subsample selection after extension of follow-up" of the Manual.

Contents


Format specification

ITEM NAME SPECIFICATION AND CODES FORMAT OR VALUE

Form identification:

FORM Form identification I2 |6|6|
VERSN Form version I1 |1|

Key 1:

CENTRE MORGAM Participating Centre C2 |_|_|
RUNIT MORGAM Reporting Unit C2 |_|_|
COHORT Cohort identification within the RUNIT
01 = MONICA baseline survey
02 = MONICA middle survey
03 = MONICA final survey
21, 22, ... other cohorts
C2 |_|_|
SERIAL Serial number C6 |_|_|_|_|_|_|
KEY1 Identification of individual in MORGAM study
CENTRE + RUNIT + COHORT + SERIAL
C12 |_|_|_|_|_|_|_|_|_|_|_|_|

General information on case and subcohort selection for the CENTRE/RUNIT/COHORT combination

SELECTION Case and subcohort selection id number C3 |_|_|_|
PHASE Phase of selection of cases and subsample in the cohort
01 = first
02 = second
etc.
C2 |_|_|
DATE Date when the selection of cases and cohort subsample was done (date ANSI) C8 |_|_|_|_||_|_||_|_|

Case and subcohort selection data:

ELIGSC Eligibility of the person to the subcohort
1 = eligible
2 = ineligible
I1 |_|
PROB Selection probability of the person to the cohort subsample
(Sampling weights are derived from Item PROB and the case status, which depends on the definition of the end-point for each analysis.)
8 if ELIGSC = 2
R  
SUBCOH Was the person selected to the subcohort?
1 = yes
2 = no
8 if ELIGSC = 2
I1 |_|
CASE Was the person selected as a case to the case-cohort set?
1 = yes
2 = no, because the the general selection criteria are not met
3 = no, because of a separate decision, although the selection criteria were met
I1 |_|

Data on consent for the use of DNA:

CONSCHD Consent for the use of DNA to study CHD
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSSTR Consent for the use of DNA to study stroke
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSTED Consent for the use of DNA to study thromboembolic disease
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSANY Consent for the use of DNA to study any cause of death
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|

Columns of the format specification

ITEM NAME
name used for the item in the MDC.
SPECIFICATION AND CODES
specification and values of the variable. More details can be found in the section "Definitions of the variables" below, or by following the hyperlink in column ITEM NAME.
FORMAT OR VALUE
specifies the format in which the value should be presented in the transfer data set. In the cases where the value of the variable is fixed, the value is also given in this column.
FORMAT Type Format Example Comments
C Character C7 SWE-NSWa RUA abbreviation used in MORGAM.
C2 03 Cohort identification
F Float F5.2 13.1 Variable includes a decimal point (.).
R Real number R 0.24927345 Selection probability.
I Integer I5 221
10323
 

Specific comments on each item

The definitions of each variable and instructions for their use are given below.

FORM Form identification I2 |6|6|

Number 66 indicates the "Data transfer format: Transfer of case and subcohort data from MDC".

VERSN Version of this form I1 |1|

This indicates the version number of this data transfer format entitled "Data transfer format: Transfer of case and subcohort data from MDC".

CENTRE MORGAM Participating Centre C2 |_|_|
RUNIT MORGAM Reporting Unit C2 |_|_|
COHORT Cohort identification within the RUNIT
01 = MONICA baseline survey
02 = MONICA middle survey
03 = MONICA final survey
21, 22, ... other cohorts
C2 |_|_|
SERIAL Serial number I6 |_|_|_|_|_|_|

These are key items used for identifying the record and merging it with other records of the same individual.

CENTRE is the official MORGAM Participating Centre code number, RUNIT the official MORGAM Reporting Unit code number and COHORT the official MORGAM Cohort code number as they appear in section "MORGAM Participating Centres and cohorts" of the MORGAM Manual.

SERIAL is the identification for the individual. It is unique within the combination of CENTRE, RUNIT and COHORT.

KEY1 Identification of individual in MORGAM study
CENTRE + RUNIT + COHORT + SERIAL
C12 |_|_|_|_|_|_|_|_|_|_|_|_|

KEY1 is constructed from items CENTRE, RUNIT, COHORT and SERIAL by placing them one after the other. The purpose of this item is to facilitate the merging of different records of the same individual.

SELECTION Case and subcohort selection id number C3 |_|_|_|

This is a unique sequence number which identifies the selection of cases and subcohorts from which these data were derived. For each selection, a report with the same SELECTION number is available in the internal MORGAM web site.

PHASE Phase of selection of cases and subsample in the cohort
01 = first
02 = second
etc.
C2 |_|_|

The selection of cases and subcohort may have been conducted in several phases, where earlier selections have been supplemented later, for example due to extension of the follow-up period of a MORGAM cohort.

This item indicates the latest phase of the case-cohort selection for the CENTRE/RUNIT/COHORT combination. The selection data in the data transfer set corresponds to combination of the latest and all earlier case and sub-cohort selections. General information on the selections is available in the internal MORGAM web site.

DATE Date when the selection of cases and cohort subsample was done (date ANSI) C8 |_|_|_|_||_|_||_|_|

The first four numbers indicate the year, the next two the month and the last two the day of month of the latest phase of the selection.

ELIGSC Eligibility of the person to the subcohort
1 = eligible
2 = ineligible
I1 |_|

This is the same as item ELIGSC of Form 65: Case and subcohort selection data. Briefly, the eligibility of the person to the subcohort requires that

It is possible that DNA are not available for a person eligible to the subcohort. This is because the availability of DNA was based on item Form 21, but after selection to the case-cohort set, the sample was not found or no DNA could be extracted from the sample. This is a rare situation in most cohorts, but common in those (e.g. PRIME) where the availability of DNA could be checked only after the selection of the case-cohort set.

PROB Selection probability of the person to the cohort subsample
8 if ELIGSC = 2
R  

This is the selection probability of the individual to the subcohort, i.e. the marginal probability on which the person is in the cohort subsample. The sampling weights for data analysis in the case-cohort design are derived from item PROB and the case status, which depends on the definition of the end-point for each analysis.

SUBCOH Was the person selected to the subcohort?
1=yes
2=no
8 if ELIGSC = 2
I1 |_|

This item indicates whether the individual belongs to the subcohort or not.

CASE Was the person selected as a case to the case-cohort set?
1 = yes
2 = no, because the the general selection criteria are not met
3 = no, because of a separate decision, although the selection criteria were met
I1 |_|

This item indicates whether the individual was selected to the case-cohort set as a case or not. Note that here the term case covers the cases of all potential study endpoints, but only one or some of them are usually used for any single data analysis. Genotypic data are not necessarily available for all cases (see instructions for ELIGSC ... ELIGOD above). It is possible that DNA are not available for a person selected as a case. This is because at the time of selection of the case-cohort set, the availability of DNA was based on item DNAAV of Form 21, but after the selection, the sample was not found or no DNA could be extracted from the sample. This is a rare situation in most cohorts, but common in those (e.g. PRIME) where the availability of DNA could be checked only after the selection of the case-cohort set.

Item CASE is derived from items CASEDTH, CASECHD, CASESTR, CASETED, CASEAP, CASEBCHD and CASEBSTR of Form 65: Case and subcohort selection data:

Items CONSCHD...CONSANY

CONSCHD Consent for the use of DNA to study CHD
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSSTR Consent for the use of DNA to study stroke
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSTED Consent for the use of DNA to study thromboembolic disease
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|
CONSANY Consent for the use of DNA to study any cause of death
1 = yes, written
2 = Ethics Committee approval for anonymized analysis
3 = no consent or approval for using DNA
8 = irrelevant, DNA not available
I1 |_|

These are the same as items CONSCHD, CONSSTR, CONSTED and CONSANY of Form 21: Additional baseline data.

The case-cohort set may include individuals for whom there is no consent to use the DNA for the study of some of the MORGAM end-points. Therefore, a person's data cannot be used to study: