Data transfer format - SNP genotypic data transfer to MDC - for special situations
© National Institute for Health and Welfare and the MORGAM Project investigators

Last updated: 2012-10-17

For more information, please contact Ari Haukijärvi (firstname.lastname@thl.fi) or Kari Kuulasmaa (firstname.lastname@thl.fi).
This data transfer format is an alternative to Form 46, and should be used only in special situations, after consultation with the MDC. In this data transfer format, Key2 is replaced with Key0 and Key1. The primary format for the transfer of genotypic data to the MDC is Form 46.
(NOTE: Version 1 of Form 47, named "SNP data transfer inventory", had a
different purpose, which has become redundant.)
| VARIABLE NAME | SPECIFICATION | FORMAT AND VALUE | |
|---|---|---|---|
| FORM | Form number | I2 | |4|7| |
| VERSION | Form version | I1 | |2| |
| GLAB | Genotyping Laboratory code, selected from the list of genotyping laboratories | I3 | |_|_|_| |
| SHIPMENT | Shipment number given by the Genotyping Laboratory | I3 | |_|_|_| |
| SHIPDATE | Date of this shipment YYYYMMDD (year month day) | C8 | |_|_|_|_||_|_||_|_| |
| KEY1 | Identification of individual in MORGAM study: CENTRE + RUNIT + COHORT + SERIAL | C12 | |_|_||_|_||_|_||_|_|_|_|_|_| |
| KEY0 | Local key identifying DNA sample in the Genotyping Laboratory | C20 | free text |
| DNA_DUPL | Type of the DNA sample: 0 = primary, 1 = duplicate | I1 | |_| |
| MARKER | SNP / polymorphism name (abbreviation) as used locally | C30 | free text |
| METHOD | Method of genotyping | I1 | |_| |
| GENOTYPE | Genotype | C256 | free text |
| STATUS | Genotyping status | I1 | |_| |
| GENODATE | Genotyping analysis run date or reading date YYYYMMDD (year month day) | C8 | |_|_|_|_||_|_||_|_| |
| COMMENTS | Comments for this genotype data row (if any) | C256 | free text |
| FORMAT | Type | Code | Example | Comments |
|---|---|---|---|---|
| C | Character | C8 | 20090205 | Used for dates, leading zeroes are obligatory in days and months. |
| I | Integer | I3 | 200, 5 | Leading zeroes should not be used. |
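The two format types above can be checked mechanically before a file is sent. The following is a minimal sketch only, not an MDC tool: the function names are hypothetical, and it assumes that integer items are non-negative and that a lone `0` is acceptable.

```python
import re
from datetime import datetime


def is_valid_c8_date(value: str) -> bool:
    """Return True if value is an 8-character YYYYMMDD calendar date."""
    # Exactly eight digits, so leading zeroes in month and day are required.
    if not re.fullmatch(r"\d{8}", value):
        return False
    try:
        datetime.strptime(value, "%Y%m%d")
    except ValueError:
        # Eight digits, but not a real date (for example 20090231).
        return False
    return True


def is_valid_integer(value: str, width: int) -> bool:
    """Return True if value is an integer of at most `width` digits
    written without leading zeroes (assumption: non-negative only)."""
    return re.fullmatch(r"0|[1-9]\d*", value) is not None and len(value) <= width
```

For example, `is_valid_c8_date("20090205")` and `is_valid_integer("200", 3)` both hold, while `is_valid_integer("005", 3)` does not.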
Instructions for making corrections to data that have already been sent to the MDC are given in the section "Data communication between the Participating Centres and the MDC".
Please contact the MDC for instructions if you cannot provide information as specified in this document or if you have any problems with the interpretation of the coding for any specific items.
The detailed definition for each variable and instructions for their use are given below. Follow these instructions when creating a file for genotype data transfer from the Genotyping Laboratory to the MDC.
| FORM | Form number | I2 | |4|7| |
| The form number 47 indicates the "Data transfer format: SNP genotype data transfer to the MDC". In this form, the MDC uses both KEY1 and KEY0 to identify each individual and genotyping sample, instead of the normal KEY2 identification (see the Form 46 definition). | |||
| VERSION | Version of this form | I1 | |2| |
| Current version of this data transfer format. If the version number of the data transfer format which you are using is not "2", these instructions do not correspond to the format you are using. The version number changes when major changes are introduced in the form, for example when data items that need to be included are added or deleted. | |||
| GLAB | Genotyping Laboratory code | I3 | |_|_|_| |
Enter the appropriate code of your MORGAM Genotyping Laboratory from the list of MORGAM Genotyping Laboratories:
|
|||
| SHIPMENT | Shipment number given by the Genotyping Laboratory (GLAB) | I3 | |_|_|_| |
| This is the shipment number from the Genotyping Laboratory to the MDC. Usually this is a sequential number of shipments from the GLAB to the MDC. If you are not using this kind of general sequential number, please enter the number of the shipment to the MDC on that particular day. | |||
| SHIPDATE | Date of this shipment YYYYMMDD (year month day) | C8 | |_|_|_|_||_|_||_|_| |
| The date of this shipment (the date when you completed this data form). The date format is YYYYMMDD (date ANSI, 8 characters, year month day). | |||
| KEY1 | Identification of individual in MORGAM study (CENTRE + RUNIT + COHORT + SERIAL) | C12 | |_|_||_|_||_|_||_|_|_|_|_|_| |
| This is the primary key identifying the individual in the MORGAM study. The first 2 characters correspond to the CENTRE, the next 2 characters to the RUNIT, the next 2 to the COHORT and the last 6 characters to the SERIAL on MORGAM Form 20 ("Data transfer format: MONICA survey data"). Please make sure that the codes are identical to those used for the same persons there. It is important that there are no misprints in this entry, so that the result is connected with the correct person. | |||
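The composition of KEY1 described above can be illustrated with a short sketch. The class and function names are hypothetical, and the only check applied is the total length of 12 characters, since this document does not restrict the characters further.

```python
from typing import NamedTuple


class Key1(NamedTuple):
    centre: str  # characters 1-2
    runit: str   # characters 3-4
    cohort: str  # characters 5-6
    serial: str  # characters 7-12


def split_key1(key1: str) -> Key1:
    """Split a 12-character KEY1 into CENTRE, RUNIT, COHORT and SERIAL."""
    if len(key1) != 12:
        raise ValueError(f"KEY1 must have 12 characters, got {len(key1)}")
    return Key1(key1[0:2], key1[2:4], key1[4:6], key1[6:12])


# KEY1 taken from the example data rows in this document.
parts = split_key1("260301700002")
```

Here `parts.centre` is `"26"`, `parts.runit` is `"03"`, `parts.cohort` is `"01"` and `parts.serial` is `"700002"`.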
| KEY0 | Local key identifying DNA sample in the Genotyping Laboratory (GLAB) | C20 | free text |
| This is a local key identifying the DNA sample in the Genotyping Laboratory. It is important that there are no misprints in this entry. The KEY2 sample identification code which is used in the MORGAM study will be generated in the MDC, based on the KEY1 and KEY0 information delivered here. | |||
| DNA_DUPL | Type of the DNA sample: 0 = primary, 1 = duplicate | I1 | |_| |
| Enter here the type of the sample: a primary sample or a duplicate sample for quality control. The MDC will create the 7-character MORGAM KEY2 for the samples for all KEY0s. | |||
| MARKER | SNP / polymorphism name (abbreviation) as used locally | C30 | free text |
| Enter here the SNP / polymorphism name, as it is recorded in your local database. | |||
| METHOD | Method of genotyping | I1 | |_| |
Enter appropriate code for the genotyping method from the following list:
|
|||
| GENOTYPE | Genotype | C256 | free text |
|
Enter here alleles in capital letters (A, C, T, G) in alphabetical order, from both chromosomes, separated by slash ( / ). Deletions should be marked as D, unknown genotype as N, for example:
|
|||
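The allele ordering rule for GENOTYPE can be sketched as follows for the simple case of two single-letter alleles. This is an illustration under assumptions, not the MDC's own check: the function name is hypothetical, the deletion code D is assumed to sort alphabetically together with the nucleotide letters, and the unknown genotype code N is deliberately not handled here.

```python
ALLELE_CODES = set("ACGTD")  # A, C, G, T plus D for a deletion


def format_genotype(allele1: str, allele2: str) -> str:
    """Return the two alleles in capital letters, in alphabetical order,
    separated by a slash."""
    alleles = sorted(a.strip().upper() for a in (allele1, allele2))
    for allele in alleles:
        if allele not in ALLELE_CODES:
            raise ValueError(f"unexpected allele code: {allele!r}")
    return "/".join(alleles)
```

With this sketch, `format_genotype("g", "a")` gives `A/G` and `format_genotype("C", "C")` gives `C/C`, which match the genotypes in the example data rows of this document.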
| STATUS | Genotyping status | I1 | |_| |
Enter the code for either successful genotyping or the reason for an unknown genotype.
Use codes 1 ... 4 to specify the reason for unknown genotype if code N was used for GENOTYPE. Use code 5 if both alleles were coded using A, C, T, G or D.
|
|||
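The relation between STATUS and GENOTYPE described above can be checked row by row. The following is a minimal sketch with a hypothetical function name; it assumes that an unknown genotype is recognised by the code N appearing in the GENOTYPE value, and it does not interpret the individual meanings of codes 1 to 4.

```python
KNOWN_ALLELES = set("ACGTD")


def status_matches_genotype(genotype: str, status: int) -> bool:
    """Return True if STATUS is consistent with GENOTYPE."""
    alleles = genotype.split("/")
    if "N" in alleles:
        # Unknown genotype: the reason must be given with a code from 1 to 4.
        return status in (1, 2, 3, 4)
    if len(alleles) == 2 and all(a in KNOWN_ALLELES for a in alleles):
        # Both alleles coded with A, C, T, G or D: successful genotyping.
        return status == 5
    return False
```

For example, the combination `A/G` with status 5 is accepted, while `N` with status 5 or `C/C` with status 1 is rejected.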
| GENODATE | Genotyping analysis run date or reading date YYYYMMDD (year month day) | C8 | |_|_|_|_||_|_||_|_| |
| Enter here the analysis run date or reading date of this genotype. The date format is YYYYMMDD (date ANSI, 8 characters, year month day). | |||
| COMMENTS | Comments for this genotype data row (if any) | C256 | free text |
| Enter here comments for this data row (if any). A maximum of 256 characters is allowed for this variable. | |||
The data should be prepared in ASCII format using a semicolon (;) as the delimiter, with the names of the data items (i.e. variables) in the first row. If a text value contains the delimiter (semicolon), the value should be surrounded by double quotes ("). A dot (.) should be used as the decimal point.
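One way to produce rows in this layout is Python's standard `csv` module, which adds the double quotes only where a value contains the delimiter. The sketch below is an illustration, not a required tool: the function name is hypothetical, the line terminator `\n` is an assumption (this document does not specify one), and the comment text in the sample row is invented for the demonstration.

```python
import csv
import io

FIELDS = [
    "FORM", "VERSION", "GLAB", "SHIPMENT", "SHIPDATE", "KEY1", "KEY0",
    "DNA_DUPL", "MARKER", "METHOD", "GENOTYPE", "STATUS", "GENODATE",
    "COMMENTS",
]


def rows_to_text(rows: list[dict]) -> str:
    """Serialise rows as semicolon-delimited text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=FIELDS,
        delimiter=";",
        quotechar='"',
        quoting=csv.QUOTE_MINIMAL,  # quote only values that need it
        lineterminator="\n",        # assumption, not stated in this document
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Sample row; the comment text is invented to show the quoting.
sample = dict(zip(FIELDS, [
    "47", "2", "916", "1", "20090525", "260301700002", "1111222", "0",
    "RS2689216", "2", "A/G", "5", "20090420", "rerun; low signal",
]))
text = rows_to_text([sample])
```

In the resulting text only the COMMENTS value is quoted, because it is the only value containing a semicolon.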
To simplify the tracing of data transfers the files should be named as: F47_GLAB_YYYYMMDD_N.CSV, where
|
FORM;VERSION;GLAB;SHIPMENT;SHIPDATE;KEY1;KEY0;DNA_DUPL;MARKER;METHOD;GENOTYPE;STATUS;GENODATE;COMMENTS
47;2;916;1;20090525;260301700002;1111222;0;RS2689216;2;A/G;5;20090420;
47;2;916;1;20090525;260301700008;2222333;1;RS4756356;2;C/C;5;20090420;
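Before sending, it can be useful to read the file back and confirm that every row has the expected items. The following minimal sketch parses the example above with Python's standard `csv` module; the function name is hypothetical.

```python
import csv
import io

EXAMPLE = (
    "FORM;VERSION;GLAB;SHIPMENT;SHIPDATE;KEY1;KEY0;DNA_DUPL;MARKER;"
    "METHOD;GENOTYPE;STATUS;GENODATE;COMMENTS\n"
    "47;2;916;1;20090525;260301700002;1111222;0;RS2689216;2;A/G;5;20090420;\n"
    "47;2;916;1;20090525;260301700008;2222333;1;RS4756356;2;C/C;5;20090420;\n"
)


def read_rows(text: str) -> list[dict]:
    """Parse semicolon-delimited text whose first row holds the item names."""
    reader = csv.DictReader(io.StringIO(text), delimiter=";", quotechar='"')
    return list(reader)


rows = read_rows(EXAMPLE)
```

Each element of `rows` is a dictionary keyed by the variable names, so for instance `rows[0]["GENOTYPE"]` is `A/G` and the empty COMMENTS item is an empty string.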
The data files should be archived in ZIP format before sending to the MDC. The archive name should be the same as the file name above. Please send the archived data file to the MDC (to Ari Haukijärvi, firstname.lastname@thl.fi) as an attachment to an e-mail message. Please make sure to save a copy of the transferred files in the sending MPC or laboratory.
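The archiving step can be sketched with Python's standard `zipfile` module. This is only an illustration: the function name is hypothetical, and it assumes that "the same name" means the same base name with a `.ZIP` extension.

```python
import zipfile
from pathlib import Path


def archive_data_file(csv_path: Path) -> Path:
    """Create a ZIP archive next to csv_path with the same base name."""
    # Assumption: the archive keeps the base name and takes the .ZIP extension.
    zip_path = csv_path.with_suffix(".ZIP")
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # Store the file under its plain name, without directory components.
        archive.write(csv_path, arcname=csv_path.name)
    return zip_path
```

The original data file is left in place, which also helps with keeping a copy in the sending MPC or laboratory.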
After the data are received in the MDC, an acknowledgement will be sent to the sender. The data will be checked for consistency and quality, and after any problems have been resolved, they will be entered into the MORGAM database.