Data transfer format - DNA sex determination
© National Institute for Health and Welfare and the MORGAM Project investigators
Last updated: 2010-10-15
For more information, please contact Päivi Laiho (firstname.lastname@thl.fi) or Kari Kuulasmaa (firstname.lastname@thl.fi).
The purpose of this transfer format is to provide an exact and common format for the MORGAM Central Laboratory (MCL) at the National Institute for Health and Welfare (THL) to transfer data on sex determination from DNA to the MORGAM Data Centre (MDC). Data in the format specified here should be submitted to the MDC for every subject on whom DNA was requested by the MDC (see Form 40). The data should be placed on the KTL network drive.
| NAME | SPECIFICATION AND CODES | FORMAT | Value |
|---|---|---|---|
| FORM | Form number | I2 | `\|4\|4\|` |
| VERSION | Version of this form | I1 | 2 |
| KEY2 | Sample identification | I7 | `\|_\|_\|_\|_\|_\|_\|_\|` |
| SEX_DATE | Date when sex was determined from DNA (date ANSI), YYYYMMDD (year month day) | C8 | `\|_\|_\|_\|_\|\|_\|_\|\|_\|_\|` |
| SEX_DNA | Sex determined from DNA: 1 = male; 2 = female; 3 = indeterminate; 4 = not genotyped | I1 | `\|_\|` |
| SEX_METH | Method of sex determination: 1 = PCR and agarose electrophoresis; 2 = PCR and ABI fragment analysis; 3 = Paternity testing kit; 4 = Heterozygosity of X-chromosome genotypes; 8 = irrelevant (SEX_DNA = 4) | I1 | `\|_\|` |
| CONTAMINATION | Contamination of the sample: 1 = yes; 2 = no; 8 = irrelevant (no fragment analysis or SEX_DNA = 4) | I1 | `\|_\|` |
| CONTYPE1-3 | Items CONTYPE1 ... CONTYPE3, contamination of the sample: 1 = extra alleles in one marker; 2 = extra alleles in more than one marker; 3 = imbalanced alleles in heterozygote, in one marker; 4 = imbalanced alleles in heterozygote, in more than one marker; 5 = abnormal SexPCR; 6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8); 8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4); 9 = missing | | |
| CONTYPE1 | Contamination type 1 | I1 | `\|_\|` |
| CONTYPE2 | Contamination type 2 | I1 | `\|_\|` |
| CONTYPE3 | Contamination type 3 | I1 | `\|_\|` |
| COMMENTS | Comments on this record | C100 | free text |
The NAME of each item in this document is the computer variable name that the MDC uses for that item.
| FORMAT | Type | Format | Example | Comments |
|---|---|---|---|---|
| C | Character | C8 | 19991210 | Used for dates, trailing zeroes are obligatory |
| | | C20 | "B 17 E" | Location of sample in a box, surrounded with double quotes (") if it contains spaces. Free text, a maximum of 20 characters is allowed for this variable. |
| | | C3 | -80 | Storage temperature, includes leading '-' sign |
| F | Float | F5.2 | 13.1, 2.18 | Variable must include a decimal point (.). Leading and trailing zeroes are not obligatory. |
| I | Integer | I3 | 1, 15, 180 | Leading zeroes are not obligatory |
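
The format codes in the table above can be checked mechanically before a file is sent. The following is a minimal sketch, not an MDC tool: `matches_format` is a hypothetical helper, and it assumes that the number after the letter is a maximum width (for F, the total width including the decimal point, followed by the maximum number of decimals).

```python
import re


def matches_format(value: str, fmt: str) -> bool:
    """Return True if `value` fits a format code such as I7, C8 or F5.2.

    Assumption: the width in the code is a maximum, not an exact length.
    """
    m = re.fullmatch(r"([CFI])(\d+)(?:\.(\d+))?", fmt)
    if m is None:
        raise ValueError(f"unknown format code: {fmt!r}")
    kind, width = m.group(1), int(m.group(2))
    if len(value) > width:
        return False
    if kind == "C":
        # Character: any text up to the given width.
        return True
    if kind == "I":
        # Integer: digits only, leading zeroes optional.
        return re.fullmatch(r"-?\d+", value) is not None
    # Float: must contain a decimal point and at least one digit.
    decimals = int(m.group(3) or 0)
    fm = re.fullmatch(r"-?(\d*)\.(\d*)", value)
    if fm is None or not (fm.group(1) or fm.group(2)):
        return False
    return len(fm.group(2)) <= decimals
```

For example, `matches_format("2.18", "F5.2")` is accepted, while `matches_format("13", "F5.2")` is rejected because the decimal point is missing.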
Instructions for making corrections to data that have already been sent to the MDC are
given in Section Data communication between the Participating Centres and the MDC.
Please contact the MDC for instructions if you cannot provide information as specified in
this document or if you have any problems with the interpretation of the coding for any
specific items.
Follow these instructions carefully when creating a computer file for the data transfer from the MCL to the MDC.
| FORM | Form number | I2 | `\|4\|4\|` |
|---|---|---|---|
Number 44 indicates the "Data transfer format: DNA sex determination".
| VERSION | Version of this form | I1 | `\|2\|` |
|---|---|---|---|
The current version of this data transfer format. If the version number of the format you are using is not "2", these instructions do not apply to it. The version number changes when major changes are made to the form, for example when variables are added or deleted.
| KEY2 | Sample identification | I7 | `\|_\|_\|_\|_\|_\|_\|_\|` |
|---|---|---|---|
KEY2 is the primary key identifying a DNA sample in MORGAM. KEY2 does not include any of the components of KEY1 (which identifies an individual in the MORGAM study). The KEY2 sample identification code is generated in the MDC, and its relation to KEY1 is kept in the MDC. The labels used for transferring DNA samples from the DNA storage unit in KTL to other MORGAM Genotyping Laboratories include only KEY2 codes.
| SEX_DATE | Date when sex was determined from DNA (date ANSI), YYYYMMDD (year month day) | C8 | `\|_\|_\|_\|_\|\|_\|_\|\|_\|_\|` |
|---|---|---|---|
Enter the date when sex was determined from DNA. The format is YYYYMMDD (date ANSI, 8 characters, year month day).
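
A SEX_DATE value can be checked for both shape and calendar validity. This is a small sketch under the assumption that only real calendar dates written as eight ASCII digits are acceptable; `is_valid_sex_date` is a hypothetical name.

```python
from datetime import datetime


def is_valid_sex_date(value: str) -> bool:
    """Check that `value` is an 8-digit YYYYMMDD calendar date."""
    if len(value) != 8 or not (value.isascii() and value.isdigit()):
        return False
    try:
        # strptime rejects impossible dates such as month 13 or 30 February.
        datetime.strptime(value, "%Y%m%d")
    except ValueError:
        return False
    return True
```

Checking the length first matters because `strptime` on its own would also accept shorter strings with one-digit months or days.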
| SEX_DNA | Sex determined from DNA: 1 = male; 2 = female; 3 = indeterminate; 4 = not genotyped | I1 | `\|_\|` |
|---|---|---|---|
...
| SEX_METH | Method of sex determination: 1 = PCR and agarose electrophoresis; 2 = PCR and ABI fragment analysis; 3 = Paternity testing kit; 4 = Heterozygosity of X-chromosome genotypes; 8 = irrelevant (SEX_DNA = 4) | I1 | `\|_\|` |
|---|---|---|---|
Method of sex determination:
1 = PCR and agarose electrophoresis
2 = PCR and ABI fragment analysis
3 = Paternity testing kit. ABI fragment analysis with a paternity testing panel of markers, analysed by the NPHI paternity testing laboratory.
4 = Heterozygosity of X-chromosome genotypes. This code is based on the Metabochip genotyping quality control at the Sanger Institute.
8 = irrelevant (SEX_DNA = 4)
| CONTAMINATION | Contamination of the sample: 1 = yes; 2 = no; 8 = irrelevant (no fragment analysis or SEX_DNA = 4) | I1 | `\|_\|` |
|---|---|---|---|
...
| CONTYPE1-3 | 1 = extra alleles in one marker; 2 = extra alleles in more than one marker; 3 = imbalanced alleles in heterozygote, in one marker; 4 = imbalanced alleles in heterozygote, in more than one marker; 5 = abnormal SexPCR; 6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8); 8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4); 9 = missing |
|---|---|
| CONTYPE1 | Contamination type 1 | I1 | `\|_\|` |
|---|---|---|---|
...
| CONTYPE2 | Contamination type 2 | I1 | `\|_\|` |
|---|---|---|---|
...
| CONTYPE3 | Contamination type 3 | I1 | `\|_\|` |
|---|---|---|---|
...
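
Several codes above carry a condition in parentheses, for example SEX_METH = 8 is tied to SEX_DNA = 4. The sketch below shows one literal reading of those conditions: a conditional code may only be used when its stated condition holds. This interpretation, the function name `check_record`, and the lower-case field names (taken from the example file header) are assumptions; the MDC's own consistency checks may differ. The "no fragment analysis" condition on CONTAMINATION = 8 is not checked because the document does not define it in terms of codes.

```python
# Codes defined for each item in this document.
CONTYPE_CODES = {1, 2, 3, 4, 5, 6, 8, 9}
ALLOWED = {
    "sex_dna": {1, 2, 3, 4},
    "sex_meth": {1, 2, 3, 4, 8},
    "contamination": {1, 2, 8},
    "contype1": CONTYPE_CODES,
    "contype2": CONTYPE_CODES,
    "contype3": CONTYPE_CODES,
}


def check_record(rec: dict) -> list:
    """Return a list of problems found in one record (empty if none)."""
    problems = []
    for name, allowed in ALLOWED.items():
        if rec[name] not in allowed:
            problems.append(f"{name}: code {rec[name]} is not defined")

    not_genotyped = rec["sex_dna"] == 4
    # SEX_METH 8 = irrelevant (SEX_DNA = 4)
    if rec["sex_meth"] == 8 and not not_genotyped:
        problems.append("sex_meth = 8 requires sex_dna = 4")

    for name in ("contype1", "contype2", "contype3"):
        # 6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8)
        if rec[name] == 6 and not (not_genotyped and rec["sex_meth"] == 8):
            problems.append(f"{name} = 6 requires sex_dna = 4 and sex_meth = 8")
        # 8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4)
        if rec[name] == 8 and not (rec["contamination"] in (2, 8) or not_genotyped):
            problems.append(
                f"{name} = 8 requires contamination = 2 or 8, or sex_dna = 4"
            )
    return problems
```

Returning a list of messages rather than raising on the first problem lets a sender review all inconsistencies in a record at once.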
| COMMENTS | Comments on this record | C100 | free text |
|---|---|---|---|
Enter additional comments describing any problem in the DNA sex determination.
The data should be prepared as an ASCII delimited (CSV) file using a semicolon (;) as the delimiter, with the names of the variables in the first row. If the value of a variable contains the delimiter, the value should be surrounded by double quotes ("). A dot (.) should be used as the decimal point.
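
One way to produce a file with these properties is Python's standard `csv` module, whose minimal quoting mode puts double quotes only around values that contain the delimiter or a quote character. This is a sketch, not the MDC's own tooling: `records_to_csv` is a hypothetical helper, and the `"\n"` line terminator is an assumption because the document does not specify one.

```python
import csv
import io

# Variable names in the order used by the Form 44 example file.
FIELDS = [
    "form", "version", "key2", "sex_date", "sex_dna", "sex_meth",
    "contamination", "contype1", "contype2", "contype3", "comments",
]


def records_to_csv(records) -> str:
    """Serialise an iterable of dicts as semicolon-delimited text."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=FIELDS,
        delimiter=";",
        quoting=csv.QUOTE_MINIMAL,  # quote only values that need it
        lineterminator="\n",        # assumption: not fixed by this document
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()
```

A comment such as `weak band; repeated` would be written as `"weak band; repeated"`, while values without a semicolon are left unquoted.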
To simplify the tracing of data transfers between the MPC and the MDC, the files should be named as follows:
F44_MPCXX_YYYYMMDD_N.CSV, where:
To avoid errors in variable naming, use the first row of the example below, which contains the variable names. Example of a Form 44 ASCII comma-delimited data file:

    form;version;key2;sex_date;sex_dna;sex_meth;contamination;contype1;contype2;contype3;comments
    44;2;1234567;20030526;1;1;2;8;8;8;
    44;2;1012345;20030526;2;1;2;8;8;8;

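
A file in this layout can be read back to confirm that the header row matches the expected variable names. The sketch below uses Python's standard `csv` module; `read_form44` is a hypothetical helper, and rejecting any header that differs from the example is an assumption about how strict a check should be.

```python
import csv
import io

EXPECTED_HEADER = [
    "form", "version", "key2", "sex_date", "sex_dna", "sex_meth",
    "contamination", "contype1", "contype2", "contype3", "comments",
]

# The example data file from this document.
EXAMPLE = (
    "form;version;key2;sex_date;sex_dna;sex_meth;contamination;"
    "contype1;contype2;contype3;comments\n"
    "44;2;1234567;20030526;1;1;2;8;8;8;\n"
    "44;2;1012345;20030526;2;1;2;8;8;8;\n"
)


def read_form44(text: str) -> list:
    """Parse semicolon-delimited Form 44 text into a list of dicts."""
    reader = csv.DictReader(io.StringIO(text), delimiter=";")
    if reader.fieldnames != EXPECTED_HEADER:
        raise ValueError(f"unexpected header row: {reader.fieldnames}")
    return list(reader)


rows = read_form44(EXAMPLE)
```

All values come back as strings, so the trailing semicolon in each example row yields an empty `comments` value rather than a missing field.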
The data files should be archived (ZIP) before they are sent to the MDC. The archive names should be the same as the file names. The MORGAM data error checking program will be used to prepare the data (it will be available later). The data should be sent to the MDC (Zygimantas Cepaitis (firstname.lastname@thl.fi)) as an attachment to an e-mail message. Please make sure to save a copy of the transferred files in the sending MPC or laboratory.
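
The archiving step can be scripted with Python's standard `zipfile` module. This is a sketch under one assumption: "the same as the file names" is read as the same base name with a `.ZIP` extension. `archive_data_file` is a hypothetical helper.

```python
import zipfile
from pathlib import Path


def archive_data_file(csv_path: Path) -> Path:
    """Create a ZIP archive next to `csv_path` with the same base name."""
    zip_path = csv_path.with_suffix(".ZIP")
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        # Store the file without its directory path inside the archive.
        zf.write(csv_path, arcname=csv_path.name)
    return zip_path
```

The original data file is left in place, which fits the request to keep a copy of the transferred files at the sending site.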
When the data are received in the MDC, an acknowledgement will be sent to the sender. Then the data will be checked for consistency and quality, and after any problems have been resolved, they will be entered into the MORGAM database.
| Date | Update |
|---|---|
| 2007-01-16 | Version 2 |
| 2010-10-15 | Code "4" was added to item SEX_METH |