
Data transfer format - DNA sex determination

  • Form: 44
  • Version: 2
  • Date: 2007-01-16

© National Institute for Health and Welfare and the MORGAM Project investigators
Last updated: 2010-10-15
For more information, please contact Päivi Laiho (firstname.lastname@thl.fi) or Kari Kuulasmaa (firstname.lastname@thl.fi)

The purpose of this transfer format is to provide an exact and common format for the MORGAM Central Laboratory (MCL) at the National Institute for Health and Welfare (THL) to transfer data on sex determination from DNA to the MORGAM Data Centre (MDC). Data in the format specified here should be submitted to the MDC for every subject on whom DNA was requested by the MDC (see Form 40). The data should be placed on the KTL network drive.


Format specification

NAME           SPECIFICATION AND CODES                             FORMAT or Value
FORM           Form number                                         I2  |4|4|
VERSION        Version of this form                                I1  2
KEY2           Sample identification                               I7  |_|_|_|_|_|_|_|
SEX_DATE       Date when sex was determined from DNA (date ANSI)   C8  |_|_|_|_||_|_||_|_|
               YYYYMMDD (year month day)
SEX_DNA        Sex determined from DNA                             I1  |_|
               1 = male
               2 = female
               3 = indeterminate
               4 = not genotyped
SEX_METH       Method of sex determination                         I1  |_|
               1 = PCR and agarose electrophoresis
               2 = PCR and ABI fragment analysis
               3 = Paternity testing kit
               4 = Heterozygosity of X-chromosome genotypes
               8 = irrelevant (SEX_DNA = 4)
CONTAMINATION  Contamination of the sample:                        I1  |_|
               1 = yes
               2 = no
               8 = irrelevant (no fragment analysis or SEX_DNA = 4)

Item CONTYPE1 ... CONTYPE3: Contamination of the sample
CONTYPE1-3     1 = extra alleles in one marker
               2 = extra alleles in more than one marker
               3 = imbalanced alleles in heterozygote, in one marker
               4 = imbalanced alleles in heterozygote, in more than one marker
               5 = abnormal SexPCR
               6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8)
               8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4)
               9 = missing

CONTYPE1       Contamination type 1                                I1  |_|
CONTYPE2       Contamination type 2                                I1  |_|
CONTYPE3       Contamination type 3                                I1  |_|
COMMENTS       Comments on this record                             C100  free text
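
The conditions given in parentheses for codes 6 and 8 imply a few cross-field checks. The Python sketch below shows one possible reading of them. The function name check_consistency and the lower-case dictionary keys are illustrative choices, not part of the MORGAM specification. The "no fragment analysis" condition for CONTAMINATION = 8 is deliberately left unchecked, because the table does not express it in terms of other codes.

```python
def check_consistency(rec):
    """Return messages for code combinations that contradict the table.

    `rec` maps lower-case variable names to integer codes. This is one
    reading of the parenthetical conditions, not an official rule set.
    """
    problems = []
    sex_dna = rec["sex_dna"]
    sex_meth = rec["sex_meth"]
    contamination = rec["contamination"]

    # SEX_METH: 8 = irrelevant (SEX_DNA = 4)
    if sex_meth == 8 and sex_dna != 4:
        problems.append("SEX_METH = 8 but SEX_DNA is not 4")

    for name in ("contype1", "contype2", "contype3"):
        code = rec[name]
        # 6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8)
        if code == 6 and not (sex_dna == 4 and sex_meth == 8):
            problems.append(f"{name.upper()} = 6 needs SEX_DNA = 4 and SEX_METH = 8")
        # 8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4)
        if code == 8 and not (contamination in (2, 8) or sex_dna == 4):
            problems.append(f"{name.upper()} = 8 needs CONTAMINATION = 2 or 8, or SEX_DNA = 4")
    return problems
```

An empty list means that none of the checked conditions were violated; it does not mean the record is otherwise valid.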


General Instructions

The item NAME on the document is a computer variable name used for the item by the MDC.

The item FORMAT specifies the format in which the value should be presented in the transfer data set:

FORMAT  Type       Format  Example     Comments
C       Character  C8      19991210    Used for dates; trailing zeroes are obligatory.
                   C20     "B 17 E"    Location of sample in a box; surrounded with double quotes (") if it contains spaces. Free text; a maximum of 20 characters is allowed for this variable.
                   C3      -80         Storage temperature; includes the leading '-' sign.
F       Float      F5.2    13.1        The value must include a decimal point (.). Leading and trailing zeroes are not obligatory.
                           2.18
I       Integer    I3      1           Leading zeroes are not obligatory.
                           15
                           180
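
The format codes above can be checked mechanically. The following Python sketch is an illustration only: the helper name matches_format is hypothetical, and it assumes that the number after C or I is a maximum length, and that in Fw.d the value may use at most w characters in total and at most d decimals. The source examples are consistent with that reading but do not state it explicitly. Signed numbers and surrounding double quotes are not handled.

```python
def matches_format(value, fmt):
    """Check a text value against a format code such as 'C8', 'I3' or 'F5.2'."""
    kind, size = fmt[0], fmt[1:]
    if kind == "C":
        # Character: any text up to the stated maximum length (assumption).
        return len(value) <= int(size)
    if kind == "I":
        # Integer: digits only; leading zeroes are optional, so only cap the length.
        return value.isascii() and value.isdigit() and len(value) <= int(size)
    if kind == "F":
        # Float: a decimal point is mandatory.
        width, decimals = (int(part) for part in size.split("."))
        whole, point, frac = value.partition(".")
        digits = whole + frac
        return (point == "." and digits.isascii() and digits.isdigit()
                and len(frac) <= decimals and len(value) <= width)
    raise ValueError(f"unknown format code: {fmt!r}")
```

For example, matches_format("13.1", "F5.2") is accepted under these assumptions, while matches_format("13", "F5.2") is rejected because the decimal point is missing.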

Instructions for making corrections to data that have already been sent to the MDC are given in Section Data communication between the Participating Centres and the MDC.

Please contact the MDC for instructions if you cannot provide information as specified in this document or if you have any problems with the interpretation of the coding for any specific items.


Specific instructions for each item

Follow these instructions carefully when creating a computer file for the data transfer from the MCL to the MDC.

 

FORM Form number I2

|4|4|

Number 44 indicates the "Data transfer format - DNA sex determination".

 

VERSION Version of this form I1

|2|

Current version of this data transfer format. If the version number of the data transfer format which you are using is not "2", these instructions do not correspond to the format you are using. The version number changes when major changes are made to the form, for example when variables are added or deleted.

 

KEY2 Sample identification I7

|_|_|_|_|_|_|_|

KEY2 is a primary key identifying a DNA sample in MORGAM. KEY2 does not include any of the components of KEY1 (which identifies an individual in the MORGAM study). The KEY2 sample identification code is generated in the MDC, and its relation with KEY1 is kept in the MDC. The labels used for transferring the DNA samples from the DNA storage unit in KTL to other MORGAM Genotyping Laboratories include only KEY2 codes.

 

Sex determination from DNA

SEX_DATE Date when sex was determined from DNA (date ANSI)
YYYYMMDD (year month day)
C8 |_|_|_|_||_|_||_|_|

Enter the date when sex was determined from DNA. The format is YYYYMMDD (date ANSI, 8 characters, year month day).
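
As an illustration, an 8-character YYYYMMDD value could be validated before transfer along the following lines. This is a minimal Python sketch; parse_sex_date is a hypothetical helper name and not part of any MORGAM software.

```python
from datetime import datetime


def parse_sex_date(text):
    """Parse an 8-character YYYYMMDD string and return a datetime.date.

    Raises ValueError for a wrong length, non-digit characters,
    or an impossible calendar date.
    """
    if len(text) != 8 or not (text.isascii() and text.isdigit()):
        raise ValueError(f"SEX_DATE must be 8 digits (YYYYMMDD): {text!r}")
    # strptime rejects impossible dates such as month 13 or 30 February.
    return datetime.strptime(text, "%Y%m%d").date()
```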

 

SEX_DNA Sex determined from DNA:
1 = male
2 = female
3 = indeterminate
4 = not genotyped
I1 |_|

...

 

SEX_METH Method of sex determination:
1 = PCR and agarose electrophoresis
2 = PCR and ABI fragment analysis
3 = Paternity testing kit
4 = Heterozygosity of X-chromosome genotypes
8 = irrelevant (SEX_DNA = 4)
I1 |_|

Method of sex determination:
1 = PCR and agarose electrophoresis
2 = PCR and ABI fragment analysis
3 = Paternity testing kit. ABI fragment analysis with a paternity testing panel of markers, analysed by the NPHI paternity testing laboratory.
4 = Heterozygosity of X-chromosome genotypes. This code is based on the Metabochip genotyping quality control at the Sanger Institute.
8 = irrelevant (SEX_DNA = 4)

CONTAMINATION Contamination of the sample:
1 = yes
2 = no
8 = irrelevant (no fragment analysis or SEX_DNA = 4)
I1 |_|

...

 

Item CONTYPE1 ... CONTYPE3: Contamination of the sample

CONTYPE1-3 1 = extra alleles in one marker
2 = extra alleles in more than one marker
3 = imbalanced alleles in heterozygote, in one marker
4 = imbalanced alleles in heterozygote, in more than one marker
5 = abnormal SexPCR
6 = during DNA extraction (SEX_DNA = 4 and SEX_METH = 8)
8 = irrelevant (CONTAMINATION = 2 or 8 or SEX_DNA = 4)
9 = missing
CONTYPE1 Contamination type 1 I1 |_|

...

 

CONTYPE2 Contamination type 2 I1 |_|

...

 

CONTYPE3 Contamination type 3 I1 |_|

...

 

COMMENTS Comments on this record C100 free text

Enter any additional comments describing a problem in DNA sex determination.


Data transfer Instructions

The data should be prepared in ASCII comma-delimited (CSV) format, using a semicolon (;) instead of a comma as the delimiter, with the names of the variables in the first row. If a value contains the delimiter, the value should be surrounded by double quotes ("). A dot (.) should be used as the decimal point.

To simplify the tracing of data transfers between the MPC and the MDC, the files should be named as follows:

F44_MPCXX_YYYYMMDD_N.CSV, where:

To avoid possible errors in variable naming, use the first row of the example below, which contains the variable names. Example of a Form 44 ASCII comma-delimited data file:

form;version;key2;sex_date;sex_dna;sex_meth;contamination;contype1;contype2;contype3;comments
44;2;1234567;20030526;1;1;2;8;8;8;
44;2;1012345;20030526;2;1;2;8;8;8;
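
A file in this layout could be produced and read back with a standard CSV library. The sketch below uses Python's csv module with a semicolon delimiter and minimal quoting, so that a value containing a semicolon is automatically surrounded by double quotes. The function names write_form44 and read_form44 are illustrative and not part of any MORGAM software.

```python
import csv
import io

# Variable names exactly as in the first row of the Form 44 example.
FIELDS = ["form", "version", "key2", "sex_date", "sex_dna", "sex_meth",
          "contamination", "contype1", "contype2", "contype3", "comments"]


def write_form44(records, stream):
    """Write a header row and the records as semicolon-delimited text."""
    writer = csv.DictWriter(stream, fieldnames=FIELDS, delimiter=";",
                            quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)


def read_form44(stream):
    """Read semicolon-delimited text back into a list of dicts (all values are strings)."""
    return list(csv.DictReader(stream, delimiter=";"))


# Small demonstration using an in-memory buffer instead of a real file.
buffer = io.StringIO()
write_form44([{"form": 44, "version": 2, "key2": 1234567, "sex_date": "20030526",
               "sex_dna": 1, "sex_meth": 1, "contamination": 2,
               "contype1": 8, "contype2": 8, "contype3": 8, "comments": ""}], buffer)
print(buffer.getvalue())
```

When writing to a real file with the csv module, the file should be opened with newline="" so that line endings are not translated twice.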

The data files should be archived (ZIP) before they are sent to the MDC. The archive names should be the same as the file names. The MORGAM data error checking program will be used to prepare the data (it will be available later). The data should be sent to the MDC (Zygimantas Cepaitis (firstname.lastname@thl.fi)) as an attachment to an e-mail message. Please make sure to save a copy of the transferred files in the sending MPC or laboratory.
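
The archiving step could be scripted as in the following Python sketch, which uses the standard zipfile module. It assumes that "the same name" means the same base name with a .zip extension; that reading, and the helper name zip_transfer_file, are illustrative and not taken from the specification.

```python
import zipfile
from pathlib import Path


def zip_transfer_file(csv_path):
    """Create a ZIP archive next to the data file, with the same base name.

    Returns the path of the archive. The original data file is kept,
    so a copy remains with the sender.
    """
    csv_path = Path(csv_path)
    zip_path = csv_path.with_suffix(".zip")  # assumed naming: same base name, .zip extension
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # Store only the file name inside the archive, not the full directory path.
        archive.write(csv_path, arcname=csv_path.name)
    return zip_path
```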

When the data are received in the MDC, an acknowledgement will be sent to the sender. Then the data will be checked for consistency and quality, and after any problems have been resolved, they will be entered into the MORGAM database.


Updates

Date Update
2007-01-16 Version 2
2010-10-15 Code "4" was added to item SEX_METH