MORGAM logo

Data transfer format - MORGAM genes

  • Form: 50
  • Version: 3
  • Date: 2005-12-05

Valid HTML 4.01!
© National Institute for Health and Welfare and the MORGAM Project investigators
Last updated: 2009-05-13
For more information, please contact Päivi Laiho (firstname.lastname@thl.fi), Ari Haukijärvi (ari.haukijarvi@thl.fi) or Kari Kuulasmaa (kari.kuulasmaa@thl.fi)

The purpose of this transfer format is to provide an exact and common format for the MORGAM Genotyping Laboratories (GLAB) to transfer genes data to the MORGAM Data centre (MDC)  at National Institute for Health and Welfare (THL). The data should be sent through e-mail.

Contents



Format specification

NAME SPECIFICATION AND CODES FORMAT or Value
FORM Form number I2

|5|0|

VERSION Version of this form I1

|3|

GLAB Genotyping Laboratory code (GLAB), selected from the list of genotyping laboratories I3

|_|_|_|

SHIPMENT Shipment number I5 |_|_|_|_|_|
SHIPDATE Date of this shipment (date ANSI)
YYYYMMDD (year month day)
C8 |_|_|_|_||_|_||_|_|
CHROMOSOME Chromosome / location in gene map C20 free text
GENE_NAME Gene name as used in NCBI Entrez Gene database C20 free text
GENE_ID GeneID in NCBI Entrez Gene database I7 |_|_|_|_|_|_|_|

General Instructions

The item NAME on the document is a computer variable name used for the item by the MDC.

The item FORMAT specifies the format in which the value should be presented in the transfer data set:

FORMAT Type Format Example Comments
C Character C8 20090205 Used for dates, trailing zeroes are mandatory.
C20 "B 17 E" Location of sample in box; surround with double quotes (") if value contains spaces. Maximum of 20 characters allowed for this variable, including quotes.
C3 -80 Storage temperature, includes leading '-' sign.
F Float F5.2 13.1
2.18
Variable must include a decimal point (.). Leading and trailing zeroes are not mandatory.
I Integer I3 1
15
180
Leading zeroes are not mandatory.

Instructions for making corrections to data that have already been sent to the MDC are given in Section Data communication between the Participating Centres and the MDC.

Please contact the MDC for instructions if you can not provide information as specified in this document or if you have any problems with the interpretation of the coding for any specific items.


Specific instructions for each item

Follow these instructions carefully when creating a computer file for the data transfer from the GLAB to the MDC.

To the top of the form

FORM Form number I2

|5|0|

Number 50 indicates the "Data transfer format: MORGAM genes".

To the top of the form

VERSION Version of this form I1

|3|

Current version of this data transfer format. If the version number of the data transfer format which you are using is not "3", these instructions do not correspond to the format you are using. Version number changes if some major changes are introduced in the form, for example adding / deleting some variables.

To the top of the form

GLAB Genotyping Laboratory code C3

|_|_|_|

Enter appropriate code of your MORGAM Genotyping Laboratory from the list of MORGAM Genotyping Laboratories:

To the top of the form

SHIPMENT Shipment number I5 |_|_|_|_|_|

This is shipment number from genotyping laboratory to MDC. Usually it is a sequential number of shipments to MDC. If you do nof use it, enter
number of shipment to MDC on that particular day.

To the top of the form

SHIPDATE Date of this shipment (date ANSI)
YYYYMMDD (year month day)
C8 |_|_|_|_||_|_||_|_|

The date of the shipment (the date when you completed this data form). Format is yyyymmdd (date ANSI, 8 characters, year day month).

To the top of the form

CHROMOSOME Chromosome / location in gene map C20 free text

Enter here the chromosome as used in NCBI Entrez Gene database.

To the top of the form

GENE_NAME Gene name as used in NCBI Entrez Gene database C20 free text

Enter here the gene name as used in NCBI Entrez Gene database.

To the top of the form

GENE_ID GeneID in NCBI Entrez Gene database I7 |_|_|_|_|_|_|_|

Enter here the GeneID of this gene from NCBI Entrez Gene database.


Data transfer Instructions

The data should be prepared in ASCII comma-delimited format using semicolon (;) as delimiter, with the names of the variables in the first row. If the value of a variable contains text with delimiter used inside the text then the value should be surrounded by double quotes ("). A dot (.) shold be used as a decimal point.

To simplify the tracing of data transfers between the MPC and the MDC, the files should be named as follows:

F50_GLAB_YYYYMMDD_N.CSV, where:

To avoid possible errors in variable naming use the first row from the example below containing variable names. Example of Form 50 ASCII comma-delimited data file:

FORM;VERSION;GLAB;SHIPMENT;SHIPDATE;CHROMOSOME;GENE_NAME;GENE_ID
50;3;911;1;20090127;1q31.3;CFH;3075

The data files should be archived (ZIP format) before sending them to the MDC. The archive names should be the same as the file names. MORGAM data Error checking program will be used to prepare data (will be available later). The data should be sent to MDC (ari.haukijarvi@thl.fi) as an attachment to an e-mail message. Please make sure to save a copy of the transferred files in the sending MPC or laboratory.

When the data are received in the MDC, an acknowledgement will be sent to the sender. The data will then be checked for consistency and quality, and after any problems have been resolved, they will be entered into the MORGAM database.