1. Home
  2. QEXPORT procedure

QEXPORT procedure

Exports genotypic data for QTL analysis (D.A. Murray).

Options

OUTFILENAME = text Name of the file to receive the data
MAPFILENAME = text Name of the associated map file for Flapjack or MapQTL(R)
POPULATIONTYPE = string token Type of population (BC1, DH1, F2, RIL, BCxSy, CP, AMP); must be set
NGENERATIONS = scalar Number of generations for a RIL population
NAME = text Name for the header in a .loc file
MISSING = text Character to represent a missing genotype in Flapjack or R/QTL format; default '-'
SEPARATOR = text Character to separate data values in Flapjack format; default separates them by tabs
ASEPARATOR = text Character to separate allele values in Flapjack format; default '/'
FJROWS = string token Specifies whether the genotypes or markers are to be stored on the rows in Flapjack format (genotypes, markers); default geno

Parameters

MKSCORES = pointers Genotype codes for each marker
CHROMOSOMES= factors Linkage groups for the markers
POSITIONS = variates Positions within the linkage groups of markers
MKNAMES = texts Marker names
MKSETS = factors Marker sets
IDMGENOTYPES = texts Labels for genotypes
PARENTS = pointers Parent information
IDPARENTS = texts Labels used to identify the parents

Description

QEXPORT saves genotypic map data for QTL analysis. The data are saved to the file specified by the OUTFILENAME option. The format of the file is specified by the file extension, and can be either a Flapjack text genotype file (.txt), a MapQTL(R) Locus genotype file (.loc) or an R/QTL separate genotype file (.csv). If a Flapjack genotype file or a MapQTL(R) Locus genotype file name is supplied, the associated map information can be saved by setting the MAPFILENAME option to a file name with the extension .txt for Flapjack or .map for MapQTL(R). QEXPORT can thus be used to save data in Flapjack format to use with the QIBDPROBABILITIES procedure.

The type of population must be specified using the POPULATIONTYPE option. The genotypic data can be exported for F2, first generation backcross (BC1), recombinant inbred lines (RIL) and DH1 (double-haploid) populations to any of the file types. The BCxSy (backcross inbred lines), CP (cross pollinator) and AMP (association mapping) populations can be exported only using the Flapjack format. If a RIL population is being exported to MapQTL(R), the number of generations should be specified using the NGENERATIONS option. Also, for exporting to MapQTL(R), the NAME option allows you to include a name for the population (which must not contain spaces).

The marker scores should be supplied in a pointer to a set of factors using the MKSCORES parameter. Each factor within the pointer should contain data for a marker, where the same factor labels are supplied in the same order. For the BC1, DH1, F2, RIL, BCxSy and CP populations, the parent information can be supplied in a pointer to a set of texts using the PARENTS parameter. Note that the PARENTS parameter must be set for a CP population. Each text should contain the parent allele, where the position within the pointer determines the parent: for example, the first text represents parent 1, the second text parent 2 and so on. For the BC1, DH1, F2, RIL and BCxSy populations, if the PARENTS parameter is not set, then the parent information is automatically generated where parent 1 is allocated allele 1 and parent 2 is allocated allele 2. The labels for the parents can be supplied in a text using the IDPARENTS parameter.

By default, in Flapjack or R/QTL files, missing alleles are represented using the '-' character, but an alternative can be supplied using the MISSING option. In Flapjack genotype files, the separator used between marker genotype scores can be supplied using the SEPARATOR option, and the separator used between alleles using the ASEPARATOR option. For the Flapjack genotype format, the FJROWS option indicates whether the genotypes or markers are stored in the rows of the file; by default the genotypes are in the rows.

The linkage groups for each marker are supplied in a factor by the CHROMOSOMES parameter. The names of the markers are supplied in a text using the MKNAMES parameter, and the marker positions are supplied in a variate using the POSITIONS parameter. For the .csv file format, a grouping factor identifying marker sets can be supplied using the MKSETS parameter.

The genotype labels to be stored in a Flapjack or R/QTL file can be specified using the IDMGENOTYPES parameter. If this parameter is not set, the labels will be generated automatically using the values 1 to n, where n is the number of genotypes.

Options: OUTFILENAME, MAPFILENAME, POPULATIONTYPE, NGENERATIONS, NAME. MISSING, SEPARATOR, ASEPARATOR, FJROWS.

Parameters: MKSCORES, CHROMOSOMES, POSITIONS, MKNAMES, MKSETS, IDMGENOTYPES, PARENTS, IDPARENTS.

Method

The .csv file format uses an extended version of the R/QTL comma-delimited separate file for genotype data (.csvsr and .csvs), where there is an optional column for marker sets. For exporting large data sets to .csv format, the procedure uses the Dataload.dll library. In the .csvsr file format, the first row specifies the genotype or id, and there must be a column of data associated with the phenotypic data with exactly the same information. The first cell should be the name of an identifier for the genotype id, and the first row in columns 2 and 3 should be blank. Also, if marker sets are included in the file, the first row of column 4 should be left blank. Starting from row 2 in the file, the first column gives the marker names, the second column gives the linkage group for each marker, and the third column gives the positions of the markers within the linkage groups. The fourth column can be used to contain the marker sets. The remaining columns give the marker genotypes.

Action with RESTRICT

All restrictions are ignored.

See also

Procedures: EXPORT, QIMPORT, QIBDPROBABILITIES.

Commands for: Statistical genetics and QTL estimation.

Example

CAPTION 'QEXPORT example'; STYLE=meta
QIMPORT [POPULATION=F2] FILENAME='%GENDIR%/Examples/F2maize.loc';\ 
        MAPFILENAME='%GENDIR%/Examples/F2maize.map';\
        MKSCORES=mgenotypes; CHROMOSOMES=linkagegroups;\
        POSITIONS=lpos; MKNAMES=markers; IDMGENOTYPES=idmgenotypes;\
        PARENTS=parents; IDPARENTS=idparents
" Export to Flapjack format "
QEXPORT [OUTFILENAME='F2maize_genoExport.txt';\ 
        MAPFILENAME='F2maize_mapExport.txt';\
        POPULATION=F2] MKSCORES=mgenotypes; CHROMOSOMES=linkagegroups;\ 
        POSITIONS=lpos; MKNAMES=markers; IDMGENOTYPES=idmgenotypes;\
        PARENTS=parents; IDPARENTS=idparents
 " Export to R/QTL rotated separate genotype file "
QEXPORT [OUTFILENAME='F2maizeExport.csv'; POPULATION=F2] MKSCORES=mgenotypes;\
        CHROMOSOMES=linkagegroups; POSITIONS=lpos; MKNAMES=markers;\
        IDMGENOTYPES=idmgenotypes; PARENTS=parents; IDPARENTS=idparents

Updated on June 19, 2019

Was this article helpful?