Obtains a representative selection of genotypes by means of genetic distance sampling or genetic distance optimization (J. Jansen & J.T.N.M. Thissen).
|What to print (
||The number of genotypes to be selected; must be set|
||Method to be used (
||Genotype factor; must be set|
||Input similarity matrix for each selection; must be set|
||Defines prior groupings of the genotypes|
||Logical variate indicating whether a genotype is selected (1) as cluster centre or not (0)|
||Saves the nearest cluster centres of the genotypes|
||Saves the distances of the genotypes to the nearest cluster centre|
||Seed for randomization at the start; default 0|
QGSELECT selects a representative subset of genotypes using a similarity matrix, provided by the
METHOD option specifies whether to use genetic distance sampling or genetic distance optimization, by setting it to one of the following settings:
||genetic distance sampling using the method of Jansen & Van Hintum (2006), or|
||genetic distance optimization based on K-medoids cluster analysis (Kaufman & Rouseeuw 1990).|
The default is
The factor identifying the genotypes must be supplied by the
GENOTYPES parameter, and the number of genotypes to be selected must be specified by the
NCLUSTERS option. Prior information about the grouping of the genotypes can be supplied using the
SEED parameter specifies the seed to use to randomize the genotypes at the start. The default value of zero continues an existing sequence, or (if none) initializes the seed automatically.
The genotype selection can be saved by the
SELECTED parameter, in a logical variate containing one for each genotype selected as a cluster centre, and zero for the genotypes that are not selected. The
NEIGHBOURS parameter saves the nearest cluster centre for each genotype, and the
DISTANCES parameter saves the distances of each genotype to the nearest cluster centre.
||for a summary of the selection, and|
||for monitoring information.|
Restrictions are not allowed.
Jansen, J. & Th.J.L. van Hintum (2006). Genetic distance sampling: a novel sampling method for obtaining core collections using genetic distances with an application to cultivated lettuce. Theor. Appl. Genet., 114, 421-428.
Kaufman, P. & P.J. Rousseuw (1990). Finding Groups in Data: an Introduction to Cluster Analysis. Wiley, New York.
Commands for: Statistical genetics and QTL estimation.
CAPTION 'QGSELECT example'; STYLE=meta QIMPORT [POPULATION=F2] '%GENDIR%/Examples/F2maize_geno.txt';\ MKSCORES=mkscores; MKNAMES=mknames; IDMGENOTYPES=idmgeno GROUPS idmgeno; FACTOR=geno QKINSHIP [METHOD=dice] mkscores; IDMGENOTYPES=idmgeno; KMATRIX=kmatrix QGSELECT [PRINT=MONITORING, SUMMARY; NCLUSTERS=10; METHOD=sampling]\ GENOTYPES=geno; SIMILARITY=kmatrix;\ SELECTED=SELECTED; NEIGHBOURS=NN; DIST=DISTNN; SEED=12345