Estimates linkage disequilibrium (LD) decay along a chromosome (M. Malosetti & J.T.N.M. Thissen).
|What to print (
||What to plot (
||What model to use to account for genetic relatedness (
||Provides the scores of significant principal components, obtained from an eigenvalue analysis|
||Defines groupings of genotypes into subpopulations|
||Defines which chromosome to analyse, using a level of the
||Markers with more than the specified % of missing values will be excluded from the LD calculations; default 20|
||Defines the maximum distance between markers to show in LD plots; default 30|
||General title for the plots|
||Title for the y-axis|
||Title for the x-axis|
||Genotype codes for each marker; must be set|
||Linkage groups for the markers; must be set|
||Positions within the linkage groups of markers; must be set|
||Saves the distances between markers|
||Saves the value of r2 between markers|
QLDDECAY estimates linkage disequilibrium (LD) between pairs of markers on a chromosome. The association between two markers is assessed by a linear regression model, with one marker set as response and the second one as regressor, and LD is expressed in terms of r2 values.
The model to account for genetic relatedness between genotypes is specified by the
RELATIONSHIPMODEL option, with one of the following settings:
||infers the underlying genetic substructure in the population by retaining the most significant principal components from the molecular marker matrix (Patterson et al. 2006) – the scores of the significant axes are used as covariables in the regression model, which is effectively an approximation to the structuring of the genetic variance covariance matrix by a coefficient of coancestry matrix (kinship matrix);|
||includes a factor supplied by the
||makes no correction for genetic relatedness.|
RELATIONSHIPMODEL=eigenanalysis; the scores of the significant axes are then calculated by the
QEIGENANALYSIS procedure with options
SCALE=none. Alternatively, scores calculated elsewhere can be supplied, in a pointer, using the option
LD is estimated per chromosome. It is not calculated between markers with too many missing values. The threshold is specified by the
MAX%MISSING option; default 20 (i.e. 20%). While LD is calculated along the whole of the chromosome, one expects LD decay at relatively short distances. Therefore, when plotting r2 values versus marker distances, only pairs of markers that are closer than the value specified by the
MAXDISTANCE option are displayed (default 30).
The marker scores are supplied by the
MKSCORES parameter, in a pointer containing a factor for each marker. The corresponding map information for the markers is supplied by the
POSITIONS parameters. The
CHRANALYSE option must be set to specify the chromosome for which the analysis is to be performed.
MKNAMES can be used to supply marker names that will be used to name rows and columns of output matrices. The
DISTANCE parameter can save a symmetric matrix of distances between the markers, and the
R2 parameter can save a symmetric matrix of r2 values between markers.
progress, to monitor the progress of the analysis.
PLOT option selects the graphs to plot, with settings:
||plots the probability values for the deviance ratios, on a -log10 scale, against the marker distance, and|
||gives a shade plot of the LD matrix.|
TITLE option can be used to provide a title for the graphs, and the
XTITLE options can supply titles for the y- and x-axis, respectively.
QLDDECAY handles any type of marker, taking the first allele as reference (if a bi-allelic marker) or the most frequent allele if a marker has multiple alleles. The procedure fits a linear regression with one marker taken as response and a second one used as regressor. To account for genetic relatedness, the model can also include extra covariables (either principal component scores, or a grouping factor). Models are fitted using
RYPARALLEL to perform several fits in parallel. From each fit the r2 value is stored as measure of LD between the markers. Plots are produced to display results according to the settings of the
Restrictions are not allowed.
Commands for: Statistical genetics and QTL estimation.
CAPTION 'QLDDECAY example'; STYLE=meta QIMPORT [POPULATION=amp] '%GENDIR%/Examples/LD_example_geno.txt';\ MAPFILE='%GENDIR%/Examples/LD_example_map.txt';\ MKSCORES=scores; CHROMOSOMES=mkchr; POSITIONS=mkpos;\ MKNAMES=mknames " calculate LD decay with eigenanalysis " QLDDECAY [PRINT=progress; PLOT=lddecay,ldmatrix;\ RELATIONSHIPMODEL=eigenanalysis;\ CHRANALYSE=2; MAXDISTANCE=25] \ scores; CHROMOSOMES=mkchr; POSITIONS=mkpos;\ MKNAMES=mknames; DISTANCE=distance; R2=r2 PRINT r2