RELATE directive

Relates the observed values on a set of variates or factors to the results of a principal coordinates analysis (synonym of PCORELATE).

Options

`COORDINATES` = matrix	Points in reduced space; no default i.e. this option must be specified
`NROOTS` = scalar	Number of latent roots for printed output; default `*` requests them all to be printed

Parameters

`DATA` = variates or factors	The data variables
`TEST` = string tokens	Test type, defining how each variable is treated in the calculation of the similarity between each unit (`simplematching`, `jaccard`, `russellrao`, `dice`, `antidice`, `sneathsokal`, `rogerstanimoto`, `cityblock`, `manhattan`, `ecological`, `euclidean`, `pythagorean`, `minkowski`, `divergence`, `canberra`, `braycurtis`, `soergel`); default `*` ignores that variable
`RANGE` = scalars	Range of possible values of each variable; if omitted, the observed range is taken

Description

This directive was renamed PCORELATE in Release 14, but the original name of RELATE is currently still retained as a synonym. However, it may be removed in a future release.

One way of interpreting the principal coordinates obtained from a similarity matrix by PCO is by relating them to the original data variables. For each coordinate and each data variable, an F-statistic can be computed as if the variable and the coordinate vector were independent. This is not the case but, although the exact distribution of these pseudo F-values is not known, they do serve to rank the variables in order of importance of their contribution to the coordinate vector.

The DATA parameter lists the variables (variates or factors) that are to be related to the PCO results and the TEST parameter indicates their “type” as in the FSIMILARITY directive. The RANGE parameter contains a list of scalars, one for each variable in the DATA list, allowing you to standardize quantitative variates.

Qualitative variables (variates or factors with TEST settings simplematching - rogerstanimoto) are treated as grouping factors, and the mean coordinate for each group is calculated. Only 10 groups are catered for; group levels above 10 are combined. The pseudo F-statistic gives the between-group to within-group variance ratio. Missing values are excluded.

Quantitative variables (i.e. variates with other settings) are grouped on a scale of 0-10 (where zero signifies a value up to 0.05 of the range), and mean coordinates for each group are calculated. The printed pseudo F statistic is for a linear regression of the principal coordinate on the ungrouped data variate, after standardizing the data variate to have unit range; the regression coefficient is also printed.

The COORDINATES option must be present and must be a matrix. This represents the units in reduced space. Usually the coordinates will be from a principal coordinates analysis. The number of rows of the matrix must match the number of units present in the variables, taking account of any restriction.

The output from RELATE can be extensive. You may not be interested in relating the variables to the higher dimensions of the principal coordinates analysis even though you may have saved these in the coordinate matrix. The NROOTS option can request that results for only some of the dimensions are printed. If NROOTS is not specified, RELATE prints information for all the saved dimensions: that is, for the number of columns of the coordinates matrix.

Options: COORDINATES, NROOTS.

Parameters: DATA, TEST, RANGE.

Was this article helpful?

Yes No

Options

Parameters

Description

See also

Was this article helpful?