RLFUNCTIONAL procedure

Fits a linear functional relationship model (M.S. Dhanoa & D.B. Baird).

Options

`PRINT` = string token	Controls printed output (`summary`, `estimates`, `fittedvalues`, `confidencelimits`, `grouptests`); default `summ`, `esti`, `conf`, `grou`
`METHOD` = string tokens	Specifies what methods to use to fit the regression (`bartlett`, `majoraxis`, `errorsinvariables`, `yonx`, `xony`, `reducedmajoraxis`, `standardmajoraxis`, `rangedmajoraxis`, `geometricmean`, `bisector`, `medyonx`, `medxony`, `qgeometricmean`, `bisector`, `qmajoraxis`, `theisenbartlettq`); default `bart`
`PLOT` = string tokens	Controls what to plot (`fitted`, `residuals`, `bootestimates`, `confidencelimits`); default `fitt`
`TITLE` = text	The title for the analysis; default title uses the `Y` and `X` identifiers
`NBOOT` = scalar	The number of samples to take for the bootstrap confidence limits; default 200
`SEED` = scalar	Seed for bootstrap randomization; default 0
`CIPROBABILITY` = scalar	Defines the size of the confidence interval; default 0.95 i.e. 95%
`CIMETHOD` = string token	Method for confidence limits (`parametric`, `bootstrap`); default `boot`
`GMETHOD` = string token	Method for comparing slopes, elevations and locations between groups (`majoraxis`, `standardmajoraxis`); default uses `standardmajoraxis` for `METHOD` settings `standardmajoraxis`, `reducedmajoraxis`, `rangedmajoraxis`, `geometricmean` or `bisector`, and `majoraxis` otherwise
`VRATIO` = scalar	Ratio between variance of `Y` and `X` variables for `METHOD=errorsinvariables`; default 1
`YRANGEMETHOD` = string token	Type of range used for `Y` when `METHOD=rangedmajoraxis` (`relative`, `interval`); default `rela`
`XRANGEMETHOD` = string token	Type of range used for `X` when `METHOD=rangedmajoraxis` (`relative`, `interval`); default `rela`
`WINDOW` = scalar	Graphics window to use for fitted-value plots; default 1
`KEYWINDOW` = scalar	Graphics window to use for key; default 2

Parameters

`Y` = variates	Y-variate for each model
`X` = variates	X-variate for each model
`SLOPE` = scalars, variates or matrices	Saves the estimated slopes
`INTERCEPT` = scalars, variates or matrices	Saves the estimated intercepts
`GROUPS` = factors	Defines groups of units
`RESIDUALS` = variates, matrices or pointers	Saves the residuals from the fitted models
`FITTEDVALUES` = variates, matrices or pointers	Saves the fitted values
`ESTIMATES` = variates, matrices or pointers	Saves the estimates
`SE` = variates, matrices or pointers	Saves the standard errors of the estimates
`LOWER` = variates, matrices or pointers	Saves lower values of confidence intervals for the estimates
`UPPER` = variates, matrices or pointers	Saves upper values of confidence intervals for the estimates
`LOWFITTEDVALUES` = variates, matrices or pointers	Saves the lower confidence limits from a bootstrap analysis of fitted values
`UPPFITTEDVALUES` = variates, matrices or pointers	Saves the upper confidence limits from a bootstrap analysis of fitted values
`TESTPROBABILITIES` = pointers	Saves the between-group test probabilities (in a symmetric matrix) for differences in slopes, elevations and locations

Description

RLFUNCTIONAL can be used to estimate the slope and intercept of a linear equation describing the relationship between two variables, when the observations on both variables are subject to error variation. This contrasts with the situation in ordinary linear regression, where we assume that only the y-variate is subject to error (the x-variate is assumed to be observed exactly). If the variation in the x-values is not accounted for, the estimate of the slope will be biased towards zero. For further details see Sokal & Rohlf (1995, Section 14.13) and Bartlett (1949). RLFUNCTIONAL can also fit standard linear regression models and quantile regression models so that these can be compared with the functional relationship models.

The y- and x-variates must be specified by the Y and X parameters respectively. The estimation methods to use are specified by the METHOD option, using the following settings.

`bartlett`	uses Bartlett’s three-group method (default).
`majoraxis`	takes the major axis from a principal component analysis (this assumes that `X` and `Y` are equally variable).
`errorsinvariables`	This fits a model that assumes the errors in `Y` and `X` are in proportion to the value specified by the `VRATIO` option. When `VRATIO` is one, this gives the same estimates as `majoraxis` (but not the same parametric confidence limits).
`yonx`	uses ordinary least squares with the dependent variable Y and independent variable `X`.
`xony`	uses ordinary least squares but with the dependent variable `X` and independent variable `Y`.
`reducedmajoraxis`	estimates the slope as the geometric mean of the regression coefficients from regressions of `Y` on `X` and `X` on `Y`.
`standardmajoraxis`	takes the geometric mean of the ordinary regression slopes (`Y` on `X` and `X` on `Y`). This is the same as reduced major axis regression, except that a different parametric estimator is used for the confidence limits.
`rangedmajoraxis`	This scales the `Y` and `X` variables before fitting a major axis regression. The scalings are controlled by the `YRANGEMETHOD` and `XRANGEMETHOD` options, respectively. The `relative` setting scales the variable by its maximum, while the `interval` setting uses its range. With the `relative` setting, the values of the variable should all be positive.
`geometricmean`	takes the geometric mean of the ordinary regression slopes (`Y` on `X` and `X` on `Y`). This is the same the reduced major axis regression, except that a different parametric estimator is used for the confidence limits.
`bisector`	estimates the slope as the bisector of the ordinary regression slopes (`Y` on `X` and `X` on `Y`).
`medyonx`	fits the median (50% quantile) regression of `Y` on `X`.
`medxony`	fits the median (50% quantile) regression of `X` on `Y`.
`qgeometricmean`	takes the geometric mean of the median regression slopes (`Y` on `X` and `X` on `Y`).
`qbisector`	estimates the slope as the bisector of the median regression slopes (`Y` on `X` and `X` on `Y`).
`qmajoraxis`	estimates the slope as the major axis of the median regression.
`theisenbartlett`	estimates the slope as the median of the median slopes for each point in the three Bartlett groups.

The GROUPS parameter allows a factor to be specified to define groupings of the data units, so that separate relationships can be investigated for each group. The probabilities of

differences in slopes, elevations (assuming a common slope) and locations (assuming a common slope and intercept for each group) between groups can be printed, or saved in a pointer using the TESTPROBABILITIES parameter. The pointer has three elements (labelled 'slopes', 'elevations' and 'locations') which save symmetric matrices. The element on the diagonal of each symmetric matrix contains the overall probability that all groups have the same estimates, and the lower triangle contains the pairwise probabilities that two groups have the same estimates. The GMETHOD option allows you to specify whether the majoraxis or standardmajoraxis method is used to calculate these tests; the default is to use standardmajoraxis for METHOD settings standardmajoraxis, reducedmajoraxis, rangedmajoraxis, geometricmean or bisector, and majoraxis for the other METHOD settings. For details of the tests see Warton et al. (2006).

The PRINT option controls printed output, with settings:

`summary`	summary of the analyses,
`estimates`	estimated slopes and intercepts with standard errors,
`fittedvalues`	fitted values and residuals,
`confidencelimits`	includes confidence intervals with the estimates,
`grouptests`	tests of slopes, elevations and locations between groups.

The default is PRINT=summ,esti,conf,grou.

The PLOT option controls what graphs are printed, with settings:

`fitted`	creates a graph showing the observed data and the lines fitted by the various methods (all on a single graph),
`residuals`	uses the `DRESIDUALS` procedure to display diagnostic plots of the residuals from each method,
`bootestimates`	creates a histogram with a skewNormal fit of the estimates from the bootstrap analysis for each method, and
`confidencelimits`	plot the fitted model for each method, with lower and upper confidence limits.

The TITLE option can supply a title for these plots. When there are no groups, the WINDOW option specifies the window to use for the fitted plot and each confidence plot, and the KEYWINDOW specifies the window to use for their keys. If there are groups, these graphs are plotted in a trellis arrangement to show all results from every group simultaneously.

The slope and intercept can be saved individually using the SLOPE and INTERCEPT parameters, or together using the ESTIMATES parameter. Their standard errors can be saved using the SE parameter. Residuals and fitted values can be saved using the FITTEDVALUES and RESIDUALS parameters. Lower and upper values from a confidence interval for the estimates can be saved using the LOWER and UPPER parameters. The probability for the confidence interval is specified by the CIPROBABILITY option (default 0.95 i.e. 95%). The type of confidence interval (parametric or bootstrap) is controlled by the CIMETHOD option. The randomization seed for CIMETHOD=bootstrap is specified by the SEED option; the default of zero continues an existing sequence of random numbers if any have already been used in the current Genstat job, or obtains a random seed using system clock if none have been used already. The number of bootstrap samples is specified by the NBOOT option (default 200). When bootstrap confidence intervals are used, the upper and lower confidence interval for the fitted values can be saved using the LOWFITTEDVALUES and UPPFITTEDVALUES parameters.

If there are no groups and a single method, SLOPE and INTERCEPT save their estimates in scalars, while ESTIMATES, SE, FITTEDVALUES, RESIDUALS, LOWER and UPPER save variates. Alternatively, if there are groups and a single method, SLOPE and INTERCEPT save their estimates in variates, while ESTIMATES, SE, FITTEDVALUES, RESIDUALS, LOWER and UPPER save matrices with a column for each group. If there are several methods, each of these parameters saves a pointer with elements labelled by the relevant METHOD setting. The pointer elements are scalars, variates or matrices according to what is being saved and whether there are groups (as defined above).

Options: PRINT, METHOD, PLOT, TITLE, NBOOT, SEED, CIPROBABILITY, CIMETHOD, GMETHOD, VRATIO, YRANGEMETHOD, XRANGEMETHOD, WINDOW, KEYWINDOW.

Parameters: Y, X, SLOPE, INTERCEPT, GROUPS, RESIDUALS, FITTEDVALUES, ESTIMATES, SE, LOWER, UPPER, LOWFITTEDVALUES, UPPFITTEDVALUES, TESTPROBABILITIES.

Method

RLFUNCTIONAL uses the methods described in Section 14.13 of Sokal & Rohlf (1995) and Warton et al. (2006). For further information, see Dhanoa et al. (2011).

Action with `RESTRICT`

If either the Y or X variates is restricted, the model is estimated using only the units not excluded by the restriction.

References

Bartlett, M.S. (1949). Fitting a straight line when both variables are subject to error. Biometrics, 5, 207-212.

Dhanoa, M.S., Sanderson, R., Lopez, S., Dijkstra, E., Kebreab, E. & France, J. (2011). Regression procedures for relationships between random variables. In: Modelling nutrient digestion and utilization in farm animals (ed. D. Sauvant, J. Van Milgen, P. Faverdin & N. Friggens), 31-39. Wageningen Academic Publishers, Wageningen.

Sokal, R.R. & Rohlf, F.J. (1995). Biometry (3rd edition). W.H. Freeman & Company, New York.

Warton, D.I., Wright, I.J., Falster, D.S. & Westoby, M. (2006). Bivariate line-fitting methods for allometry. Biological Reviews, 81, 259-291.

Example

CAPTION 'RLFUNCTIONAL example',\
        'Data from Sokal & Sneath, 1995, Biometry, page 546.';\
        STYLE=meta,plain
VARIATE [VALUES=14,17,24,25,27,33,34,37,40,41,42] Weight
&       [VALUES=61,37,65,69,54,93,87,89,100,90,97] Eggs
RLFUNCTIONAL [METHOD=Bartlett,majoraxis,reducedmajoraxis] Eggs; Weight

Updated on February 7, 2023

Was this article helpful?

Yes No