RMPLCONFIDENCE procedure

Estimates profile likelihood confidence intervals of predicted group means from a linear or generalized linear model analysis (V.M. Cave).

Options

`PRINT` = string token	Controls printed output (`intervals`); default `inte`
`BACKTRANSFORM` = string token	What back-transformation to apply to the intervals on the linear scale (`link`, `none`); default `link` (i.e. the intervals are presented on the natural scale)
`PLOT` = string tokens	What to plot (`profiles`, `intervals`); default * i.e. no plots
`DECIMALS` = scalar	Number of decimal places for printing
`CIPROBABILITY` = scalar	Probability for the confidence intervals; default 0.95
`RANGE` = scalar, variate or matrix	Defines the range of values over which to evaluate the profile likelihoods; default 3
`NPOINTS` = scalar	Defines the number of values at which to evaluate the profile likelihoods initially; default 10
`NEXTRAPOINTS` = scalar	Defines the number of extra values to evaluate the profile likelihoods at, within the neighbourhoods of the confidence limits identified during the initial scan as defined by `RANGE` and `NPOINTS`; default 10
`SAVE` = regression save structure	Regression save structure to provide the information on the regression model; default * uses the most recently fitted regression model

Parameter

CISAVE = pointer Pointer that saves the lower and upper limits of the profile likelihood intervals, along with the estimated group means, labels identifying the groups, and an indicator variable for one-sided intervals

Description

RMPLCONFIDENCE estimates profile likelihood confidence intervals of group means from a linear or generalized linear model analysis. The model must have been fitted already using the regression commands MODEL and FIT in the usual way.

The groups are defined by the unique combinations of the factors in the regression model. Therefore, the model formula (defined by the FIT directive) must contain at least one factor, and if more than one factor is included, the model formula must represent the full factorial of these factors. Furthermore, the model formula must not contain covariates.

By default, the confidence intervals are produced for the most recent regression analysis. However, you can use the SAVE option to specify the save structure from another regression analysis (see the SAVE option of MODEL).

The probability for the confidence intervals is specified by the CIPROBABILITY option; the default 0.95 gives 95% confidence intervals.

To estimate the profile likelihood confidence interval of a group’s mean, RMPLCONFIDENCE refits the model with the group’s mean fixed at a given value, and then computes the change in deviance between this model and the original model, in which the group’s mean has been estimated as per normal. This process is repeated across a range of values for the group mean, aimed to be broad enough to include its lower and upper confidence limits. These changes in deviance are then compared to the critical value for the CIPROBABILITY × 100th percentile of the chi-square distribution on 1 degree of freedom, and the confidence limits are estimated by interpolating between the fixed values for the group mean that resulted in deviance changes closest to this critical value. The procedure uses the INTERPOLATE directive to perform cubic interpolation, or linear interpolation if cubic interpolation is not possible.

The range of values over which RMPLCONFIDENCE evaluates the profile likelihood of a group’s mean is defined by the option RANGE, in either a scalar, variate or matrix. If a scalar is supplied, this specifies a multiplier of the standard error of the mean, and points are evaluated at this multiple of the standard error, in both the positive and negative directions from the estimated group mean. Alternatively, if a variate is supplied, it must contain two values, specifying the deviations in the negative and positive directions from the estimated mean, respectively. Finally, if a matrix is supplied, it must have two columns and the same number of rows as group means. The value in the first column of the ith row specifies the deviation in the negative direction from the estimated group mean i, and the value in the second column of the ith row specifies the deviation in the positive direction from the estimated group mean i. The default of three evaluates the profile likelihood at points ranging up to three times the standard error of the mean in both the negative and positive directions. In general, this default works well. However, for very skewed or flat profile likelihoods, a more appropriate RANGE will need to be specified.The NPOINTS option specifies the number of points where the profile likelihood is evaluated initially, in both the negative and positive directions from the estimated mean. The default is 10. That is, 10 evenly spaced values between the lower bound defined by RANGE, up to (but not including) the estimated mean, are evaluated in order to estimate the lower limit of the confidence interval. Similarly, 10 evenly spaced values between the upper bound defined by RANGE, down to (but not including) the estimated mean, are evaluated in order to estimate the upper limit of the confidence interval.

To get more accurate estimates of the confidence limits, a finer search of the profile likelihood can be performed by setting the option NEXTRAPOINTS to an integer greater than zero. The profile likelihood at NEXTRAPOINTS is then evaluated around the neighbourhoods of both the lower and upper confidence limits identified during the initial search (as defined by RANGE and NPOINTS). For example, if the initial search identified that the lower limit was located between the values of 1 and 3, NEXTRAPOINTS=4 would cause the profile likelihood to be further evaluated at 1.4, 1.8, 2.2, 2.6. Setting NEXTRAPOINTS=0 means a finer scan is not performed; default 10.

Printed output is controlled by the PRINT option. By default, RMPLCONFIDENCE prints the profile likelihood confidence intervals, but you can suppress this by setting option PRINT=*. The number of decimals places to appear in the printed output can be set using the DECIMALS option.

The PLOT option allows you to request plots of the results, using the following settings.

profiles for plots of the profile likelihoods for each group mean. Superimposed onto these plots are a horizontal reference line at the critical value (in red), vertical projections from the critical value to the estimated confidence limits (also in red) and a vertical reference line at the estimated mean (green dotted). Blue crosses on the profile plot indicate fixed values for the group mean at which the regression model failed to successfully fit.

intervals for a plot of the estimated profile likelihood confidence intervals of the group means. By default, no plots are produced.

The CISAVE parameter can supply a pointer to store the results. The elements of the pointer are labelled as follows:

'label' text structure storing labels to identify the group means,

'estimate' variate storing the estimated group means,

'lower' variate storing the lower limits of the profile likelihood intervals,

'upper' variate storing the upper limits of the profile likelihood intervals,

'one-sided' indicator variable identifying one-sided intervals.

A one-sided interval is produced when the estimated mean lies at the bound of the assumed distribution. Thus, for regression models with a Poisson, geometric or negative binomial distribution, right-sided intervals are produced for groups whose estimated mean is 0 (i.e. at the lower bound of the distribution). For models with binomial or Bernoulli distributions, left- and right-sided intervals are produced when the number of failures, or the number of successes, is the same as the number of trials, respectively (i.e. when the estimated proportion for a group is 0 or 1).

RMPLCONFIDENCE prints, plots and saves the profile confidence intervals on the back-transformed scale (i.e. on the natural scale). However, you can obtain the intervals on the scale of the linear predictor by setting the option BACKTRANSFORM=none.

The procedure accommodates most linear and generalized linear models defined by the MODEL directive, except for models with a user defined (i.e. calculated) DISTRIBUTION or LINK function, a multinomial distribution or an absorbing factor (i.e. when the GROUPS option is used). In addition, although the model formula defined by the FIT directive may include POL, REG and COMPARISON functions, these are ignored by the procedure.

Options: PRINT, BACKTRANSFORM, PLOT, DECIMALS, CIPROBABILITY, RANGE, NPOINTS, NEXTRAPOINTS, SAVE
Parameter: CISAVE.

Example

CAPTION        'RMPLCONFIDENCE examples'; STYLE=meta
" Linear regression model "
SPLOAD         '%Data%/Calcium.gsh'
MODEL          calcium
FIT            region
RMPLCONFIDENCE [PLOT=profiles,intervals]
" GLM: log-linear model "
SPLOAD         '%Data%/Pups.gsh'
MODEL          [DISTRIBUTION=Poisson; LINK=log] pups
FIT            diet*time
RMPLCONFIDENCE [PLOT=profiles,intervals]

Updated on February 3, 2023

Was this article helpful?

Yes No

Options

Parameter

Description

See also

Example

Was this article helpful?