1. Home
  2. REML Outlier Detection

REML Outlier Detection

This menu allows you to fit the variance shift outlier model (VSOM) to detect outliers in a REML analysis.

Residual term to check for outliers

The dropdown list contains the residual strata in the REML model. Select the strata that is to be checked for outliers.

Display

This controls what output is printed from the analysis:

Outliers A list of possible outliers
False discovery rate The false discovery rates for the potential outliers, which are estimated from the distribution of p-values calculated from the t-statistics using an asymptotic model, See the FDRMIXTURE procedure for more information on the estimation of false discovery rates

Plot

This controls what plots are displayed from the analysis:

Index plots The estimated statistics from the analysis plotted against their index (order) in the data set
Residual plot The usual residual plot with outlying points coloured by their outlier status

Components in the index plot

The index plot can have three components plotted:

Omega Variance shift as a ratio to the residual variance
Sigma squared Estimated residual variance under VSOM
Statistic The statistic selected in the Methods section

The graph can be given a Title by entering the text in that field. If no title is given, then default titles will be used. Use a single space in the field to suppress titles.

Methods

This controls what plots are displayed from the analysis:

Calculating statistics

The deviance associated with a variance shift for a particular unit in the strata can be calculated using 3 statistics:

t Uses the squared t-statistics (i.e. squared standardized residuals) to approximate the change in likelihood (default); this is the fastest approach
Partial likelihood The partial likelihood which approximates the change in likelihood, where the baseline model parameters are held fixed, and only the extra variance component for each unit is estimated; this is much faster than re-estimating the full model
Full likelihood The full likelihood calculated by refitting the full model with the added variance term for each unit; this can be very time-consuming

Constrain variance components to be positive

This constrains the variance components in the model, including the variance shift for the units, to be positive.

Calculating thresholds

The thresholds used to decide if an outlier is significant at the 5%, 1% or 0.1% level can be calculated two ways:

Bootstrap Uses parametric bootstrap samples, with the variance components in the baseline model, to calculate the thresholds from the percentiles of the order statistics
Approximate Uses the asymptotic distribution to calculate the thresholds

Number of samples

This option specifies the number of bootstrap samples that are performed, default 499 (as well as the “null” model where the data keep their original values).

Seed

Specifies the seed for the random number generator used to make the bootstrap samples; default 0 continues from the previous generation or (if none) initializes the seed automatically.

Save

Use these options to save results from the VSOM analysis. After selecting the Results box, you need to type the identifier of a pointer into the In: field. The elements in the results pointer are given in the VSOM help.

Display in spreadsheet

The saved results will be displayed in a new spreadsheet.

See also

Updated on April 1, 2019

Was this article helpful?