Defines the fixed model for a hierarchical or double hierarchical generalized linear model (R.W. Payne, Y. Lee, J.A. Nelder & M. Noh).
Options
DISTRIBUTION = string token |
Distribution of the data (binomial , poisson , normal , gamma ); default norm |
---|---|
LINK = string token |
Link for the fixed model (identity , logarithm , logit , reciprocal , probit , complementaryloglog ); default iden |
DISPERSION = scalar |
Value of dispersion parameter in calculation of s.e.s etc; default * for DIST=norm or gamm , and 1 for DIST=pois or bino |
DLINK = string token |
Link for the dispersion model (logarithm , reciprocal ); default loga |
DTERMS = formula |
Dispersion model; default * i.e. none |
CONSTANT = string token |
How to treat the constant (estimate , omit ) default esti |
FACTORIAL = scalar |
Limit on number of variates and/or factors in a fixed model term; default 3 |
WEIGHTS = variate |
Prior weights; default * i.e. 1 |
OFFSET = variate |
Offset variate; default * i.e. none |
DOFFSET = variate |
Offset variate for dispersion model; default * i.e. none |
DDISPERSION = scalar |
Dispersion parameter to use in a dispersion model for the residual dispersion parameter phi; default 1 |
IDISPERSION = scalar |
Initial value for the residual dispersion parameter phi; default * i.e. formed automatically |
Parameter
TERMS = formula |
Fixed model |
---|
Description
HGFIXEDMODEL
is one of several procedures with the prefix HG, which provide tools for fitting the hierarchical generalized linear models defined by Lee & Nelder (1996, 2001, 2006) and described by Lee, Nelder & Pawitan (2006). These models extend generalized linear models (GLMs) to include additional random terms in the linear predictor. They include generalized linear mixed models (GLMMs) as a special case, but do not constrain the additional terms to follow a Normal distribution and to have an identity link (as in the GLMM). For example, if the basic generalized linear model is a log-linear model (Poisson distribution and log link), a more appropriate assumption or the additional random terms might be a gamma distribution and a log link.
The role of HGFIXEDMODEL
is to specify the fixed model terms in the HGLM, and to define the distribution of the data (this corresponds to error distribution of a GLM). The fixed model is given by the TERMS
parameter. Most of the options operate similarly to those occurring in the directives FIT
and MODEL
. The link function for the fixed model is defined by the LINK
option, and the FACTORIAL
option sets a limit on the number of variates and/or factors for a term to be included in the fixed model (default 3). The CONSTANT
option indicates whether or not to include a constant term or intercept (by default this is included), and the OFFSET
option allows an offset variate to be included. The DISTRIBUTION
option defines the distribution of the data, the WEIGHTS
option allows you to specify a variate of prior weights, and the DISPERSION
option governs how the dispersion parameter is obtained.
The HGLM methodology also caters for structured dispersion models, in which fixed terms are included in the generalized linear models that are used to estimate the dispersion parameters. Currently these GLMs must have a gamma distribution. The DTERMS
option allows you to specify fixed terms for the GLM that estimates the residual dispersion parameter phi. The DLINK
parameter specifies the link to use with the dispersion model, the DOFFSET
option allows you to specify an offset variate, and the DDISPERSION
option defines the dispersion parameter for the dispersion GLM (default 1). You can also extend the GLM to become an HGLM (thus making the full model a double hierarchical generalized linear model or DHGLM), by using the HGDRANDOMMODEL
procedure to add some random terms.
The IDISPERSION
option allows you to define an initial value for the residual dispersion parameter phi. Initial values for the dispersion parameters of the additional random terms of the HGLM can be defined using the IDISPERSION
parameter of the HGRANDOMMODEL
procedure. If you set both of these, the HGANALYSE
procedure will then use them to initialize the weights that are involved in the fitting of the augmented mean model; for details see Chapter 6 of Lee, Nelder & Pawitan (2006). The default weights that are formed automatically if either of these is unset are satisfactory in most circumstances, but you may want to try your own initial values if you encounter convergemce problems.
Options: DISTRIBUTION
, LINK
, DISPERSION
, DLINK
, DTERMS
, CONSTANT
, FACTORIAL
, WEIGHTS
, OFFSET
, DOFFSET
, DDISPERSION
, IDISPERSION
.
Parameter: TERMS
.
Method
The information is stored in a workspace G5PL_HG
(accessed using the WORKSPACE
directive) for later use by HGANALYSE
.
References
Lee, Y., & Nelder, J.A. (1996). Hierarchical generalized linear models (with discussion). Journal of the Royal Statistical Society, Series B, 58, 619-678.
Lee, Y., & Nelder, J.A. (2001). Hierarchical generalized linear models: a synthesis of generalised linear models, random-effect models and structured dispersions. Biometrika, 88, 987-1006.
Lee, Y. & Nelder, J.A. (2006). Double hierarchical generalized linear models (with discussion). Appl. Statist., 55, 139-185.
Lee, Y., Nelder, J.A. & Pawitan, Y. (2006). Generalized Linear Models with Random Effects: Unified Analysis via H-likelihood. Chapman & Hall, London.
Lee, Y., Nelder, J.A. & Pawitan, Y. (2006). Generalized Linear Models with Random Effects: Unified Analysis via H-likelihood. Chapman and Hall, Boca Raton.
See also
Procedures: HGANALYSE
, HGDISPLAY
, HGDRANDOMMODEL
, HGFTEST
, HGGRAPH
, HGKEEP
, HGNONLINEAR
, HGPLOT
, HGPREDICT
, HGRANDOMMODEL
, HGRTEST
, HGSTATUS
, HGTOBITPOISSON
, HGWALD
.
Commands for: Regression analysis.
Example
CAPTION 'HGFIXEDMODEL example',!t(\ 'Number of faults in rolls of fabric of various lengths',\ '(data from Bissell (1972) Biometrika, 59, 435-441).'),\ 'Fit negative binomial: var(y) = mu + alpha * mu * mu',\ '(equivalent to Poisson gamma HGLM with saturated random effect).';\ STYLE=meta,3(plain) VARIATE [NVALUES=32] length,faults READ length,faults 551 6 651 4 832 17 375 9 715 14 868 8 271 5 630 7 491 7 372 7 645 6 441 8 895 28 458 4 642 10 492 4 543 8 842 9 905 23 542 9 522 6 122 1 657 9 170 4 738 9 371 14 735 17 749 10 495 7 716 3 952 9 417 2 : CALCULATE loglength = log(length) & loglength = loglength - mean(loglength) FACTOR [LEVELS=32; VALUES=1...32] saturated HGFIXEDMODEL [DISTRIBUTION=poisson; LINK=log] loglength HGRANDOMMODEL [DISTRIBUTION=normal; LINK=identity] saturated HGANALYSE faults