Abstract

Statistical approaches to overdispersion, correlated errors, shrinkage estimation, and smoothing of regression relationships may be encompassed within the framework of the generalized linear mixed model (GLMM). Given an unobserved vector of random effects, observations are assumed to be conditionally independent with means that depend on the linear predictor through a specified link function and conditional variances that are specified by a variance function, known prior weights and a scale factor. The random effects are assumed to be normally distributed with mean zero and dispersion matrix depending on unknown variance components. For problems involving time series, spatial aggregation and smoothing, the dispersion may be specified in terms of a rank deficient inverse covariance matrix. Approximation of the marginal quasi-likelihood using Laplace's method leads eventually to estimating equations based on penalized quasilikelihood or PQL for the mean parameters and pseudo-likelihood for the variances. Implementation involves repeated calls to normal theory procedures for REML estimation in variance components problems. By means of informal mathematical arguments, simulations and a series of worked examples, we conclude that PQL is of practical value for approximate inference on parameters and realizations of random effects in the hierarchical model. The applications cover overdispersion in binomial proportions of seed germination; longitudinal analysis of attack rates in epilepsy patients; smoothing of birth cohort effects in an age-cohort model of breast cancer incidence; evaluation of curvature of birth cohort effects in a case-control study of childhood cancer and obstetric radiation; spatial aggregation of lip cancer rates in Scottish counties; and the success of salamander matings in a complicated experiment involving crossing of male and female effects. 
PQL tends to underestimate somewhat the variance components and (in absolute value) fixed effects when applied to clustered binary data, but the situation improves rapidly for binomial observations having denominators greater than one.
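
The model and the PQL criterion described above can be written compactly. What follows is a sketch under stated assumptions, not a quotation: the abstract names the components but gives no symbols, so the labels here (y_i, b, x_i, z_i, alpha, theta, g, v, a_i, phi, D) are chosen for illustration.

```latex
% Notation assumed for illustration; the abstract does not fix these symbols.
% y_i: observation; b: vector of random effects; x_i, z_i: covariate vectors;
% g: link function; v: variance function; \phi: scale factor;
% a_i: known constant derived from the prior weights
%      (e.g. the reciprocal of a binomial denominator).
\begin{align*}
  E(y_i \mid b) &= \mu_i^b, &
  g(\mu_i^b) &= x_i^\top \alpha + z_i^\top b, \\
  \operatorname{var}(y_i \mid b) &= \phi\, a_i\, v(\mu_i^b), &
  b &\sim N\bigl(0,\; D(\theta)\bigr).
\end{align*}
% Penalized quasi-likelihood: for fixed \theta, choose (\alpha, b) to maximize
% the conditional quasi-likelihood minus a quadratic penalty on b.
\[
  -\frac{1}{2\phi} \sum_{i=1}^{n} d_i(y_i, \mu_i^b)
  \;-\; \frac{1}{2}\, b^\top D(\theta)^{-1} b,
  \qquad
  d_i(y, \mu) = -2 \int_{y}^{\mu} \frac{y - u}{a_i\, v(u)}\, du .
\]
```

With the identity link, v(u) = 1 and a_i = 1, the deviance term d_i reduces to (y - mu)^2 and the criterion becomes the familiar penalized least squares objective of the normal linear mixed model. When the dispersion is specified through a rank-deficient inverse covariance matrix, as the abstract mentions for time series, spatial and smoothing problems, that matrix would take the place of D(theta)^{-1} in the penalty.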

Keywords

Overdispersion, Mathematics, Generalized linear mixed model, Statistics, Quasi-likelihood, Random effects model, Smoothing, Mixed model, Variance function, Linear model, Econometrics, Negative binomial distribution, Applied mathematics, Linear regression, Poisson distribution

Publication Info

Year: 1993
Type: article
Volume: 88
Issue: 421
Pages: 9-25
Citations: 4139
Access: Closed

Citation Metrics

4139 citations (OpenAlex)

Cite This

N. E. Breslow, David Clayton (1993). Approximate Inference in Generalized Linear Mixed Models. Journal of the American Statistical Association, 88(421), 9-25. https://doi.org/10.1080/01621459.1993.10594284

Identifiers

DOI: 10.1080/01621459.1993.10594284