Abstract
Propensity-score matching is frequently used in the medical literature to reduce or eliminate the effect of treatment-selection bias when estimating the effects of treatments or exposures on outcomes using observational data. In propensity-score matching, pairs of treated and untreated subjects with similar propensity scores are formed. Recent systematic reviews of the use of propensity-score matching found that the large majority of researchers ignore the matched nature of the propensity-score matched sample when estimating the statistical significance of the treatment effect. We conducted a series of Monte Carlo simulations to examine the impact of ignoring the matched nature of the sample on Type I error rates, coverage of confidence intervals, and variance estimation of the treatment effect. We examined the estimation of differences in means, relative risks, odds ratios, rate ratios from Poisson models, and hazard ratios from Cox regression models. Accounting for the matched nature of the propensity-score matched sample tended to result in Type I error rates that were closer to the nominal level than when matching was not incorporated into the analyses. Similarly, accounting for the matched nature of the sample tended to result in confidence intervals with coverage rates that were closer to the nominal level than when matching was not taken into account. Finally, accounting for the matched nature of the sample resulted in standard error estimates that more closely reflected the sampling variability of the estimated treatment effect than when matching was not taken into account.
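For the simplest estimand listed in the abstract, a difference in means, the distinction between ignoring and accounting for matching can be illustrated with a small sketch. This is not the authors' simulation design. The data-generating model (a shared within-pair component standing in for the similarity induced by matching), the function names, and all parameter values are assumptions made for illustration only.

```python
import math
import random
import statistics


def simulate_matched_pairs(n_pairs, effect=0.0, shared_sd=1.0, noise_sd=1.0, seed=0):
    """Generate outcomes for hypothetical matched pairs.

    Assumption: both members of a pair share one random component, which
    makes their outcomes positively correlated, as one might expect when
    subjects are matched on a similar propensity score.
    """
    rng = random.Random(seed)
    treated, control = [], []
    for _ in range(n_pairs):
        shared = rng.gauss(0.0, shared_sd)  # component common to the pair
        treated.append(shared + effect + rng.gauss(0.0, noise_sd))
        control.append(shared + rng.gauss(0.0, noise_sd))
    return treated, control


def se_unpaired(treated, control):
    """Standard error of the difference in means, treating the two groups
    as independent samples (i.e. ignoring the matching)."""
    return math.sqrt(
        statistics.variance(treated) / len(treated)
        + statistics.variance(control) / len(control)
    )


def se_paired(treated, control):
    """Standard error of the difference in means based on within-pair
    differences (i.e. accounting for the matching)."""
    diffs = [t - c for t, c in zip(treated, control)]
    return statistics.stdev(diffs) / math.sqrt(len(diffs))


if __name__ == "__main__":
    treated, control = simulate_matched_pairs(2000, seed=1)
    print("SE ignoring matching:      ", se_unpaired(treated, control))
    print("SE accounting for matching:", se_paired(treated, control))
```

Under this assumed model the within-pair outcomes are positively correlated, so the independent-samples formula overstates the standard error relative to the paired formula. That is one mechanism by which ignoring matching could distort Type I error rates and confidence-interval coverage. The sketch does not cover the other estimands in the abstract.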
Publication Info
- Year: 2009
- Type: article
- Volume: 5
- Issue: 1
- Pages: Article 13
- Citations: 184
- Access: Closed
Identifiers
- DOI: 10.2202/1557-4679.1146