Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 May 1;12(1):131-55.
doi: 10.1515/ijb-2015-0028.

A Case Study of the Impact of Data-Adaptive Versus Model-Based Estimation of the Propensity Scores on Causal Inferences from Three Inverse Probability Weighting Estimators

A Case Study of the Impact of Data-Adaptive Versus Model-Based Estimation of the Propensity Scores on Causal Inferences from Three Inverse Probability Weighting Estimators

Romain Neugebauer et al. Int J Biostat. .

Abstract

Objective: Consistent estimation of causal effects with inverse probability weighting estimators is known to rely on consistent estimation of propensity scores. To alleviate the bias expected from incorrect model specification for these nuisance parameters in observational studies, data-adaptive estimation and in particular an ensemble learning approach known as Super Learning has been proposed as an alternative to the common practice of estimation based on arbitrary model specification. While the theoretical arguments against the use of the latter haphazard estimation strategy are evident, the extent to which data-adaptive estimation can improve inferences in practice is not. Some practitioners may view bias concerns over arbitrary parametric assumptions as academic considerations that are inconsequential in practice. They may also be wary of data-adaptive estimation of the propensity scores for fear of greatly increasing estimation variability due to extreme weight values. With this report, we aim to contribute to the understanding of the potential practical consequences of the choice of estimation strategy for the propensity scores in real-world comparative effectiveness research.

Method: We implement secondary analyses of Electronic Health Record data from a large cohort of type 2 diabetes patients to evaluate the effects of four adaptive treatment intensification strategies for glucose control (dynamic treatment regimens) on subsequent development or progression of urinary albumin excretion. Three Inverse Probability Weighting estimators are implemented using both model-based and data-adaptive estimation strategies for the propensity scores. Their practical performances for proper confounding and selection bias adjustment are compared and evaluated against results from previous randomized experiments.

Conclusion: Results suggest both potential reduction in bias and increase in efficiency at the cost of an increase in computing time when using Super Learning to implement Inverse Probability Weighting estimators to draw causal inferences.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Crude estimates over 16 quarters of the four survival curves associated with the four TI initiation strategies dθ with θ = 7, 7.5, 8, 8.5.
Figure 2
Figure 2
Unbounded IPW estimates (no weight truncation) over 16 quarters of the four survival curves associated with the four TI initiation strategies dθ with θ = 7, 7.5, 8, 8.5. The estimates on the top left, top right, bottom left, and bottom right plots were obtained based on the estimators gn, gn,t, gn,t,x, and gn,t,SL, respectively.
Figure 3
Figure 3
Bounded IPW estimates (no weight truncation) over 16 quarters of the four survival curves associated with the four TI initiation strategies dθ with θ = 7, 7.5, 8, 8.5. The estimates on the top left, top right, bottom left, and bottom right plots were obtained based on the estimators gn, gn,t, gn,t,x, and gn,t,SL, respectively.
Figure 4
Figure 4
Bounded hazard-based IPW estimates (no weight truncation) over 16 quarters of the four survival curves associated with the four TI initiation strategies dθ with θ = 7, 7.5, 8, 8.5. The estimates on the top left, top right, bottom left, and bottom right plots were obtained based on the estimators gn, gn,t, gn,t,x, and gn,t,SL, respectively.
Figure 5
Figure 5
Distribution of the IP weights assigned to failures observed under any of the four TI initiation strategies in the unbounded IPW estimation approach. Each plot represents the distribution of the IP weights derived with the estimators gn, gn,t, gn,t,x, and gn,t,SL in the interval indicated by the labels on the x axis. These intervals are defined by the 25th, 50th, 75th, 99th, and 99.9th percentiles of the IP weights derived with the estimator gn.
Figure 6
Figure 6
Distribution of the IP weights assigned to failures observed under any of the four TI initiation strategies in the bounded IPW estimation approach. Each plot represents the distribution of the IP weights derived with the estimators gn, gn,t, gn,t,x, and gn,t,SL in the interval indicated by the labels on the x axis. These intervals are defined by the 25th, 50th, 75th, 99th, and 99.9th percentiles of the IP weights derived with the estimator gn.
Figure 7
Figure 7
Distribution of the IP weights assigned to failures observed under any of the four TI initiation strategies in the hazard-based bounded IPW estimation approach. Each plot represents the distribution of the IP weights derived with the estimators gn, gn,t, gn,t,x, and gn,t,SL in the interval indicated by the labels on the x axis. These intervals are defined by the 25th, 50th, 75th, 99th, and 99.9th percentiles of the IP weights derived with the estimator gn.

Similar articles

Cited by

References

    1. Robins JM. Association, causation and marginal structural models. Synthese. 1999;121:151–79.
    1. McCaffrey DF, Ridgeway G, Morral AR. Propensity score estimation with boosted regression for evaluating causal effects in observational studies. Psychol Methods. 2004;9:403–25. - PubMed
    1. Ridgeway G, McCaffrey DF. Comment: Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Stat Sci. 2007;22:540–3. - PMC - PubMed
    1. Petersen ML, Wang Y, van der Laan MJ, Guzman D, Riley E, Bangsberg DR. Pillbox organizers are associated with improved adherence to HIV antiretroviral therapy and viral suppression: a marginal structural model analysis. Clin Infect Dis. 2007;45:908–15. - PMC - PubMed
    1. Moore K, Neugebauer R, Lurmann F, et al. Ambient ozone concentrations cause increased hospitalizations for asthma in children: an 18-year study in Southern California. Environ Health Perspect. 2008;116:1063–70. - PMC - PubMed

MeSH terms

LinkOut - more resources

  NODES
Association 1
twitter 2