2016
DOI: 10.1515/ijb-2015-0028
|View full text |Cite
|
Sign up to set email alerts
|

A Case Study of the Impact of Data-Adaptive Versus Model-Based Estimation of the Propensity Scores on Causal Inferences from Three Inverse Probability Weighting Estimators

Abstract: Objective Consistent estimation of causal effects with inverse probability weighting estimators is known to rely on consistent estimation of propensity scores. To alleviate the bias expected from incorrect model specification for these nuisance parameters in observational studies, data-adaptive estimation and in particular an ensemble learning approach known as Super Learning has been proposed as an alternative to the common practice of estimation based on arbitrary model specification. While the theoretical a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
37
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 26 publications
(38 citation statements)
references
References 69 publications
0
37
0
Order By: Relevance
“…Researchers in epidemiology and medicine have embraced propensity based methods as a tool for estimating the average effects of an exposure of interest. Considerable research efforts have been devoted to understanding the impact of misspecification of the propensity score model,) and more recently, authors have suggested that the need for less parametric approaches to the widely used approach of logistic regression . Pirracchio et alstudied the use of ensemble package Super Learner to estimate propensity scores and concluded that it performed better than logistic regression using propensity score matching and weighting.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Researchers in epidemiology and medicine have embraced propensity based methods as a tool for estimating the average effects of an exposure of interest. Considerable research efforts have been devoted to understanding the impact of misspecification of the propensity score model,) and more recently, authors have suggested that the need for less parametric approaches to the widely used approach of logistic regression . Pirracchio et alstudied the use of ensemble package Super Learner to estimate propensity scores and concluded that it performed better than logistic regression using propensity score matching and weighting.…”
Section: Discussionmentioning
confidence: 99%
“…Diaz and Kelly focused on inverse weighted estimators and suggested that, in the absence of subject matter knowledge regarding parametric functional forms of the propensity score, predictive accuracy should be used to select an estimator among a collection of candidates. These authors also advocated the use of Super Learner, as have others . Super Learner has also been considered in the context of longitudinal data, where it has been found useful in the presence of model misspecification .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Data-adaptive regression methods have recently been adopted in the missing data and causal inference literature to tackle the problem of model misspecification. [8][9][10][11][12][13][14][15][16][17] Techniques such as classification and regression trees, adaptive splines, neural networks, 1 -regularization, support vector machines, boosting and ensembles, etc, offer flexibility in functional form specification that is not available for traditional approaches such as the Cox proportional hazards model. However, under inconsistency of one nuisance estimator, the large-sample analysis of the resulting data-adaptive doubly robust estimators requires empirical process conditions that are often not verifiable.…”
Section: Introductionmentioning
confidence: 99%
“…However, the large sample analysis of treatment effect estimates based on machine learning requires hard-to-verify assumptions, and often yield estimators which are not n 1/2 consistent, and for which no statistical inference (ie, p values and confidence intervals) is available. Nonetheless, data-adaptive estimation has been widely used in estimation of causal effects from observational data (for a few examples, see previous works 7,[9][10][11][12] ). Indeed, the statistics field of "_targeted learning" (see, eg, van der Laan et al [13][14][15] ) is concerned with the development of optimal (n 1/2 consistent, asymptotically normal, and efficient) estimators of smooth low-dimensional parameters through the use state-of-the-art machine learning.We develop estimators for analyzing data from randomized trials with missing outcomes, when the missingness probabilities and the outcome regression are estimated with data-adaptive methods.…”
mentioning
confidence: 99%
  NODES
Idea 1
idea 1
INTERN 1
Verify 1