Covariate Balancing Propensity Score Kosuke Imai Princeton - PowerPoint PPT Presentation

Covariate Balancing Propensity Score Kosuke Imai Princeton University Winter Conference in Statistics Borgafjäll, Sweden March 9, 2015 Joint work with Christian Fong, and Marc Ratkovic Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 1 / 48

Motivation and Overview Central role of propensity score in causal inference Adjusting for observed confounding in observational studies Generalizing experimental and instrumental variables estimates Propensity score tautology sensitivity to model misspecification adhoc specification searches Covariate Balancing Propensity Score (CBPS) Estimate the propensity score such that covariates are balanced Inverse probability weights for marginal structural models Three cases: Binary treatment 1 Time-varying binary treatments in longitudinal settings 2 Multi-valued and continuous treatments 3 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 2 / 48

Propensity Score Notation: T i ∈ { 0 , 1 } : binary treatment X i : pre-treatment covariates Dual characteristics of propensity score: Predicts treatment assignment: 1 π ( X i ) = Pr ( T i = 1 | X i ) Balances covariates (Rosenbaum and Rubin, 1983): 2 T i ⊥ ⊥ X i | π ( X i ) But, propensity score must be estimated (more on this later) Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 3 / 48

Use of Propensity Score for Causal Inference Matching Subclassification Weighting (Horvitz-Thompson): � T i Y i � n � 1 π ( X i ) − ( 1 − T i ) Y i n ˆ 1 − ˆ π ( X i ) i = 1 where weights are often normalized Doubly-robust estimators (Robins et al. ): n 1 �� µ ( 1 , X i ) + T i ( Y i − ˆ µ ( 1 , X i )) � � µ ( 0 , X i ) + ( 1 − T i )( Y i − ˆ µ ( 0 , X i )) �� ˆ ˆ − n π ( X i ) ˆ 1 − ˆ π ( X i ) i = 1 They have become standard tools for applied researchers Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 4 / 48

Weighting to Balance Covariates � � π β ( X i ) − ( 1 − T i ) X i T i X i = 0 Balancing condition: E 1 − π β ( X i ) 6 5 4 3 2 ATE weighted ATE weighted Treated units Control units 1 0 0.0 0.2 0.4 0.6 0.8 1.0 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 5 / 48

Propensity Score Tautology Propensity score is unknown and must be estimated Dimension reduction is purely theoretical: must model T i given X i Diagnostics: covariate balance checking In theory: ellipsoidal covariate distributions = ⇒ equal percent bias reduction In practice: skewed covariates and adhoc specification searches Propensity score methods are sensitive to model misspecification Tautology: propensity score methods only work when they work Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 6 / 48

Kang and Schafer (2007, Statistical Science ) Simulation study: the deteriorating performance of propensity score weighting methods when the model is misspecified 4 covariates X ∗ i : all are i.i.d. standard normal Outcome model: linear model Propensity score model: logistic model with linear predictors Misspecification induced by measurement error: X i 1 = exp ( X ∗ i 1 / 2 ) X i 2 = X ∗ i 2 / ( 1 + exp ( X ∗ 1 i ) + 10 ) X i 3 = ( X ∗ i 1 X ∗ i 3 / 25 + 0 . 6 ) 3 X i 4 = ( X ∗ i 1 + X ∗ i 4 + 20 ) 2 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 7 / 48

Weighting Estimators Evaluated Horvitz-Thompson (HT): 1 � T i Y i � n � 1 π ( X i ) − ( 1 − T i ) Y i n ˆ 1 − ˆ π ( X i ) i = 1 Inverse-probability weighting with normalized weights (IPW): 2 HT with normalized weights (Hirano, Imbens, and Ridder) Weighted least squares regression (WLS): linear regression with 3 HT weights Doubly-robust least squares regression (DR): consistently 4 estimates the ATE if either the outcome or propensity score model is correct (Robins, Rotnitzky, and Zhao) Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 8 / 48

Weighting Estimators Do Fine If the Model is Correct Bias RMSE Sample size Estimator GLM True GLM True (1) Both models correct HT 0 . 33 1 . 19 12 . 61 23 . 93 IPW − 0 . 13 − 0 . 13 3 . 98 5 . 03 n = 200 WLS − 0 . 04 − 0 . 04 2 . 58 2 . 58 DR − 0 . 04 − 0 . 04 2 . 58 2 . 58 HT 0 . 01 − 0 . 18 4 . 92 10 . 47 IPW 0 . 01 − 0 . 05 1 . 75 2 . 22 n = 1000 WLS 0 . 01 0 . 01 1 . 14 1 . 14 DR 0 . 01 0 . 01 1 . 14 1 . 14 (2) Propensity score model correct HT − 0 . 05 − 0 . 14 14 . 39 24 . 28 IPW − 0 . 13 − 0 . 18 4 . 08 4 . 97 n = 200 WLS 0 . 04 0 . 04 2 . 51 2 . 51 DR 0 . 04 0 . 04 2 . 51 2 . 51 HT − 0 . 02 0 . 29 4 . 85 10 . 62 IPW 0 . 02 − 0 . 03 1 . 75 2 . 27 n = 1000 WLS 0 . 04 0 . 04 1 . 14 1 . 14 DR 0 . 04 0 . 04 1 . 14 1 . 14 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 9 / 48

Weighting Estimators are Sensitive to Misspecification Bias RMSE Sample size Estimator GLM True GLM True (3) Outcome model correct HT 24 . 25 − 0 . 18 194 . 58 23 . 24 IPW 1 . 70 − 0 . 26 9 . 75 4 . 93 n = 200 WLS − 2 . 29 0 . 41 4 . 03 3 . 31 DR − 0 . 08 − 0 . 10 2 . 67 2 . 58 HT 41 . 14 − 0 . 23 238 . 14 10 . 42 IPW 4 . 93 − 0 . 02 11 . 44 2 . 21 n = 1000 WLS − 2 . 94 0 . 20 3 . 29 1 . 47 DR 0 . 02 0 . 01 1 . 89 1 . 13 (4) Both models incorrect HT 30 . 32 − 0 . 38 266 . 30 23 . 86 IPW 1 . 93 − 0 . 09 10 . 50 5 . 08 n = 200 WLS − 2 . 13 0 . 55 3 . 87 3 . 29 DR − 7 . 46 0 . 37 50 . 30 3 . 74 HT 101 . 47 0 . 01 2371 . 18 10 . 53 IPW 5 . 16 0 . 02 12 . 71 2 . 25 n = 1000 WLS − 2 . 95 0 . 37 3 . 30 1 . 47 DR − 48 . 66 0 . 08 1370 . 91 1 . 81 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 10 / 48

Smith and Todd (2005, J. of Econometrics ) LaLonde (1986; Amer. Econ. Rev. ): Randomized evaluation of a job training program Replace experimental control group with another non-treated group Current Population Survey and Panel Study for Income Dynamics Many evaluation estimators didn’t recover experimental benchmark Dehejia and Wahba (1999; J. of Amer. Stat. Assoc. ): Apply propensity score matching Estimates are close to the experimental benchmark Smith and Todd (2005): Dehejia & Wahba (DW)’s results are sensitive to model specification They are also sensitive to the selection of comparison sample Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 11 / 48

Propensity Score Matching Fails Miserably One of the most difficult scenarios identified by Smith and Todd: LaLonde experimental sample rather than DW sample Experimental estimate: $ 886 (s.e. = 488) PSID sample rather than CPS sample Evaluation bias: Conditional probability of being in the experimental sample Comparison between experimental control group and PSID sample “True” estimate = 0 Logistic regression for propensity score One-to-one nearest neighbor matching with replacement Propensity score model Estimates Linear − 835 (886) Quadratic − 1620 (1003) Smith and Todd (2005) − 1910 (1004) Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 12 / 48

Covariate Balancing Propensity Score (CBPS) Idea: Estimate propensity score such that covariates are balanced Goal: Robust estimation of parametric propensity score model Covariate balancing conditions: � T i X i � π β ( X i ) − ( 1 − T i ) X i = 0 E 1 − π β ( X i ) Over-identification via score conditions: � � T i π ′ ( 1 − T i ) π ′ β ( X i ) β ( X i ) π β ( X i ) − = 0 E 1 − π β ( X i ) Can be interpreted as another covariate balancing condition Combine them with the Generalized Method of Moments Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 13 / 48

Revisiting Kang and Schafer (2007) Bias RMSE Estimator GLM CBPS1 CBPS2 True GLM CBPS1 CBPS2 True (1) Both models correct HT 0 . 33 2 . 06 − 4 . 74 1 . 19 12 . 61 4 . 68 9 . 33 23 . 93 IPW − 0 . 13 0 . 05 − 1 . 12 − 0 . 13 3 . 98 3 . 22 3 . 50 5 . 03 n = 200 WLS − 0 . 04 − 0 . 04 − 0 . 04 − 0 . 04 2 . 58 2 . 58 2 . 58 2 . 58 DR − 0 . 04 − 0 . 04 − 0 . 04 − 0 . 04 2 . 58 2 . 58 2 . 58 2 . 58 HT 0 . 01 0 . 44 − 1 . 59 − 0 . 18 4 . 92 1 . 76 4 . 18 10 . 47 IPW 0 . 01 0 . 03 − 0 . 32 − 0 . 05 1 . 75 1 . 44 1 . 60 2 . 22 n = 1000 WLS 0 . 01 0 . 01 0 . 01 0 . 01 1 . 14 1 . 14 1 . 14 1 . 14 DR 0 . 01 0 . 01 0 . 01 0 . 01 1 . 14 1 . 14 1 . 14 1 . 14 (2) Propensity score model correct HT − 0 . 05 1 . 99 − 4 . 94 − 0 . 14 14 . 39 4 . 57 9 . 39 24 . 28 IPW − 0 . 13 0 . 02 − 1 . 13 − 0 . 18 4 . 08 3 . 22 3 . 55 4 . 97 n = 200 WLS 0 . 04 0 . 04 0 . 04 0 . 04 2 . 51 2 . 51 2 . 51 2 . 51 DR 0 . 04 0 . 04 0 . 04 0 . 04 2 . 51 2 . 51 2 . 52 2 . 51 HT − 0 . 02 0 . 44 − 1 . 67 0 . 29 4 . 85 1 . 77 4 . 22 10 . 62 IPW 0 . 02 0 . 05 − 0 . 31 − 0 . 03 1 . 75 1 . 45 1 . 61 2 . 27 n = 1000 WLS 0 . 04 0 . 04 0 . 04 0 . 04 1 . 14 1 . 14 1 . 14 1 . 14 DR 0 . 04 0 . 04 0 . 04 0 . 04 1 . 14 1 . 14 1 . 14 1 . 14 Kosuke Imai (Princeton) Covariate Balancing Propensity Score Sweden (March 9, 2015) 14 / 48

Covariate Balancing Propensity Score Kosuke Imai Princeton - PowerPoint PPT Presentation

Covariate Balancing Propensity Score Kosuke Imai Princeton University Winter Conference in Statistics Borgafjll, Sweden March 9, 2015 Joint work with Christian Fong, and Marc Ratkovic Kosuke Imai (Princeton) Covariate Balancing Propensity

Covariate Balancing Propensity Score Kosuke Imai Princeton University June 1, 2012 Joint work

Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton

Covariate Adjustment and Statistical Power Tara Slough EGAP Learning Days X Covariate Adjustment

Propensity Score Matching James H. Steiger Department of Psychology and Human Development

MARC Fall Meeting 09/24/17 MARC Fall Meeting 09/24/17 SCORE Presentation SCORE

Balancing Gas system information provision 12 June 2018 GRTgaz balancing in a nutshell -> 2

Model Selection Model Selection under Covariate Shift under Covariate Shift Masashi Sugiyama

Motivation: disease progression modelling Covariate-GPLVM Motivation: disease progression

Load Balancing Load Balancing Load balancing: distributing data and/or computations across

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Qingyuan Zhao

Internal Load Balancing in 5 mins Deliver scalable and resilient internal-only services on GCP

Dynamic Load Balancing in Dynamic Load Balancing in Charm+ + Charm+ + Abhinav S Bhatele

Epidemic Algorithm for Load Balancing Harshitha Menon, Laxmikant Kal e 15th April 1 / 25

Linking Design to Analysis of Cluster Randomized Trials: Covariate Balancing Strategies Fan

Importance-Weighted Cross- Importance-Weighted Cross- Validation for Covariate Shift Validation

Treatment choice with many covariate values Aleksey Tetenov (University of Bristol) Cemmap

Coupling 3D radiative transfer models with soil vegetation transfer models for sparse vegetation

High-dimensional covariance estimation based on Gaussian graphical models Shuheng Zhou

DAKAR WORKSHOP, Dakar, 12 th September 2013 MONITORING AND EVALUATION OF THE DAKAR STRATEGY

Modelling and Estimating the Clustering of Extreme Events Rob Lamb 3 , 4 Ross Towe 1 Jonathan Tawn

Neural Models for Documents with Metadata Dallas Card, Chenhao Tan, Noah A. Smith July 18, 2018

The Ontario Health & Environment Integrated Surveillance (OHEIS) Project Presented to: The

GeoCom putational I ntelligence and GeoCom putational I ntelligence and High-perform ance

Enrique Baca Garca Disclosure of conflict of interest Grants/research support AFSP NARSAD

Covariate Balancing Propensity Score Kosuke Imai Princeton - PowerPoint PPT Presentation

Covariate Balancing Propensity Score Kosuke Imai Princeton University Winter Conference in Statistics Borgafjll, Sweden March 9, 2015 Joint work with Christian Fong, and Marc Ratkovic Kosuke Imai (Princeton) Covariate Balancing Propensity

Covariate Balancing Propensity Score Kosuke Imai Princeton University June 1, 2012 Joint work

Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton

Covariate Adjustment and Statistical Power Tara Slough EGAP Learning Days X Covariate Adjustment

Propensity Score Matching James H. Steiger Department of Psychology and Human Development

MARC Fall Meeting 09/24/17 MARC Fall Meeting 09/24/17 SCORE Presentation SCORE

Balancing Gas system information provision 12 June 2018 GRTgaz balancing in a nutshell -&gt; 2

Model Selection Model Selection under Covariate Shift under Covariate Shift Masashi Sugiyama

Motivation: disease progression modelling Covariate-GPLVM Motivation: disease progression

Load Balancing Load Balancing Load balancing: distributing data and/or computations across

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Qingyuan Zhao

Internal Load Balancing in 5 mins Deliver scalable and resilient internal-only services on GCP

Dynamic Load Balancing in Dynamic Load Balancing in Charm+ + Charm+ + Abhinav S Bhatele

Epidemic Algorithm for Load Balancing Harshitha Menon, Laxmikant Kal e 15th April 1 / 25

Linking Design to Analysis of Cluster Randomized Trials: Covariate Balancing Strategies Fan

Importance-Weighted Cross- Importance-Weighted Cross- Validation for Covariate Shift Validation

Treatment choice with many covariate values Aleksey Tetenov (University of Bristol) Cemmap

Coupling 3D radiative transfer models with soil vegetation transfer models for sparse vegetation

High-dimensional covariance estimation based on Gaussian graphical models Shuheng Zhou

DAKAR WORKSHOP, Dakar, 12 th September 2013 MONITORING AND EVALUATION OF THE DAKAR STRATEGY

Modelling and Estimating the Clustering of Extreme Events Rob Lamb 3 , 4 Ross Towe 1 Jonathan Tawn

Neural Models for Documents with Metadata Dallas Card, Chenhao Tan, Noah A. Smith July 18, 2018

The Ontario Health &amp; Environment Integrated Surveillance (OHEIS) Project Presented to: The

GeoCom putational I ntelligence and GeoCom putational I ntelligence and High-perform ance

Enrique Baca Garca Disclosure of conflict of interest Grants/research support AFSP NARSAD

Balancing Gas system information provision 12 June 2018 GRTgaz balancing in a nutshell -> 2

The Ontario Health & Environment Integrated Surveillance (OHEIS) Project Presented to: The