classifying treatment responders under causal effect
play

Classifying Treatment Responders Under Causal Effect Monotonicity - PowerPoint PPT Presentation

June 12, 2019 ICML 2019 Classifying Treatment Responders Under Causal Effect Monotonicity Nathan Kallus ORIE and Cornell Tech, Cornell University Heterogeneous Treatment Effect Estimation X Age X Weight X BMI X SysBP T (Anticoagulant) Y


  1. June 12, 2019 ICML 2019 Classifying Treatment Responders Under Causal Effect Monotonicity Nathan Kallus ORIE and Cornell Tech, Cornell University

  2. Heterogeneous Treatment Effect Estimation X Age X Weight X BMI X SysBP T (Anticoagulant) Y (Hemorrhage) 49 106 31 Warfarin 1 54 89 26 None 0 43 130 38 None 1 . . . . . . . . . . . . . . . . . . Fit CATE τ ( X ) = E [ Y (1) − Y (0) | X ] to data on X, T, Y E.g. : Causal Forest (Wager & Athey ’17), TARNet (Shalit et al. ’17), ... Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 2

  3. Often Outcome is Binary Treatment Outcome Observed ( T ) ( Y ) Give anticoagulant Hemorrhage? Personalized discount Buy? Target job training Employed in 6 months? Homelessness prevention program Re-enter? Recidivism prevention program Recidivate? Support for minority CS students Drop out? Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 3

  4. Often We Want to Predict Response Treatment Individual Label of Interest ( T ) ( Y (1) − Y (0) ) Give anticoagulant Hemorrhage iff medicated Personalized discount Would buy iff discounted Target job training Would get job iff trained Homelessness prevention program Re-enter iff not targeted Recidivism prevention program Recidivate iff not targeted Support for minority CS students Drop out iff not targeted Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 4

  5. Classifying Responders: The Problem ◮ Each unit consists of ◮ Features X ◮ Potential outcomes Y (1) , Y (0) ∈ { 0 , 1 } ◮ “Non-responder” has Y (0) = Y (1) ◮ Would’ve bought (or, not bought) regardless of discount ◮ Would’ve hemorrhaged (or, not) regardless of anticoagulant ◮ “Responder” has Y (1) = 1 > 0 = Y (0) ◮ Would’ve bought if and only if offered discount ◮ R = I [ Y (1) > Y (0)] ◮ Ground truth NOT observed in X, T, Y data ◮ Want classifier f : X → { 0 , 1 } with small loss L θ ( f ) = θ P ( false positive ) + (1 − θ ) P ( false negative ) = θ P ( f ( X ) = 1 , R = 0) + (1 − θ ) P ( f ( X ) = 0 , R = 1) . Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 5

  6. Monotonicity ◮ Monotone treatment response assumption : Y (1) ≥ Y (0) ◮ Discount never causes a would-be buyer to not buy ◮ Job training never causes someone to not get employed? Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 6

  7. Monotonicity ◮ Monotone treatment response assumption : Y (1) ≥ Y (0) ◮ Discount never causes a would-be buyer to not buy ◮ Job training never causes someone to not get employed? ◮ Under monotonicity, R = Y (1) − Y (0) ∈ { 0 , 1 } ◮ So, P ( R = 1 | X ) = τ ( X ) = E [ Y (1) − Y (0) | X ] ◮ f ( X ) = I [ τ ( X ) ≥ θ ] minimizes L θ ( f ) ◮ Can take plug-in approach using any CATE estimator ˆ τ ◮ Question: any value to a direct classification approach? Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 6

  8. Classifying Responders ◮ For simplicity, consider completely randomized data with P ( T = 1) = 0 . 5 ◮ Let Z = I [ Y = T ] (observable!) ◮ R = 1 = ⇒ Z = 1 ◮ R = 0 = ⇒ Z ∼ Bernoulli(0 . 5) ◮ Z is like a corrupted observation of R ◮ Seeing Z = 0 is more informative about R ◮ Using Z as a surrogate label for R leads to new direct approaches to the classification problem ◮ Two instantiations of this are RespSVM, RespNet Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 7

  9. Empirical Results: Synthetic Responder Z = + 1 3 3 Z = − 1 Non-responder 2 2 1 1 0 0 −1 −1 −2 −2 −3 −3 −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 The true label R The observable label Z T = + 1, Y = + 1 T = − 1, Y = + 1 3 3 T = + 1, Y = − 1 T = − 1, Y = − 1 2 2 1 1 0 0 −1 −1 −2 −2 −3 −3 −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 T = +1 T = 0 Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 8

  10. Empirical Results: Synthetic Linear responder classification boundary 1.0 1.0 5eVS690 lLn 0.9 0.9 5eVS690 5BF 0.9 5eVSL5-gen 0.8 0.8 Accuracy 5eVSL5-dLVF 0.8 5eVS1et-gen 0.7 0.7 5eVS1et-dLVF 0.7 5F 0.6 0.6 CF 0.6 7A51et 0.5 0.5 0.5 5eVS690 lLn 0.9 10 1 10 2 10 3 10 1 10 2 10 3 10 1 10 2 10 3 5eVS690 5BF d = 2 d = 10 d = 20 5eVSL5-gen 0.8 5eVSL5-dLVF Spherical responder classification boundary 5eVS1et-gen 0.7 5eVS1et-dLVF 1.0 5eVS690 lLn 0.9 0.9 5F 0.9 5eVS690 5BF 5eVSL5-gen 0.6 0.8 0.8 CF Accuracy 0.8 5eVSL5-dLVF 5eVS1et-gen 0.7 0.7 7A51et 0.7 5eVS1et-dLVF 5F 0.6 0.6 0.5 0.6 CF 7A51et 0.5 0.5 0.5 10 1 10 2 10 3 10 1 10 2 10 3 10 1 10 2 10 3 10 1 10 2 10 3 d = 2 d = 10 d = 20 Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 9

  11. Empirical Results: Census Data ◮ Predict whether the sex-at-birth of mother’s first two kids being the same influences her decision to have a third ◮ Follows data construction by Angirst & Evans ’96 ◮ Covariates: ethnicity of mother and father; their ages at marriage, at census, at 1st kid, and at 2nd kid, year of marriage, and education level Method L θ (in 0 . 01 ) % 1st % 2nd % 3rd RespSVM lin 49 ± 2 . 7 100% RespLR-gen 57 ± 2 . 4 100% RespLR-disc 58 ± 2 . 3 2% LR 58 ± 2 . 3 92% RF 58 ± 2 . 3 6% Nathan Kallus Classifying Treatment Responders Under Causal Effect Monotonicity 10

  12. Thank you! Poster: Today 6:30pm @ Pacific Ballroom #74

Recommend


More recommend