learning models from data with measurement error tackling
play

Learning Models from Data with Measurement Error: Tackling - PowerPoint PPT Presentation

Learning Models from Data with Measurement Error: Tackling Underreporting Roy Adams, Yuelong Ji, Xiaobin Wang, and Suchi Saria Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.


  1. Learning Models from Data with Measurement Error: Tackling Underreporting Roy Adams, Yuelong Ji, Xiaobin Wang, and Suchi Saria

  2. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

  3. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data.

  4. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data. • We focus on underreporting error.

  5. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data. • We focus on underreporting error. • E.g. survey data of sensitive variables such as drug use.

  6. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . X à A Y

  7. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: X à A Y

  8. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: 1. Strict underreporting ( A = 0 ⟹ à = 0 ) X à A Y

  9. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: 1. Strict underreporting ( A = 0 ⟹ à = 0 ) X 2. à is independent of X given A à A Y

  10. Model Model Outcome model … p 𝜄 (Y | A, X) X Exposure model … p 𝜚 (A | X) Error model ……… p 𝜐 (Ã | A) Ã A Y )

  11. Model Model Outcome model … p 𝜄 (Y | A, X) X Exposure model … p 𝜚 (A | X) Error model ……… p 𝜐 (Ã | A) Ã A Y ) Maximize the log marginal likelihood : θ , ϕ , τ ∑ log ∑ p θ ( y i | a , x i ) p τ ( ˜ a i | a ) p ϕ ( a | x i ) max i a

  12. Identifiability Identifiability

  13. Identifiability Identifiability We prove three separate identifiability conditions:

  14. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known

  15. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation

  16. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation 3. Under assumptions about the form of the exposure distribution (see paper/poster for details)

  17. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation 3. Under assumptions about the form of the exposure distribution (see paper/poster for details) In particular: If X is not independent of A and p(A | X) is a logit, probit, or cloglog regression model , then p(Y, Ã | X) is identifiable.

  18. Maternal drug use and childhood obesity Drug use and childhood obesity 6mRking (a) 0.25 sensitivity analysis subject-reSRrt 5isk difference Average causal e ff ect 0.20 subject-reSRrt + cRtinine 0.15 0.10 0.05 0.00 0.0 0.1 0.2 0.3 0.4 0.5 0.6 τ Underreporting rate

  19. Thanks! Drug use and childhood obesity Come see poster #75

Recommend


More recommend