generalized inverse classification
play

GENERALIZED INVERSE CLASSIFICATION SDM 17 Michael T. Lash 1 , - PowerPoint PPT Presentation

GENERALIZED INVERSE CLASSIFICATION SDM 17 Michael T. Lash 1 , Qihang Lin 2 , W. Nick Street 2 , Jennifer G. Robinson 3 , and Jeffrey Ohlmann 2 1 Department of Computer Science, 2 Deparment of Management Sciences, 3 Department of Epidemiology


  1. GENERALIZED INVERSE CLASSIFICATION SDM ’17 Michael T. Lash 1 , Qihang Lin 2 , W. Nick Street 2 , Jennifer G. Robinson 3 , and Jeffrey Ohlmann 2 1 Department of Computer Science, 2 Deparment of Management Sciences, 3 Department of Epidemiology www.michaeltlash.com

  2. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  3. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  4. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  5. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  6. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  7. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  8. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. 1

  9. What is inverse classification ? � The process of making meaningful perturbations to a test instance such that the probability of a desirable outcome is maximized. What about the meaningful part of the definition? 1

  10. M eaningful Perturbations Well...lets visit some past work! � Michael T. Lash, Qihang Lin, W. Nick Street, and Jennifer G. Robinson, “A budget-constrained inverse classification framework for smooth classifiers”, arXiv preprint arXiv:1605.09068 , submitted. 2

  11. M eaningful Perturbations Begin with a basic formulation. 2

  12. M eaningful Perturbations 2

  13. M eaningful Perturbations 2

  14. M eaningful Perturbations Some regressor Segment features. 2

  15. M eaningful Perturbations Some regressor Estimate indirectly changeable. 2

  16. M eaningful Perturbations Some regressor Update objective function. Add constraints. 2

  17. M eaningful Perturbations Some regressor Cost-change function 2

  18. M eaningful Perturbations Budget Some regressor 2

  19. M eaningful Perturbations Bounds Some regressor 2

  20. M ain Contributions 1. Relax assumptions about f ( · ) . Bounds Some regressor 3

  21. M ain Contributions 1. Relax assumptions about f ( · ) . Bounds Some regressor 3

  22. M ain Contributions 1. Relax assumptions about f ( · ) . Bounds Some regressor Generalized inverse classification 3

  23. M ain Contributions 1. Relax assumptions about f ( · ) . 2. Quadratic cost-change function. Some regressor 4

  24. M ain Contributions 1. Relax assumptions about f ( · ) . 2. Quadratic cost-change function. Some regressor 4

  25. M ain Contributions 1. Relax assumptions about f ( · ) . 2. Quadratic cost-change function. Some regressor 4

  26. M ain Contributions 1. Relax assumptions about f ( · ) . 2. Quadratic cost-change function. 3. Three real-valued heuristic optimization methods and two sensitivity analysis-based optimization methods. * Projection operator to maintain feasibility. 5

  27. Op timization Methodology Heuristic � Hill Climbing + Local Search (HC+LS) � Genetic Algorithm (GA) � Genetic Algorithm + Local Search (GA+LS) Sensitivity Analysis � Local Variable Perturbation – First Improvement (LVP-FI) � Local Variable Perturbation – Best Improvement (LVP-BI) 6

  28. Experiment Decisions and Data � f ( · ) : Random forest � H ( · ) : Kernel regression � Dataset 1: Student Performance (UCI Machine Learning Repository). � Dataset 2: ARIC � One f for optimization, separate f for heldout evaluation. 7

  29. R esults: Student Performance 8

  30. R esults: Student Performance 8

  31. R esults: ARIC 9

  32. R esults: ARIC Need sparsity constraints 9

  33. C onclusions � Generalized Inverse Classification: can use virtually any learned f (as shown by experiments w/ Random Forest classifier). � Our proposed methods were successful, although this varied by dataset. 10

  34. GENERALIZED INVERSE CLASSIFICATION SDM ’17 Michael T. Lash 1 , Qihang Lin 2 , W. Nick Street 2 , Jennifer G. Robinson 3 , and Jeffrey Ohlmann 2 1 Department of Computer Science, 2 Deparment of Management Sciences, 3 Department of Epidemiology www.michaeltlash.com 10

  35. Causality and Inverse Classification 11

  36. Causality and Inverse Classification � Yes! .... 11

  37. Causality and Inverse Classification � Yes! .... � What we’re doing: 1. Imposing our own causal structure (DAG). 2. We’re not taking the usual counterfactual approach. 11

  38. Causality and Inverse Classification � Yes! .... � What we’re doing: 1. Imposing our own causal structure (DAG). 2. We’re not taking the usual counterfactual approach. � Future work will focus on incorporating causal methodology... 11

Recommend


More recommend