The Mythos of Model Interpretability Zachary C. Lipton https://arxiv.org/abs/1606.03490
Outline • What is interpretability? • What are its desiderata? • What model properties confer interpretability? • Caveats, pitfalls, and takeaways
What is Interpretability? • Many papers make axiomatic claims: "This model is {interpretable, explainable, intelligible, transparent, understandable}" • But what is interpretability, & why is it desirable? • Does it hold a consistent meaning across papers?
We want good models (figure: a model scored by an evaluation metric)
We also want interpretable models (figure: the evaluation metric, alongside a human interpretation)
The Human Wants Something the Metric Doesn't (figure: evaluation metric vs. interpretation)
So What’s Up? It seems either: • The metric captures everything and people are crazy • The metric is mismatched with our real objectives We hope to refine the discourse on interpretability. In dialogue with the literature, we create a taxonomy of both objectives & methods
Outline • What is interpretability? • What are its desiderata? • What model properties confer interpretability? • Caveats, pitfalls, and takeaways
Trust • Does the model know when it’s uncertain? • Does the model make the same mistakes as humans? • Are we comfortable with the model?
Causality • Tell us something about the natural world • Predictions vs. actions • Caruana et al. (2015) show a pneumonia mortality predictor (for use in triage) that assigns lower risk to asthma patients, an artifact of the aggressive care those patients historically received, not of genuinely lower risk
Transferability • Training setups differ from the wild • Reality may be non-stationary and noisy • We don’t want the model to depend on artifacts of the training setup
Informativeness • We may train a model to make a *decision* • But its real purpose is to be a feature for a human decision-maker • Thus an interpretation may simply be valuable for the extra bits it carries
Outline • What is interpretability? • What are its desiderata? • What model properties confer interpretability? • Caveats, pitfalls, and takeaways
Transparency • Proposed techniques for conferring interpretability tend to fall into two categories • Transparency addresses understanding how the model works • Explainability concerns the model’s ability to offer some (potentially post-hoc) explanation
Simulatability • One notion of transparency is simplicity • Small decision trees, sparse linear models, rules • A model is simulatable if a person can *run* it
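To make simulatability concrete, here is a minimal sketch (assuming scikit-learn and its bundled Iris data, neither of which is prescribed by the talk): a depth-2 decision tree is small enough that a person can read its full rule set and "run" a prediction by hand.

```python
# Simulatability sketch: a depth-2 tree's complete rule set fits in a
# reader's head, so every prediction can be traced manually.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

data = load_iris()
tree = DecisionTreeClassifier(max_depth=2).fit(data.data, data.target)

# Print the full set of decision rules for human inspection.
print(export_text(tree, feature_names=list(data.feature_names)))
```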
Decomposability • A relaxed notion requires understanding the individual components of a model • Such as the weights of a linear model or the nodes of a decision tree
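A hedged sketch of decomposability, again using scikit-learn as an illustrative choice (not the talk's): each nonzero weight of a sparse linear model admits its own reading as a per-feature effect, provided the features are on comparable scales.

```python
# Decomposability sketch: inspect each surviving Lasso coefficient in
# isolation. The diabetes data ships pre-standardized, so the raw
# coefficient magnitudes are loosely comparable across features.
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Lasso

data = load_diabetes()
model = Lasso(alpha=0.1).fit(data.data, data.target)

# Each nonzero weight is a component a person can reason about alone.
for name, w in zip(data.feature_names, model.coef_):
    if w != 0.0:
        print(f"{name}: {w:+.1f}")
```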
Transparent Algorithms • We understand the behavior of the algorithm (but maybe not its output) • E.g., convergence of convex optimization, generalization bounds
Post-Hoc Interpretability “Ah yes, something cool is happening in node 750,345,167… maybe it sees a cat? Try jiggling the inputs?”
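The "jiggling the inputs" quip points at perturbation-based probing. A minimal sketch, assuming only NumPy and a hypothetical black-box `model.predict` that maps an (n, d) array of inputs to a vector of scores:

```python
# Perturbation sketch: jiggle one feature at a time and measure how
# much the black-box output moves. `model` is a hypothetical object
# with a predict() method; nothing here is from the talk itself.
import numpy as np

def perturbation_importance(model, x, noise=0.1, trials=100, seed=0):
    rng = np.random.default_rng(seed)
    base = model.predict(x[None, :])[0]        # score on the clean input
    scores = np.zeros(x.shape[0])
    for j in range(x.shape[0]):
        xs = np.tile(x, (trials, 1)).astype(float)
        xs[:, j] += rng.normal(0.0, noise, size=trials)  # jiggle feature j
        scores[j] = np.mean(np.abs(model.predict(xs) - base))
    return scores  # larger = output more sensitive to that feature
```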
Verbal Explanations • Just as people generate explanations (absent transparency), we might train a (possibly separate) model to generate explanations • One could think of captions as interpretations of a classification model (Image: Karpathy et al. 2015)
Saliency Maps • The mapping between input & output may be impossible to describe succinctly, so local explanations are potentially useful. (Image: Wang et al. 2016)
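A minimal gradient-saliency sketch, assuming PyTorch, a differentiable classifier `net`, and an image tensor `img` of shape (1, C, H, W); none of these specifics come from the talk, and real saliency methods add smoothing and other refinements.

```python
# Saliency sketch: the gradient of the predicted-class score w.r.t.
# each pixel gives a local account of which inputs mattered.
import torch

def saliency_map(net, img):
    img = img.clone().requires_grad_(True)
    score = net(img).max(dim=1).values.sum()  # predicted-class score
    score.backward()
    # Collapse channels so each pixel location gets one heat value.
    return img.grad.abs().max(dim=1).values.squeeze(0)
```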
Case-Based Explanations • Retrieve labeled examples that look similar to the model • Doctors employ this technique to explain treatments (Image: Mikolov et al. 2014)
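One way to realize case-based explanation is nearest-neighbor retrieval in a learned representation space. A sketch under stated assumptions: scikit-learn for the index, and a hypothetical `embed` function mapping raw inputs to vectors.

```python
# Case-based sketch: explain a prediction by showing the training
# examples the model's embedding space considers most similar.
from sklearn.neighbors import NearestNeighbors

def nearest_cases(embed, X_train, x_query, k=5):
    index = NearestNeighbors(n_neighbors=k).fit(embed(X_train))
    _, idx = index.kneighbors(embed(x_query[None, ...]))
    return idx[0]  # indices of the k most similar training cases
```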
Outline • What is interpretability? • What are its desiderata? • What model properties confer interpretability? • Caveats, pitfalls, and takeaways
Discussion Points • Linear models are not strictly more interpretable than deep networks • Claims about interpretability must be qualified • Transparency may be at odds with the broader goals of AI • Post-hoc interpretations may mislead
Thanks! Acknowledgments: Zachary C. Lipton was supported by the Division of Biomedical Informatics at UCSD, via training grant (T15LM011271) from the NIH/NLM. Thanks to Charles Elkan, Julian McAuley, David Kale, Maggie Makar, Been Kim, Lihong Li, Rich Caruana, Daniel Fried, Jack Berkowitz, & Sepp Hochreiter References: The Mythos of Model Interpretability, ZC Lipton (ICML Workshop on Human Interpretability 2016). https://arxiv.org/abs/1606.03490