Human-in-the-Loop Interpretability Prior
Isaac Lage¹, Andrew Slavin Ross¹, Been Kim², Samuel J. Gershman¹ & Finale Doshi-Velez¹
¹Harvard University & ²Google Brain
Poster: Today, 10:45 AM - 12:45 PM, Room 210 & 230 AB #119

Interpretability
[Image credit: clipart-library.com]

Optimizing for Interpretability
Previous Work: Choose a Proxy for Interpretability → Optimize Proxy for Interpretability → User Study
Which proxy?
How to use results to choose a better proxy?

Optimizing for Interpretability
Human-in-the-Loop Interpretability: Update Model ↔ User Study
No proxy!
Update model directly with results!

Interpretability Prior
Goal: Bias the model to be human-interpretable
Bayesian Inference

Interpretability Prior
First: Formulate Interpretability Encouraging Prior

Optimizing for Interpretability
Previous Work: Choose a Proxy for Interpretability → Optimize Proxy for Interpretability → User Study
Can define a prior
Which prior captures human interpretability?

Optimizing for Interpretability
Human-in-the-Loop Interpretability: Update Model ↔ User Study
Evaluate interpretability encouraging prior

Interpretability Prior
First: Formulate Interpretability Encouraging Prior
Then: Identify MAP Solution

Interpretability Prior
Likelihood: Easy. Evaluate computationally. No users!
Prior: Hard. No closed form. Evaluate with user studies!
Challenge: Approximate MAP with few evaluations of prior
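In standard Bayesian notation, the split between an easy likelihood and a hard prior might be written as below. This is only a sketch: the symbols $M$ (a candidate model) and $X$ (the training data) are assumptions introduced here and do not appear in the slide text.

```latex
\[
M^{\mathrm{MAP}}
  \;=\; \arg\max_{M} \; p(M \mid X)
  \;=\; \arg\max_{M} \; \underbrace{p(X \mid M)}_{\text{likelihood: computed}}
        \;\underbrace{p(M)}_{\text{prior: user studies}}
\]
```

The evidence $p(X)$ does not depend on $M$, so it drops out of the maximization; the costly term is $p(M)$, since each evaluation of it requires a user study.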

Simplified Cartoon of Our Approach
Step 1: Identify Diverse, High Likelihood Models
Candidate MAP 1: Likelihood = HIGH, Prior = ?
Candidate MAP 2: Likelihood = HIGH, Prior = ?
Candidate MAP 3: Likelihood = HIGH, Prior = ?
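One way Step 1 could be sketched in Python is shown below. The slides do not say how diversity is measured or how candidates are picked, so everything here is an assumption for illustration: the function name `select_diverse_high_likelihood`, the likelihood threshold `min_log_lik`, the per-model feature vectors, and the greedy farthest-point rule.

```python
import numpy as np

def select_diverse_high_likelihood(features, log_liks, min_log_lik, k):
    """Pick up to k models that pass a likelihood threshold and are far apart.

    features    : (n_models, n_features) array describing each model (assumed).
    log_liks    : (n_models,) log-likelihood of each model on the data.
    min_log_lik : hypothetical cutoff for counting a model as "high likelihood".
    """
    features = np.asarray(features, dtype=float)
    log_liks = np.asarray(log_liks, dtype=float)
    eligible = np.flatnonzero(log_liks >= min_log_lik)
    if eligible.size == 0:
        return []
    # Seed with the highest-likelihood eligible model.
    chosen = [int(eligible[np.argmax(log_liks[eligible])])]
    while len(chosen) < min(k, eligible.size):
        # Distance from every eligible model to its nearest chosen model.
        diffs = features[eligible][:, None, :] - features[chosen][None, :, :]
        dist = np.linalg.norm(diffs, axis=2).min(axis=1)
        # Never re-pick a model that is already in the candidate set.
        dist[np.isin(eligible, chosen)] = -np.inf
        chosen.append(int(eligible[np.argmax(dist)]))
    return chosen
```

Calling it with `k=3` would return three candidate indices, each of which still has an unknown prior value until a user study is run on it.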

Simplified Cartoon of Our Approach
Step 2: Bayesian Optimization with User Studies
Similarity Based on Explanation Features
User study 1: Prior = MEDIUM
Prior Estimate: Prior = HIGH?
User study 2: Prior = LOW
Prior Estimate: Prior = HIGH?
User study 3: Prior = HIGH
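The loop in Step 2 can be illustrated with a small Gaussian-process sketch. The slides name Bayesian optimization and a similarity based on explanation features but give no further details, so the RBF kernel, the upper-confidence-bound rule, the `beta` parameter, and the `run_user_study` callback (assumed to return a numeric log-prior estimate for a candidate) are all hypothetical choices made here, not the authors' stated method.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=1.0):
    """Similarity between models, computed from their explanation features."""
    sq = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-0.5 * sq / length_scale**2)

def gp_posterior(x_obs, y_obs, x_all, noise=1e-3):
    """GP posterior mean and variance of the prior score at every candidate."""
    k_oo = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_ao = rbf_kernel(x_all, x_obs)
    centered = y_obs - y_obs.mean()
    mean = y_obs.mean() + k_ao @ np.linalg.solve(k_oo, centered)
    var = 1.0 - np.einsum("ij,ji->i", k_ao, np.linalg.solve(k_oo, k_ao.T))
    return mean, np.clip(var, 0.0, None)

def optimize_with_user_studies(features, log_liks, run_user_study, budget, beta=2.0):
    """Spend a small budget of user studies to approximate the MAP candidate."""
    features = np.asarray(features, dtype=float)
    log_liks = np.asarray(log_liks, dtype=float)
    tried = [int(np.argmax(log_liks))]        # start at the best-fitting model
    scores = [run_user_study(tried[0])]       # hypothetical log-prior estimate
    while len(tried) < min(budget, len(features)):
        mean, var = gp_posterior(features[tried], np.array(scores), features)
        # Optimistic estimate of log-likelihood + log-prior for each candidate.
        ucb = log_liks + mean + beta * np.sqrt(var)
        ucb[tried] = -np.inf                  # do not repeat a user study
        nxt = int(np.argmax(ucb))
        tried.append(nxt)
        scores.append(run_user_study(nxt))
    log_post = log_liks[tried] + np.array(scores)
    return tried[int(np.argmax(log_post))], tried, scores
```

Candidates whose explanation features resemble a model that scored well inherit a high prior estimate, which is what lets the loop skip user studies on most candidates.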

Main Takeaways
• We optimize for interpretability directly with human feedback
• Our approach efficiently identifies human-interpretable and predictive models
• MAP approximations correspond to different interpretability proxies on different datasets
[Figure: Census Dataset; x-axis: Number of Iterations; annotation: MORE Interpretable]
