A User Study on the Effect of Aggregating Explanations for - PowerPoint PPT Presentation

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models [work in progress] Josua Krause* , Adam Perer**, Enrico Bertini* Mon, August 20th 2018 * **

Instance Explanations "Why Should I Trust You?" Explaining the Predictions of Any Classifier   2 Marco Riberio, Sameer Singh, Carlos Guestrin   International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD 2016)

Finding Data Biases "Why Should I Trust You?" Explaining the Predictions of Any Classifier   3 Marco Riberio, Sameer Singh, Carlos Guestrin   International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD 2016)

Problem: Inspecting single instances   does not scale well 4

Solution: Aggregating data and explanations 5

Ground Truth Positive Negative Prediction Positive Negative Correct Incorrect Solution: Aggregating data and explanations 6

Ground Truth Positive Negative Prediction Positive Negative Correct Incorrect Solution: Aggregating data and explanations Living Area (numeric) 7

Ground Truth Positive Negative Prediction Positive Negative Correct Incorrect Living Area (numeric) Feature Value 8

Ground Truth Positive Negative Prediction Positive Negative Correct Incorrect Living Area (numeric) Concentration Within Subset Feature Value 9

Ground Truth Positive Negative Prediction Positive Negative Correct Incorrect Feature Living Area (numeric) Importance Concentration Within Subset Feature Value 10

Sorted by Importance

What is the impact of aggregation? What is the impact of instance-level explanations? How do those settings affect the ability to detect biases in the data? 12

Four Conditions T able H istogram N o Explanation E xplanation 13

Two Data Sets 18

Two Data Sets High Price Low Price Living Area (numeric) 19

Two Data Sets Model Accuracy: 81.959% Model Accuracy: 88.325% 20

Questions Individual models: • Do you think the predictions of the model make sense ? 5 point Likert scale (Not at all – Very much) • How well does the model perform in terms of accuracy ? 5 point Likert scale (Not much – Very well) • How much do you trust the model? 5 point Likert scale (Not at all – Very much) • Why do you trust or not trust this model? Free text answer Summary: Which model do you prefer? Multiple choice and text answer 21

Study 100 participants 4 conditions (25 each): • Table without Explanations ( T/N ) • Table with Explanations ( T/E ) • Histogram without Explanations ( H/N ) • Histogram with Explanations ( H/E ) Random model order Correctly identified more accurate model Evaluation metrics: Model preference (trust) Bias detection 22

Participants Who Trusted the Correct Model 40% 30% 20% 10% 00% T/E H/N H/E 23 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Trusted the Correct Model vs. 40% 30% Significant improvement! 20% p-value 0.0477 < 0.05 10% 00% T/E H/N H/E 24 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Trusted the Correct Model vs. 40% 30% 20% p-value 0.0982 > 0.05 10% 00% T/E H/N H/E 25 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Trusted the Correct Model vs. 40% "It has higher accuracy so should be more trustworthy than the other one. However some of the results don’t make sense to me. Maybe this is just an atypical property market." 30% "It is accurate, yet the predictions do not make much sense. Higher quality houses 20% having a larger amount of low priced houses, percentage-wise? More rooms, area, or stories resulting in lower prices? The logic does not work out." 10% "larger houses are valued lower than others which are smaller" 00% T/E H/N H/E 26 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Trusted the Correct Model vs. 40% "If the data says it’s true, then it’s true I suppose and it’s more trustworthy than my common sense." 30% "I feel like the results of [the biased model] where strange even though they where correct according to the dataset." 20% "I’m drawn to trusting the model which was more accurate even though it didn’t entirely make sense to me." 10% 25% of the participants who found the bias did not change their mind! 00% T/E H/N H/E 27 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Detected the Bias 50% 40% vs. 30% Significant improvement! p-value 0.0359 < 0.05 20% 10% 00% T/E H/N H/E 28 T: Table H: Histogram E: Explanation N: No Explanation

Participants Who Detected the Bias 50% 40% 30% p-value 0.0311 < 0.05 20% 10% 00% T/N T/E H/N H/E 29 T: Table H: Histogram E: Explanation N: No Explanation

Number of Hovered Cells T/N T/E 0 100 200 300 400 500 Number of Hovered Bars H/N H/E 0 200 400 600 800 1000 Bootstrapped 95% Confidence Intervals 30 T: Table H: Histogram E: Explanation N: No Explanation

Number of Hovered Cells T/N T/E 0 100 200 300 400 500 Number of Hovered Bars Number of Hovered Bars H/N H/E 0 200 400 600 800 1000 Bootstrapped 95% Confidence Intervals 31

Participants Who Detected the Bias 50% Similar performance! 40% 30% 20% 10% 00% T/N T/E H/N H/E 32 T: Table H: Histogram E: Explanation N: No Explanation

vs. Note that the task was chosen in a way that under all conditions it was possible to find the bias. Histograms scale better to larger data sets or more complex errors in the data. In tables you have to extrapolate... 33

Lessons Learned People trust accuracy (too much). Aggregating instance-level explanations significantly helps detecting biases compared to individual explanations. Individual instance-level explanations may hurt performance. 34

Further Work More targeted studies   to confirm hypotheses Different results for expert users? 35

Thank You! A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models [work in progress] Josua Krause* , Adam Perer**, Enrico Bertini* * **

A User Study on the Effect of Aggregating Explanations for - PowerPoint PPT Presentation

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models [work in progress] Josua Krause* , Adam Perer**, Enrico Bertini* Mon, August 20th 2018 * ** Instance Explanations "Why Should I Trust

2020 Projections were then developed by 2020 Projections were then developed by aggregating

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian

Ubiquitous and Mobile Computing CS 528: The Effect of Developer Specified Explanations for

Argumentative explanations for recommendations - Effect of display style and profile transparency

Quantum Hall effect effect Quantum Hall integer integer Hall bar geometry classical quantum

Spin Hall Effect and Experimental Observation 1701110147@pku.edu.cn 2017.12.15

Generating Visual Explanations Lisa et al. Seoul National University

Finding Explanations Instead of finding structure in a data set, we are now focusing on methods

Z Explanations of Neutral Current B Anomalies by Ben Allanach (University of Cambridge)

What Gets Echoed? Understanding the Pointers in Explanations of Persuasive Arguments David

Thoughts on Martin-Lfs Meaning Explanations Peter Dybjer Chalmers tekniska hgskola,

Table of Contents I Diagnostic Agents Recording the History of a Domain Defining Explanations

Counterfactual Visual Explanations Yash Goyal Ziyan Wu Jan Ernst Dhruv Batra Devi Parikh

Proof explanations: using natural language and graph view Fr ed erique GUILHOT Hanane

Explanations in Constraint Programming Barry OSullivan Cork Constraint Computation Centre

Tests, Games, and Martin-Lfs Meaning Explanations for Intuitionistic Type Theory Meeting on

What Kind of Theory Is a Model-Theore5c Seman5cs of a

Michel Electron: Truth Study Aidan Reynolds 8 March 2017 1 Samples Dorota generated two

Local Hadron Calibration in ATLAS P A O L A G I O V A N N I N I on behalf of the ATLAS Liquid

Dynamics during the Stanley Cup Playoffs Daniel de Leng, Mattias Tiger, Mathias Almquist,

About the course From the CSE catalog: CSE 321 Discrete Structures (4) CSE 321 Discrete

SI232 Slide Set #6: Digital Logic (Appendix B) 1 2 Appendix Goals Logic Design Digital

John 18:37, Pilate therefore said to Him, Are You John 18:37, Pilate therefore said to Him,

Saving Truth from Orthodoxy Equations Better Logic Through Algebra, Probability, and Dynamical

A User Study on the Effect of Aggregating Explanations for - PowerPoint PPT Presentation

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models [work in progress] Josua Krause* , Adam Perer**, Enrico Bertini* Mon, August 20th 2018 * ** Instance Explanations "Why Should I Trust

2020 Projections were then developed by 2020 Projections were then developed by aggregating

RUN groupadd -r user &amp;&amp; useradd -r -g user user USER user $ docker run --read-only debian

Ubiquitous and Mobile Computing CS 528: The Effect of Developer Specified Explanations for

Argumentative explanations for recommendations - Effect of display style and profile transparency

Quantum Hall effect effect Quantum Hall integer integer Hall bar geometry classical quantum

Spin Hall Effect and Experimental Observation 1701110147@pku.edu.cn 2017.12.15

Generating Visual Explanations Lisa et al. Seoul National University

Finding Explanations Instead of finding structure in a data set, we are now focusing on methods

Z Explanations of Neutral Current B Anomalies by Ben Allanach (University of Cambridge)

What Gets Echoed? Understanding the Pointers in Explanations of Persuasive Arguments David

Thoughts on Martin-Lfs Meaning Explanations Peter Dybjer Chalmers tekniska hgskola,

Table of Contents I Diagnostic Agents Recording the History of a Domain Defining Explanations

Counterfactual Visual Explanations Yash Goyal Ziyan Wu Jan Ernst Dhruv Batra Devi Parikh

Proof explanations: using natural language and graph view Fr ed erique GUILHOT Hanane

Explanations in Constraint Programming Barry OSullivan Cork Constraint Computation Centre

Tests, Games, and Martin-Lfs Meaning Explanations for Intuitionistic Type Theory Meeting on

What Kind of Theory Is a Model-Theore5c Seman5cs of a

Michel Electron: Truth Study Aidan Reynolds 8 March 2017 1 Samples Dorota generated two

Local Hadron Calibration in ATLAS P A O L A G I O V A N N I N I on behalf of the ATLAS Liquid

Dynamics during the Stanley Cup Playoffs Daniel de Leng, Mattias Tiger, Mathias Almquist,

About the course From the CSE catalog: CSE 321 Discrete Structures (4) CSE 321 Discrete

SI232 Slide Set #6: Digital Logic (Appendix B) 1 2 Appendix Goals Logic Design Digital

John 18:37, Pilate therefore said to Him, Are You John 18:37, Pilate therefore said to Him,

Saving Truth from Orthodoxy Equations Better Logic Through Algebra, Probability, and Dynamical

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian