Make sure we can query black box algorithms

Auditing Black Box Models
Example: http://www.bloomberg.com/graphics/2016-amazon-same-day/

Training vs Testing
No access to the training data or the algorithm. Access to test data only.

How can we understand a model?
If we use a “simple” model, we can interpret it directly. Examples: decision trees, linear classifiers, SLIM (Supersparse Linear Integer Models).

Simple models are hard
Paul Raccuglia, Katherine C. Elbert, Philip D. F. Adler, Casey Falk, Malia B. Wenny, Aurelio Mollo, Matthias Zeller, Sorelle A. Friedler, Joshua Schrier, and Alexander J. Norquist. Machine-learning-assisted materials discovery using failed experiments. Nature, 533:73-76, May 5, 2016. http://dx.doi.org/10.1038/nature17439

Research Question
Given a black box function Y = f(x_1, ..., x_n), determine the influence each variable has on the outcome.
How do we quantify influence?
How do we model it (random perturbations?)
How do we handle indirect and joint influence?

Direct vs Indirect Influence Auditing
Does a feature (or group of features) directly influence the outcome? E.g., a feature used in a decision tree.
Intervention: replace the feature with random noise and see how much model accuracy degrades.
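The intervention above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: `black_box`, the feature names `income` and `age`, and the choice to generate noise by shuffling the feature's own column are all assumptions made for the example.

```python
import random

def accuracy(model, rows, labels):
    """Fraction of rows the model labels correctly."""
    hits = sum(1 for row, y in zip(rows, labels) if model(row) == y)
    return hits / len(rows)

def direct_influence(model, rows, labels, feature, rng):
    """Accuracy drop when `feature` is replaced by noise.

    Assumption: the noise is a shuffle of the feature's own column, which
    keeps its marginal distribution but breaks its link to each row.
    """
    column = [row[feature] for row in rows]
    rng.shuffle(column)
    noisy = [dict(row, **{feature: v}) for row, v in zip(rows, column)]
    return accuracy(model, rows, labels) - accuracy(model, noisy, labels)

# Hypothetical black box: we may only call it, never inspect it.
def black_box(row):
    return int(row["income"] > 50)

rng = random.Random(0)
rows = [{"income": rng.uniform(0, 100), "age": rng.uniform(18, 80)}
        for _ in range(1000)]
labels = [black_box(row) for row in rows]

income_influence = direct_influence(black_box, rows, labels, "income", rng)
age_influence = direct_influence(black_box, rows, labels, "age", rng)
```

Here the model reads only `income`, so perturbing `age` leaves accuracy unchanged while perturbing `income` costs roughly half of it.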

Does a feature (or group of features) indirectly influence the outcome? E.g., zipcode as a proxy for race.
Intervention: direct perturbation no longer works, because more than one variable carries the desired signal.

Information content and indirect influence
The information content of a feature can be estimated by trying to predict it from the remaining features.
If the removed feature can’t be predicted from the remaining features, then the information from that feature can’t influence the outcome of the model.

Given correlated variables X and Y, find Y’ that is conditionally independent of X and as similar to Y as possible.
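As a rough illustration of estimating information content by prediction, the sketch below predicts one categorical feature from a single other feature and compares held-out accuracy with a majority-guess baseline. The function name `predictability`, the lookup-table "model", and the synthetic `group`, `zipcode`, and `coin` columns are hypothetical choices for this example only.

```python
import random
from collections import Counter, defaultdict

def predictability(rows, target, proxy, train_fraction=0.5):
    """Held-out accuracy of predicting `target` from `proxy`, together with
    the accuracy of always guessing the most common target value."""
    split = int(len(rows) * train_fraction)
    train, test = rows[:split], rows[split:]

    # "Model": the most frequent target value seen for each proxy value.
    counts = defaultdict(Counter)
    for row in train:
        counts[row[proxy]][row[target]] += 1
    overall = Counter(row[target] for row in train).most_common(1)[0][0]

    def predict(row):
        seen = counts.get(row[proxy])
        return seen.most_common(1)[0][0] if seen else overall

    acc = sum(predict(r) == r[target] for r in test) / len(test)
    baseline = sum(r[target] == overall for r in test) / len(test)
    return acc, baseline

# Synthetic data (an assumption): zipcode agrees with group 90% of the time,
# while coin is unrelated to group.
rng = random.Random(1)
rows = []
for _ in range(2000):
    group = rng.choice(["a", "b"])
    home, away = ("z1", "z2") if group == "a" else ("z2", "z1")
    zipcode = home if rng.random() < 0.9 else away
    rows.append({"group": group, "zipcode": zipcode, "coin": rng.choice("HT")})

acc_zip, base_zip = predictability(rows, "group", "zipcode")
acc_coin, base_coin = predictability(rows, "group", "coin")
```

A feature that is predicted well above the baseline (here `group` from `zipcode`) can still reach the model through its proxy; one that is not (here from `coin`) cannot.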

Gradient Feature Audit
For each feature:
1. Remove the indirect influence of the feature on the other features in the data.
2. Run the model on the modified test data.
3. Feature influence = original accuracy - resulting accuracy.
Example: auditing the Amazon model. Feature to remove: race. Eliminate (obscure) the influence of race on zipcode.

All our measures of influence are relative to a fixed model.
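The three audit steps can be sketched as a loop around a fixed model. Everything named here is an assumption for illustration: `feature_audit`, the synthetic columns `group`, `coin`, and `proxy`, and in particular `obscure_by_mean_shift`, which is only a simple stand-in for step 1 (it aligns group means of the numeric columns and leaves the removed column itself in place).

```python
import random
from statistics import mean

def accuracy(model, rows, labels):
    return sum(model(r) == y for r, y in zip(rows, labels)) / len(rows)

def obscure_by_mean_shift(rows, removed):
    """Stand-in for step 1 (an assumption, not the method in the slides):
    shift every float-valued column so that its mean is the same for each
    value of the categorical feature `removed`."""
    numeric = [k for k in rows[0]
               if k != removed and isinstance(rows[0][k], float)]
    out = [dict(r) for r in rows]
    for k in numeric:
        overall = mean(r[k] for r in rows)
        group_mean = {v: mean(r[k] for r in rows if r[removed] == v)
                      for v in {r[removed] for r in rows}}
        for r in out:
            r[k] += overall - group_mean[r[removed]]
    return out

def feature_audit(model, rows, labels, features, obscure):
    """Influence of each feature = original accuracy - resulting accuracy."""
    original = accuracy(model, rows, labels)
    influence = {}
    for feature in features:
        modified = obscure(rows, feature)                  # step 1
        resulting = accuracy(model, modified, labels)      # step 2
        influence[feature] = original - resulting          # step 3
    return influence

# Hypothetical fixed model: it never reads "group", only a numeric proxy.
def black_box(row):
    return int(row["proxy"] > 5.0)

rng = random.Random(2)
rows = []
for _ in range(2000):
    group = rng.randint(0, 1)
    rows.append({"group": group, "coin": rng.randint(0, 1),
                 "proxy": 10.0 * group + rng.gauss(0.0, 1.0)})
labels = [black_box(r) for r in rows]

influence = feature_audit(black_box, rows, labels, ["group", "coin"],
                          obscure_by_mean_shift)
```

Although the model never reads `group`, obscuring it removes the signal carried by `proxy`, so `group` receives a large influence score while `coin` receives almost none.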

How do we remove indirect influence?
[Figure: density plot of hypothetical SAT scores, x-axis from 200 to 800]
Merge the conditional distributions of the obscured feature based on the eliminated feature.

This ensures that an F-test will fail to tell them apart (provably*).

Different approaches are needed for categorical and numerical removed and eliminated variables.
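One possible way to merge conditional distributions, for a numerical obscured feature and a categorical eliminated feature, is quantile matching: each value is replaced by the median, across groups, of the values at the same within-group quantile. The slides do not spell out the procedure, so treat this as a sketch under that assumption; the function name and the synthetic SAT scores are hypothetical.

```python
import random
from statistics import mean, median

def merge_conditional_distributions(values, groups):
    """Replace each value by the median, across groups, of the values that
    sit at the same within-group quantile (nearest-rank lookup)."""
    indices = {}
    for i, g in enumerate(groups):
        indices.setdefault(g, []).append(i)
    sorted_vals = {g: sorted(values[i] for i in idx)
                   for g, idx in indices.items()}

    def value_at(g, q):
        vs = sorted_vals[g]
        return vs[round(q * (len(vs) - 1))]

    repaired = [None] * len(values)
    for g, idx in indices.items():
        ranked = sorted(idx, key=lambda i: values[i])
        n = len(ranked)
        for rank, i in enumerate(ranked):
            q = rank / (n - 1) if n > 1 else 0.5
            repaired[i] = median(value_at(h, q) for h in sorted_vals)
    return repaired

# Hypothetical SAT scores for two groups with different distributions.
rng = random.Random(3)
groups = ["a"] * 500 + ["b"] * 500
scores = ([rng.gauss(600, 50) for _ in range(500)]
          + [rng.gauss(450, 50) for _ in range(500)])

repaired = merge_conditional_distributions(scores, groups)

gap_before = mean(scores[:500]) - mean(scores[500:])
gap_after = mean(repaired[:500]) - mean(repaired[500:])
```

Within each group the ordering of individuals is preserved, but after the merge both groups share the same distribution, so the group can no longer be read off the score.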

Representation matters!
Should race be categorical or numerical? Should it be “white/non-white” or multi-valued? These issues matter!
For more, see https://arxiv.org/abs/1802.04422 and https://github.com/algofairness/fairness-comparison

