Logic Extraction for Explainable AI
Susmit Jha
Computer Science Laboratory, SRI International
July 2019
AI reaches human-level accuracy on benchmark datasets
[Figures: ImageNet classification ("Going Deeper with Convolutions", the Inception architecture; C. Szegedy et al., 2014), the Switchboard speech-recognition benchmark, and face detection (Taigman et al., 2014).]
Beyond aggregate numbers
Machine learning is highly susceptible to adversarial attacks (Szegedy et al., 2013, 2014). Even when the attacker is allowed to modify the value of only one pixel, 70.97% of natural images can be perturbed to at least one target class, with 97.47% confidence on average.
Models also show low robustness to benign noise (Dodge et al., 2017).
Statistically good does not mean logically or conceptually good. See "Understanding Deep Learning Requires Rethinking Generalization" (C. Zhang, S. Bengio, M. Hardt, B. Recht, O. Vinyals).
TRINITY: Trust, Resilience and Interpretability
Trust:
• Global assume/guarantee contracts on DNNs
• Closed-loop verification of NN controllers
• Extracting and integrating temporal logic into learned control
Resilience:
• Adversarial robustness
Interpretability:
• Explaining decisions as sparse Boolean formula learning
• Inverse reinforcement learning of temporal specifications
TRINITY: Trust, Resilience and Interpretability
[Diagram: demonstrations and specifications flow into a learned controller acting on the world, with a human user in the loop.]
• Specification mining (RV'17)
• Uncertainty-aware synthesis from chance-constrained STL (FORMATS'16, NASA FM'16, FORMATS'18, JAR'18, ACC'19)
• Logic-guided and robust RL (DISE/ICML'18, Allerton Control'18)
• Verification: ML model + closed loop (NASA FM'18, ADHS'18, HSCC'19, VNN/AAAI'19)
• Resilience to adversarial attacks (MILCOM'18, NATO-SET'18, SafeML/ICLR'19)
• Explanations for the human user (NASA FM'17, JAR'18, NeurIPS'18, ConsciousAI/AAAI'19)
Ongoing work:
• U.S. Army Internet of Battlefield Things
• DARPA Assured Autonomy
• DARPA Competency-Aware Machine Learning
Need for explanation
Scalable but less interpretable: neural networks, support vector machines. Interpretable but less scalable: decision trees, linear regression.
Why did we take the San Mateo Bridge instead of the Bay Bridge?
• This route is faster.
• There is traffic on the Bay Bridge.
• There is an accident just after the Bay Bridge backing up traffic.
Local Explanations of Complex Models
Not reverse engineering an ML model, but finding an explanation locally for one decision: first a sufficient cause, then a simplified sufficient cause.
Local Explanations in AI
Not reverse engineering an ML model, but finding an explanation locally for one decision: a simplified sufficient cause. A local explanation g of a complex model f trades off a measure of how well g approximates f near the input x against a measure of the complexity of g:
ξ(x) = argmin_{g ∈ G} L(f, g, π_x) + Ω(g)
Formulation in AI:
• Ribeiro, M. T., Singh, S., and Guestrin, C. "Why Should I Trust You?: Explaining the Predictions of Any Classifier." ACM International Conference on Knowledge Discovery and Data Mining (KDD), 2016.
• Hayes, B., and Shah, J. A. "Improving Robot Controller Transparency Through Autonomous Policy Explanation." ACM/IEEE International Conference on Human-Robot Interaction (HRI), 2017.
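A minimal sketch of this local-surrogate idea, assuming numpy; the black-box f, the Gaussian perturbations, and the function name are illustrative stand-ins, not the cited papers' implementations, and the sparsity penalty Ω(g) is omitted for brevity:

import numpy as np

def local_linear_explanation(f, x, n_samples=500, sigma=1.0, seed=0):
    # Perturb x, query the black-box model f on each perturbation, weight
    # samples by proximity to x, and fit a weighted linear surrogate
    # g(z) = w . z + b that approximates f locally around x.
    rng = np.random.default_rng(seed)
    Z = x + rng.normal(scale=sigma, size=(n_samples, x.size))
    y = np.array([f(z) for z in Z])                 # black-box queries
    w_prox = np.exp(-np.sum((Z - x) ** 2, axis=1) / (2 * sigma ** 2))
    A = np.hstack([Z, np.ones((n_samples, 1))])     # intercept column
    coef, *_ = np.linalg.lstsq(A * np.sqrt(w_prox)[:, None],
                               y * np.sqrt(w_prox), rcond=None)
    return coef[:-1], coef[-1]                      # (w, b): local feature importances

# Usage: large |w_i| marks the features driving the decision near x.
f = lambda z: float(2.0 * z[0] - z[1] + 0.1 * z[2] ** 2 > 0)
w, b = local_linear_explanation(f, np.array([1.0, 0.5, -0.2]))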
Model Agnostic Explanation through Boolean Learning
Compare maps in which the optimum path goes via Green against maps in which it does not. Let each point in k dimensions (for some k) correspond to a map. Find a Boolean formula φ such that
φ ⇒ PathContainsGreen (a sufficient cause), or
φ ⇔ PathContainsGreen (a sufficient and necessary cause).
Why does the path not go through Green?
Explanations as Learning Boolean Formula
A black-box planner (e.g., A*) maps an input to an output.
φ_output: some property of the output (e.g., some cells not selected).
φ_explain: a formula over the explanation vocabulary (e.g., obstacle presence).
Find φ_explain such that φ_explain ⇒ φ_output (sufficient) or φ_explain ⇔ φ_output (sufficient and necessary).
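A toy sketch of checking the implication on sampled inputs; the six-variable vocabulary and the ground-truth property below are hypothetical placeholders for the planner-and-map setup:

import numpy as np

def implies(phi_explain, phi_output, samples):
    # phi_explain => phi_output holds on the samples iff no sample
    # satisfies phi_explain while violating phi_output.
    return all(phi_output(v) for v in samples if phi_explain(v))

# Toy vocabulary: v[i] = 1 iff cell i contains an obstacle (illustrative).
rng = np.random.default_rng(1)
samples = [tuple(rng.integers(0, 2, size=6)) for _ in range(200)]

# Hypothetical output property: the path avoids Green iff cells 2 and 4 are blocked.
phi_out = lambda v: bool(v[2] and v[4])
# Candidate explanation (strictly stronger, so the implication holds):
phi_exp = lambda v: bool(v[0] and v[2] and v[4])

print(implies(phi_exp, phi_out, samples))   # True on the sampled maps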
How difficult is it? Boolean formula learning
Scalability: the feature space, i.e., the vocabulary, is usually large. For a map it is of the order of the number of features in the map; for an image, of the order of the image's resolution. A 50x50 grid admits 2^(2^(50×50)) possible explanations even if the vocabulary only considers the presence/absence of obstacles.
Guarantee: is the sampled space of maps enough to generate the explanation with some quantifiable probabilistic guarantee?
Theoretical result: learning a Boolean formula even approximately is hard. 3-term DNF is not learnable in the Probably Approximately Correct (PAC) framework unless RP = NP.
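To make the blow-up concrete, a quick illustrative computation (not from the talk):

# The number of Boolean functions over n variables is 2^(2^n): each of the
# 2^n input assignments can independently be mapped to 0 or 1.
for n in [2, 3, 4, 5]:
    print(n, 2 ** (2 ** n))   # 16, 256, 65536, 4294967296
# A 50x50 obstacle grid gives n = 2500 variables, hence 2^(2^2500)
# candidate formulas -- far beyond any exhaustive enumeration.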
Two Key Ideas
The challenges: (1) the vocabulary is large; (2) it is unclear how many samples, and from what distribution, to consider for learning an explanation; (3) learning a Boolean formula with PAC guarantees is hard.
Key idea 1: actively learn the Boolean formula φ_explain rather than learning from a fixed sample.
Key idea 2: explanations are often short and involve only a few variables. In the running example the explanation involves only two variables; if we knew which two, there would be only 2^(2^2) = 16 possible explanations (enumerated in the sketch below). How do we find these relevant variables?
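A small sketch of this counting-and-filtering view, with hypothetical labeled observations; a candidate formula over two known variables is just a four-entry truth table:

from itertools import product

inputs = list(product([0, 1], repeat=2))        # (0,0), (0,1), (1,0), (1,1)
candidates = list(product([0, 1], repeat=4))    # all 2^(2^2) = 16 truth tables
print(len(candidates))                          # 16

# Keep only candidates consistent with observed (x_a, x_b) -> label pairs
# (the observations are illustrative, not the talk's data):
observations = {(0, 0): 0, (0, 1): 0, (1, 1): 1}
consistent = [t for t in candidates
              if all(t[inputs.index(x)] == y for x, y in observations.items())]
print(len(consistent))                          # 2 hypotheses remain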
Actively Learning Boolean Formula
A black-box planner (e.g., A*) is queried actively on chosen assignments to the vocabulary V, e.g., m1 = (0,0,0,1,1,0,1) and m2 = (0,0,1,1,0,1,0), where V uses the explanation vocabulary (e.g., obstacle presence). The oracle φ_output(V) evaluates an assignment, by running the planner, and returns True/False for some property of the output (e.g., some cells not selected).
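A minimal membership-oracle sketch; the planner stub and the avoids_green property are hypothetical, standing in for A* on a map:

class MembershipOracle:
    def __init__(self, planner, phi_output):
        self.planner = planner          # black box: assignment -> plan
        self.phi_output = phi_output    # property of the plan -> bool

    def query(self, assignment):
        # Run the black-box planner on the map encoded by the Boolean
        # assignment and report whether the output property holds.
        return self.phi_output(self.planner(assignment))

# Usage with toy stand-ins:
planner = lambda v: {"avoids_green": bool(v[2] and v[4])}
oracle = MembershipOracle(planner, lambda plan: plan["avoids_green"])
print(oracle.query((0, 0, 1, 1, 1, 0, 1)))   # True
print(oracle.query((0, 0, 0, 1, 1, 0, 1)))   # False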
Actively Learning Relevant Variables
Find U ⊆ V such that φ_explain(V) ≡ φ_explain(U), where |U| ≪ |V|: φ_explain is sparse. Start from labeled assignments to V, e.g., m1 = (0,0,0,1,1,0,1) with oracle answer m1 : True.
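One standard way to locate a relevant variable with oracle queries, sketched below; this is the generic positive-to-negative binary-search technique, not necessarily the talk's exact procedure, and the oracle here is a toy:

def find_relevant_variable(oracle, pos, neg):
    # Precondition: oracle(pos) is True and oracle(neg) is False.
    # Walk from pos toward neg along the differing indices and binary-search
    # for the single flip that changes the oracle's answer; that index is a
    # relevant variable, found with O(log |V|) queries instead of |V|.
    diff = [i for i in range(len(pos)) if pos[i] != neg[i]]

    def hybrid(k):
        # Oracle answer on pos with the first k differing positions set to neg.
        a = list(pos)
        for i in diff[:k]:
            a[i] = neg[i]
        return oracle(tuple(a))

    lo, hi = 0, len(diff)       # invariant: hybrid(lo) = True, hybrid(hi) = False
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if hybrid(mid):
            lo = mid
        else:
            hi = mid
    return diff[lo]             # flipping this position alone changes the answer

# Toy check: phi depends only on variables 2 and 4 (hypothetical oracle).
oracle = lambda v: bool(v[2] and v[4])
print(find_relevant_variable(oracle, (0, 0, 1, 1, 1, 0, 1), (1, 1, 0, 0, 0, 1, 0)))  # 2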