arXiv:1708.08296v1 [cs.AI] 28 Aug 2017 ABSTRACT were crucial - PDF document

EXPLAINABLE ARTIFICIAL INTELLIGENCE: UNDERSTANDING, VISUALIZING AND INTERPRETING DEEP LEARNING MODELS Wojciech Samek 1 , Thomas Wiegand 1 , 2 , Klaus-Robert M¨ uller 2 , 3 , 4 1 Dept. of Video Coding & Analytics, Fraunhofer Heinrich Hertz Institute, 10587 Berlin, Germany 2 Dept. of Computer Science,Technische Universit¨ at Berlin, 10587 Berlin, Germany 3 Dept. of Brain & Cognitive Engineering, Korea University, Seoul 136-713, South Korea 4 Max Planck Institute for Informatics, Saarbr¨ ucken 66123, Germany arXiv:1708.08296v1 [cs.AI] 28 Aug 2017 ABSTRACT were crucial factors to success. Today’s machine learning- based AI systems excel in a number of complex tasks ranging With the availability of large databases and recent improve- from the detection of objects in images [14] and the under- ments in deep learning methodology, the performance of AI standing of natural languages [8] to the processing of speech signals [10]. On top of that, recent AI 1 systems can even out- systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples play professional human players in difficult strategic games of this development can be found in domains such as image such as Go [34] and Texas hold’em poker [28]. These im- classification, sentiment analysis, speech understanding or mense successes of AI systems, especially deep learning strategic game playing. However, because of their nested models, show the revolutionary character of this technology, non-linear structure, these highly successful machine learn- which will have a large impact beyond the academic world ing and artificial intelligence models are usually applied in and will also give rise to disruptive changes in industries and a black box manner, i.e., no information is provided about societies. what exactly makes them arrive at their predictions. Since However, although these models reach impressive predic- this lack of transparency can be a major drawback, e.g., in tion accuracies, their nested non-linear structure makes them medical applications, the development of methods for visual- highly non-transparent, i.e., it is not clear what information izing, explaining and interpreting deep learning models has in the input data makes them actually arrive at their decisions. recently attracted increasing attention. This paper summa- Therefore these models are typically regarded as black boxes . rizes recent developments in this field and makes a plea for The 37th move in the second game of the historic Go match more interpretability in artificial intelligence. Furthermore, between Lee Sedol, a top Go player, and AlphaGo, an artifi- it presents two approaches to explaining predictions of deep cial intelligence system built by DeepMind, demonstrates the learning models, one method which computes the sensitiv- non-transparency of the AI system. AlphaGo played a move ity of the prediction with respect to changes in the input and which was totally unexpected and which was commented on one approach which meaningfully decomposes the decision by a Go expert in the following way: in terms of the input variables. These methods are evaluated on three classification tasks. “It’s not a human move. I’ve never seen a human play this move.” (Fan Hui, 2016). Index Terms — Artificial intelligence, deep neural networks, black box models, interpretability, sensitivity analysis, layer- Although during the match it was unclear why the system wise relevance propagation played this move, it was the deciding move for AlphaGo to win the game. In this case the black box character of the AlphaGo did not matter, but in many applications the impos- 1. INTRODUCTION sibility of understanding and validating the decision process of an AI system is a clear drawback. For instance, in medical The field of machine learning and artificial intelligence has diagnosis it would be irresponsible to trust predictions of a progressed over the last decades. A driving force for this black box system by default. Instead every far reaching de- development were earlier improvements in support vector cision should be made accessible for appropriate validation machines and more recent improvements in deep learning by a human expert. Also in self-driving cars, where a sin- methodology [22]. Also the availability of large databases gle incorrect prediction can be very costly, the reliance of the such as ImageNet [9] or Sports1M [17], the speed-up gains model on the right features must be guaranteed. The use of obtained with powerful GPU cards and the high flexibility of explainable and human interpretable AI models is a prereq- software frameworks such as Caffe [15] or TensorFlow [1] uisite for providing such a guarantee. More discussion on the This work was supported by the German Ministry for Education and Re- 1 The terms artificial intelligence and machine learning are used synony- search as Berlin Big Data Center BBDC (01IS14013A). We thank Gr´ egore Montavon for his valuable comments on the paper. mously.

arXiv:1708.08296v1 [cs.AI] 28 Aug 2017 ABSTRACT were crucial - PDF document

EXPLAINABLE ARTIFICIAL INTELLIGENCE: UNDERSTANDING, VISUALIZING AND INTERPRETING DEEP LEARNING MODELS Wojciech Samek 1 , Thomas Wiegand 1 , 2 , Klaus-Robert M uller 2 , 3 , 4 1 Dept. of Video Coding & Analytics, Fraunhofer Heinrich Hertz

Introductiontothelarge chargeexpansion Domenico Orlando Introduction Whos who S. Reffert

Michael Duff Imperial College London based on [arXiv:1301.4176 arXiv:1309.0546 arXiv:1312.6523

35 30 33 20 10 10 8 7 0 Feb 10 Aug 10 Feb 11 Aug 11 Feb 12 Aug 12 Feb 13 Aug 13

Introductiontothelarge chargeexpansion Domenico Orlando Introduction Whos who S. Reffert

Alargecharge torulestrongcoupling Domenico Orlando Introduction Whos who S. Reffert (AEC

Oleg Starykh University of Utah arXiv:1708.02980 E d i t o r s S u g g e s t

The Entropy of a Hole in Space-Time Based on: arXiv:1305.0856, arXiv:1310.4204, arXiv:1406.nnnn

Induced Ellipticity for Inspiraling Binary Systems LR w/ Zhong-Zhi Xianyu

Strategic Planning The Process Jul/Aug Online Consultation Survey, analysis 16 Aug Feedback to

Alpha-bits, Teleportation and Black Holes ArXiv:1706.09434, ArXiv:1807.06041 Geoffrey Penington,

Double Gauge Boson Production in the SM EFT Ian Lewis University of Kansas 1708.soon in

Nobody Solves the Quintic University of Sydney Undergraduate Lecture, August 2017 Abstract.

Dr G D Pol Foundation YMT College of Management PRESENTATION SCHEDULE M M S SEM III Week Roll

Close Down Governance Plan A summary of the remaining key governance meetings from 31 Jul 17 to 31

B. Blok, C. Jackel, M. Strikman, U. Wiedemann Hep-ph Arxive 1708.08241 Pictures from CMS 2016

Yong Tang University of Tokyo KEK-PH, 2018 YL. Wu & Y. Tang , 1708.05138, 1604.04701 Yong

Stream$Driven Galaxy Formation at High Redshift Avishai Dekel The Hebrew University of Jerusalem

Markus Ackermann SLAC National Accelerator Laboratory on behalf of the Fermi LAT collaboration

NO A Liquid Scintillator Production Stuart Mufson, Indiana University FroST 2016 March 19,

Populations R.W. Oldford Problem: Structure of human immunoglobulin G1 (IgG1) Recall exploring

All wireless communication stacks are equally broken Jiska Classen Secure Mobile Networking Lab

Replenishing the Microarchitecture Treasure Chest Prof. John Paul Shen Electrical and

RANDOM PHENOMENA FUNDAMENTALS OF PROBABILITY AND STATISTICS FOR ENGINEERS BABATUNDE A. OGUNNAIKE

What can we learn from averaging Cosmological Observables in

arXiv:1708.08296v1 [cs.AI] 28 Aug 2017 ABSTRACT were crucial - PDF document

EXPLAINABLE ARTIFICIAL INTELLIGENCE: UNDERSTANDING, VISUALIZING AND INTERPRETING DEEP LEARNING MODELS Wojciech Samek 1 , Thomas Wiegand 1 , 2 , Klaus-Robert M uller 2 , 3 , 4 1 Dept. of Video Coding & Analytics, Fraunhofer Heinrich Hertz

Introductiontothelarge chargeexpansion Domenico Orlando Introduction Whos who S. Reffert

Michael Duff Imperial College London based on [arXiv:1301.4176 arXiv:1309.0546 arXiv:1312.6523

35 30 33 20 10 10 8 7 0 Feb 10 Aug 10 Feb 11 Aug 11 Feb 12 Aug 12 Feb 13 Aug 13

Introductiontothelarge chargeexpansion Domenico Orlando Introduction Whos who S. Reffert

Alargecharge torulestrongcoupling Domenico Orlando Introduction Whos who S. Reffert (AEC

Oleg Starykh University of Utah arXiv:1708.02980 E d i t o r s S u g g e s t

The Entropy of a Hole in Space-Time Based on: arXiv:1305.0856, arXiv:1310.4204, arXiv:1406.nnnn

Induced Ellipticity for Inspiraling Binary Systems LR w/ Zhong-Zhi Xianyu

Strategic Planning The Process Jul/Aug Online Consultation Survey, analysis 16 Aug Feedback to

Alpha-bits, Teleportation and Black Holes ArXiv:1706.09434, ArXiv:1807.06041 Geoffrey Penington,

Double Gauge Boson Production in the SM EFT Ian Lewis University of Kansas 1708.soon in

Nobody Solves the Quintic University of Sydney Undergraduate Lecture, August 2017 Abstract.

Dr G D Pol Foundation YMT College of Management PRESENTATION SCHEDULE M M S SEM III Week Roll

Close Down Governance Plan A summary of the remaining key governance meetings from 31 Jul 17 to 31

B. Blok, C. Jackel, M. Strikman, U. Wiedemann Hep-ph Arxive 1708.08241 Pictures from CMS 2016

Yong Tang University of Tokyo KEK-PH, 2018 YL. Wu &amp; Y. Tang , 1708.05138, 1604.04701 Yong

Stream$Driven Galaxy Formation at High Redshift Avishai Dekel The Hebrew University of Jerusalem

Markus Ackermann SLAC National Accelerator Laboratory on behalf of the Fermi LAT collaboration

NO A Liquid Scintillator Production Stuart Mufson, Indiana University FroST 2016 March 19,

Populations R.W. Oldford Problem: Structure of human immunoglobulin G1 (IgG1) Recall exploring

All wireless communication stacks are equally broken Jiska Classen Secure Mobile Networking Lab

Replenishing the Microarchitecture Treasure Chest Prof. John Paul Shen Electrical and

RANDOM PHENOMENA FUNDAMENTALS OF PROBABILITY AND STATISTICS FOR ENGINEERS BABATUNDE A. OGUNNAIKE

What can we learn from averaging Cosmological Observables in

Yong Tang University of Tokyo KEK-PH, 2018 YL. Wu & Y. Tang , 1708.05138, 1604.04701 Yong