explainable deep learning and simulation approaches
play

Explainable (Deep) Learning and Simulation approaches Torsten - PowerPoint PPT Presentation

Explainable (Deep) Learning and Simulation approaches Torsten Mller Visualization and Data Analysis University of Vienna Explainable (Deep) Learning and Simulation approaches Torsten Mller Visualization and Data Analysis


  1. A modern microscope Models 
 Data Decisions (predictions) • making difficult algorithmic solutions accessible to a broad audience: enable model users to become model builders Summerschool, Sep 2018 Torsten Möller 42

  2. Modern microscope Visual Data Science = k-means scatterplot + kNN Making modelling techniques scatterplot accessible to a broad set of users without requiring a PhD in Stats/ = DBScan ML. ? + SVM Isomap Summerschool, Sep 2018 Torsten Möller 43

  3. Why?: Societal factors Summerschool, Sep 2018 Torsten Möller 44

  4. Ethics • cars make decisions on who to run over and who not • who should the company hire? • which update from which friend should you be shown? • which convict is more likely to re-offend? • which news item / movie should we recommend to people? https://www.ted.com/talks/zeynep_tufekci_machine_intelligence_makes_human_morals_more_important#t-157020 Summerschool, Sep 2018 Torsten Möller 45

  5. Laws • EU’s General Data Protection Regulation: • incl Article 22: Automated individual decision-making, including profiling • prohibits any “decision based solely on automated processing, including profiling” which “significantly affects” a data subject • Discrimination : Paragraph 71 of the recitals (the preamble to the GDPR, which explains the rationale behind it but is not itself law) explicitly requires data controllers to “implement appropriate technical and organizational measures” that “prevents, inter alia, discriminatory effects” on the basis of processing sensitive data • Right to explanation : Articles 13 and 14 state that, when profiling takes place, a data subject has the right to “meaningful information about the logic involved.” Goodman, B. & Flaxman, S. European Union regulations on algorithmic decision-making and a “right to explanation” AI Magazine, 2017 Summerschool, Sep 2018 Torsten Möller 46

  6. Outline today • Why explainable? - the promise of data science - extrinsic factors • How? - a process model for simulations - (machine) learning environments Summerschool, Sep 2018 Torsten Möller 47

  7. How? From Philip Grohs Summerschool, Sep 2018 Torsten Möller 48

  8. How? Alex Schindler Summerschool, Sep 2018 Torsten Möller 49

  9. How? Alex Schindler Summerschool, Sep 2018 Torsten Möller 50

  10. How — our approach https://youtu.be/5d71xhEbjDg Summerschool, Sep 2018 Torsten Möller 51

  11. FluidExplorer Fluid animation Summerschool, Sep 2018 Torsten Möller 52

  12. Special effects • Fluid simulation is heavily used in the motion picture industry • Most common animation packages include solvers or offer add-ons • Problem: Difficult to control for visual Summerschool, Sep 2018 Torsten Möller 53

  13. Special effects (2) • Tens of parameters • Hard to predict results • Time-consuming trial & error Autodesk Maya 2010 Summerschool, Sep 2018 Torsten Möller 54

  14. Overview Summerschool, Sep 2018 Torsten Möller 55

  15. Visualization • Body Level One - Body Level Two • Body Level Three • Body Level Four • Body Level Five Summerschool, Sep 2018 Torsten Möller 56

  16. Abstraction: 
 (visual) Parameter space exploration (vPSA) 57 Summerschool, Sep 2018 Torsten Möller

  17. Other tools Summerschool, Sep 2018 Torsten Möller 58

  18. Much recent attention in vPSA • Image segmentation [Torsney Weir et al. 2011] • Weather forecast [Potter et al. 2009] • Disaster simulation [Waser et al. 2010] • many more … [Torsney-Weir et al. 2011] [Bruckner & Möller 2010] [Bergner et al. 2013] [Piringer et al. 2010] [Amirkhanov et al. 2010] …etc. [Potter et al. 2009] [Waser et al. 2010] [Coffey et al. 2013] [Pretorius et al. 2011] Summerschool, Sep 2018 Torsten Möller 59

  19. Much recent attention in vPSA • Image segmentation [Torsney Weir et al. 2011] • Weather forecast [Potter et al. 2009] • Disaster simulation [Waser et al. 2010] • many more … [Torsney-Weir et al. 2011] [Bruckner & Möller 2010] [Bergner et al. 2013] [Piringer et al. 2010] [Amirkhanov et al. 2010] …etc. [Potter et al. 2009] [Waser et al. 2010] [Coffey et al. 2013] [Pretorius et al. 2011] Summerschool, Sep 2018 Torsten Möller 60

  20. Much recent attention in vPSA • Image segmentation [Torsney Weir et al. 2011] • Weather forecast [Potter et al. 2009] • Disaster simulation [Waser et al. 2010] • many more … [Torsney-Weir et al. 2011] [Bruckner & Möller 2010] [Bergner et al. 2013] [Piringer et al. 2010] [Amirkhanov et al. 2010] …etc. [Potter et al. 2009] [Waser et al. 2010] [Coffey et al. 2013] [Pretorius et al. 2011] Summerschool, Sep 2018 Torsten Möller 61

  21. Much recent attention in vPSA • Image segmentation [Torsney Weir et al. 2011] • Weather forecast [Potter et al. 2009] • Disaster simulation [Waser et al. 2010] • many more … [Torsney-Weir et al. 2011] [Bruckner & Möller 2010] [Bergner et al. 2013] [Piringer et al. 2010] [Amirkhanov et al. 2010] …etc. [Potter et al. 2009] [Waser et al. 2010] [Coffey et al. 2013] [Pretorius et al. 2011] Summerschool, Sep 2018 Torsten Möller 62

  22. Much recent attention in vPSA • comprehensive study of 21 different tools [Torsney-Weir et al. 2011] [Bruckner & Möller 2010] [Bergner et al. 2013] [Piringer et al. 2010] [Amirkhanov et al. 2010] …etc. [Potter et al. 2009] [Waser et al. 2010] [Coffey et al. 2013] [Pretorius et al. 2011] Summerschool, Sep 2018 Torsten Möller 63

  23. Data Flow Model Summerschool, Sep 2018 Torsten Möller 64

  24. Build an estimator Summerschool, Sep 2018 Torsten Möller 65

  25. Model Input Output Summerschool, Sep 2018 Torsten Möller 66

  26. Model Input Output Model • simulation model, prediction model, … • … but also algorithm • stochastic, deterministic • usually black box (to us as Vis researchers) Summerschool, Sep 2018 Torsten Möller 67

  27. Model Input Output Inputs • well chosen by the scientist, i.e. people care about their inputs • normally continuous (quantitative data) - need to sample the space • categorical data common too (e.g. use of a different algorithm) Summerschool, Sep 2018 Torsten Möller 68

  28. Model Input Output Outputs • typically complex objects, e.g. - 2D, 3D images (Tuner) - animations (FluidExplorer) - performance graphs (fuel cells) • hard to evaluate / compare many complex outputs Summerschool, Sep 2018 Torsten Möller 69

  29. Model Derive Derived Input Output Outputs Derive • one-dimensional (“goodness”) rating: d(O 1 ) • two-dimensional comparison: d(O 1 , O 2 ) • objective measures can be - exact (reliable) - approximate - about right, but not 100% precise - unknown (active learning) Summerschool, Sep 2018 Torsten Möller 70

  30. Complex objects (in 18/21 papers) Model 1.0 2.1 [Torsney-Weir et al. 2011] 3.7 Input Parameters Outputs 1.0 2.1 3.7 ? 6.3 3.3 5.2 2.2 2.1 2.0 1.1 5.6 7.8 … … … … … Summerschool, Sep 2018 Torsten Möller 71

  31. Derive objective measures 7.1 Model Derive 1.0 2.1 3.7 Summerschool, Sep 2018 Torsten Möller 72

  32. Surrogate models ? Model Derive 1.5 2.5 3.5 ? expensive! Summerschool, Sep 2018 Torsten Möller 73

  33. Surrogate models Model Derive 1.5 1.5 2.5 2.5 3.5 3.5 Surrogate Model Summerschool, Sep 2018 Torsten Möller 74

  34. Data flow model Direct Derived Input Model Derive Output Output Predicted Surrogate Model Output Summerschool, Sep 2018 Torsten Möller 75

  35. Navigation Strategies Summerschool, Sep 2018 Torsten Möller 76

  36. Navigation strategies • Trial and error (traditional approach) Summerschool, Sep 2018 Torsten Möller 77

  37. Navigation strategies • Trial and error (traditional approach) • Local —> global tweaking Design by Dragging [Coffey et al., SciVis 2013] Summerschool, Sep 2018 Torsten Möller 78

  38. Navigation strategies • Trial and error (traditional approach) • Local —> global tweaking • Global —> local exploration - FluidExplorer, Vismon, Tuner - many others: Paramorama [Pretorius et al., InfoVis 2011] Summerschool, Sep 2018 Torsten Möller 79

  39. Navigation strategies • Trial and error (traditional approach) • Local —> global tweaking • Global —> local exploration • Steering - simulation steering: e.g. real-time simulators - computational steering: e.g. change the grid size, stop if no insight World Lines [Waser et al., Vis 2010] Summerschool, Sep 2018 Torsten Möller 80

  40. Analysis Tasks Summerschool, Sep 2018 Torsten Möller 81

  41. Analysis tasks • Optimization • Partitioning • Fitting • Outliers • Uncertainty • Sensitivity Summerschool, Sep 2018 Torsten Möller 82

  42. Analysis tasks Find the best parameter • Optimization combination given some objectives. • Partitioning • Fitting Model 1 1 1 3 1 1 2 4 • Outliers 0 0 3 5 • Uncertainty • Sensitivity in 19/21 papers Summerschool, Sep 2018 Torsten Möller 83

  43. Analysis tasks How many different types of • Optimization model behaviors are possible? • Partitioning aka clustering • Fitting Model • Outliers 2 1 3 1 4 1 1 1 3 1 3 3 4 3 2 1 2 4 0 3 5 3 5 0 0 3 5 • Uncertainty • Sensitivity in 6/21 papers Summerschool, Sep 2018 Torsten Möller 84

  44. Analysis tasks Where in the input parameter • Optimization space would actual measured data occur? • Partitioning • Fitting aka regression analysis • Outliers Model Derive • Uncertainty ground truth • Sensitivity in 9/21 papers Summerschool, Sep 2018 Torsten Möller 85

  45. Analysis tasks • Optimization What outputs are special? • Partitioning Model • Fitting • Outliers • Uncertainty • Sensitivity in 9/21 papers Summerschool, Sep 2018 Torsten Möller 86

  46. Analysis tasks • Optimization How reliable is the output? • Partitioning • model vs. reality • non-deterministic 
 • Fitting Model model • model vs. surrogate • Outliers • Uncertainty • Sensitivity in 7/21 papers Summerschool, Sep 2018 Torsten Möller 87

  47. Analysis tasks What ranges/variations of • Optimization outputs to expect with changes of input? • Partitioning • Fitting Model • Outliers • Uncertainty • Sensitivity in 14/21 papers Summerschool, Sep 2018 Torsten Möller 88

  48. The (machine) learning process Summerschool, Sep 2018 Torsten Möller 89

  49. types of learning • regression • classification (supervised) • clustering (unsupervised) • (dimensionality reduction) • (outlier detection) Summerschool, Sep 2018 Torsten Möller 90

  50. techniques of learning • Neural Networks (plus Deep-NN) The world of ML algorithms is not as • Kernel methods (SVM) well organized in terms of strategies as it is with simulation environments. • Graphical models This is work in progress. • Ensemble methods • … Summerschool, Sep 2018 Torsten Möller 91

  51. A small selection: • confusion matrixes for classification • deep neural nets • understand / diagnose / refine • Explainers • LIME Summerschool, Sep 2018 Torsten Möller 92

  52. Confusion matrix • Google’s Facet: - http://gifctrl.com/?g=https:// 3.bp.blogspot.com/-T0dTxdse9Ow/ WWz0u431RpI/AAAAAAAAB5M/ rBvToJjx1L0FVVpXkgNOAwzXASyZC_JWw CLcBGAs/s1600/image4.gif - EuroVis keynote, 2017 — https:// www.youtube.com/watch?v=E70lG9-HGEM Summerschool, Sep 2018 Torsten Möller 93

  53. Squares Ren et al., 2017. Squares: Supporting inter- active performance analysis for multiclass classifiers. IEEE TVCG 23 (1), 61–70. Summerschool, Sep 2018 Torsten Möller 94

  54. Deep NN’s: 
 Neurons — point based Rauber, et al., 2017. Visualizing the hidden activity of artificial neural networks. IEEE TVCG 23 (1), 101–110 Summerschool, Sep 2018 Torsten Möller 95

  55. Deep NN’s: 
 Neurons — network based Tzeng, F.Y., Ma, K.L. 2005. Opening the black box - data driven visualization of neural networks. In: IEEE Visualization Summerschool, Sep 2018 Torsten Möller 96

  56. CNNVis http://shixialiu.com/publications/cnnvis/demo/ Summerschool, Sep 2018 Torsten Möller 97

  57. Conclusions • Why explainable? - improve algorithms - trust - bridge the model builder / model usage gap - ethics and law • How? - characterization of input-output relationships OR parameter tuning - understanding the behaviour of neurons in Deep NN - It is the “wild west” in terms of understanding machine learning models! Summerschool, Sep 2018 Torsten Möller 98

  58. Acknowledgments Steven Bergner Maryam Booshehrian Stephen Ingram Tom Torsney-Weir Hamid Younesy 
 Lorenz Linhardt SFU Muprime Tech Coho Data U of Vienna ETH Zurich SFU Stefan Bruckner Tamara Munzner Melanie Tory Harald Piringer Michael Sedlmair Patrick Wolf Tableau VRVis U of Bergen UBC U of Vienna Software Dev Summerschool, Sep 2018 Torsten Möller 99

  59. References • Visual Parameter Space Analysis: A Conceptual Framework. Michael Sedlmair, Christoph Heinzl, Stefan Bruckner, Harald Piringer, Torsten Möller, IEEE Transactions on Visualization and Computer Graphics 20(12):2161-2170, 2014. • eScience -- A Transformed Scientific Method. Jim Gray, (2007), in “The Fourth Paradigm: Data-Intensive Scientific Discovery”, 2009. • Google Facet, https://pair-code.github.io/facets/, Jul 2017. • Squares: Supporting Interactive Performance Analysis for Multiclass Classifiers, D. Ren, S. Amershi, B. Lee, J. Suh and J. D. Williams, IEEE Transactions on Visualization and Computer Graphics, vol. 23, no. 1, pp. 61-70, Jan. 2017. • Visualizing the Hidden Activity of Artificial Neural Networks, P. E. Rauber, S. G. Fadel, A. X. Falcão and A. C. Telea, IEEE Transactions on Visualization and Computer Graphics, vol. 23, no. 1, pp. 101-110, Jan. 2017. • Towards Better Analysis of Deep Convolutional Neural Networks. Mengchen Liu, Jiaxin Shi, Zhen Li, Chongxuan Li, Jun Zhu, and Shixia Liu. IEEE Transactions on Visualization and Computer Graphics 23, 1 (January 2017), 91-100. Summerschool, Sep 2018 Torsten Möller 100

Recommend


More recommend