  1. Robustness in Sum-Product Networks with Continuous and Categorical Data. ISIPTA 2019 - Ghent, Belgium. R. C. de Wit 1, Cassio P. de Campos 1, D. Conaty 2, J. Martínez del Rincon 2. 1 Department of Information and Computing Sciences, Utrecht University, The Netherlands. 2 Centre for Data Science and Scalable Computing, Queen’s University Belfast, U.K. July 2019

  2. SPNs • Sum-Product Networks sacrifice “interpretability” for the sake of computational efficiency; they represent computations, not interactions (Poon & Domingos 2011). • Complex mixture distributions represented graphically as an arithmetic circuit (Darwiche 2001). [Figure: example SPN over Boolean variables a and b: a root sum node with weights 0.2, 0.3, 0.5 over three product nodes, each multiplying weighted sum nodes over the indicator leaves a, ā, b, b̄.]

  3. Sum-Product Network Distribution S(X1, ..., Xn) built from: • an indicator function over a single variable, e.g. I(X = 0), I(Y = 1) (also written ¬x, y); • a weighted sum of SPNs over the same domain, with nonnegative weights summing to 1, e.g. S3(X, Y) = 0.6 · S1(X, Y) + 0.4 · S2(X, Y); • a product of SPNs over disjoint domains, e.g. S3(X, Y, Z, W) = S1(X, Y) · S2(Z, W).
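The recursive definition above can be sketched in a few lines of code. This is a minimal illustration, not the authors' implementation; the class names (`Indicator`, `Sum`, `Product`) are made up for the example.

```python
# Minimal SPN evaluation sketch (hypothetical classes, not the paper's code).
class Indicator:
    """Leaf: indicator function I(var = value)."""
    def __init__(self, var, value):
        self.var, self.value = var, value
    def eval(self, assignment):
        return 1.0 if assignment[self.var] == self.value else 0.0

class Sum:
    """Weighted sum of SPNs over the same domain; weights sum to 1."""
    def __init__(self, weights, children):
        self.weights, self.children = weights, children
    def eval(self, assignment):
        return sum(w * c.eval(assignment) for w, c in zip(self.weights, self.children))

class Product:
    """Product of SPNs over disjoint domains."""
    def __init__(self, children):
        self.children = children
    def eval(self, assignment):
        p = 1.0
        for c in self.children:
            p *= c.eval(assignment)
        return p

# S3(X, Y) = 0.6 * S1(X, Y) + 0.4 * S2(X, Y), with S1, S2 products of indicators
s1 = Product([Indicator("X", 0), Indicator("Y", 1)])
s2 = Product([Indicator("X", 1), Indicator("Y", 1)])
s3 = Sum([0.6, 0.4], [s1, s2])
print(s3.eval({"X": 0, "Y": 1}))  # 0.6
```

A single bottom-up pass over the circuit evaluates the distribution, which is why inference is linear in the network size.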

  4. Sum-product networks - main computational points • Computing conditional probability values is very efficient (linear time). • Computing MAP instantiations is NP-hard in general (originally it was thought to be efficient), but is efficient in some cases.
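For intuition on the "efficient in some cases" remark, the common max-product pass (replace each sum with a weighted max and trace back the winning branch) can be sketched as below. This heuristic is exact only for special structures (e.g. selective SPNs); MAP remains NP-hard in general. The tuple encoding here is illustrative, not from the paper.

```python
# Hedged sketch: max-product MAP heuristic for an SPN encoded as nested tuples.
def max_product(node):
    kind = node[0]
    if kind == "leaf":                      # ("leaf", var, value): maximized at var = value
        return 1.0, {node[1]: node[2]}
    if kind == "prod":                      # ("prod", children): combine disjoint assignments
        val, assign = 1.0, {}
        for c in node[1]:
            v, a = max_product(c)
            val *= v
            assign.update(a)
        return val, assign
    best_v, best_a = -1.0, None             # ("sum", weights, children): best weighted branch
    for w, c in zip(node[1], node[2]):
        v, a = max_product(c)
        if w * v > best_v:
            best_v, best_a = w * v, a
    return best_v, best_a

spn = ("sum", [0.6, 0.4],
       [("prod", [("leaf", "X", 0), ("leaf", "Y", 1)]),
        ("prod", [("leaf", "X", 1), ("leaf", "Y", 1)])])
print(max_product(spn))  # (0.6, {'X': 0, 'Y': 1})
```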

  5. Credal Sum-Product Networks • Robustify SPNs by allowing weights to vary inside sets. • A class of tractable imprecise graphical models (like credal networks, they represent a set of distributions K(X)). [Figure: the example SPN with interval-valued weights w1, ..., w11, constrained by: (w1, w2, w3) ∈ CH([0.28, 0.45, 0.27], [0.18, 0.55, 0.27], [0.18, 0.45, 0.37]); 0.54 ≤ w4 ≤ 0.64; 0.36 ≤ w5 ≤ 0.46; 0.09 ≤ w6 ≤ 0.19; 0.81 ≤ w7 ≤ 0.91; 0.27 ≤ w8 ≤ 0.37; 0.63 ≤ w9 ≤ 0.73; 0.72 ≤ w10 ≤ 0.82; 0.18 ≤ w11 ≤ 0.28; w4 + w5 = 1; w6 + w7 = 1; w8 + w9 = 1; w10 + w11 = 1.]

  6. Credal sum-product networks - main computational points • Computing unconditional probability intervals is very efficient (quadratic time). • Computing conditional probability intervals is very efficient under some assumptions (quadratic time).
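To see why these interval computations stay tractable, consider a single sum node whose weights lie in intervals and must sum to 1: minimizing the node's value is a tiny linear program, solvable greedily by pushing the free mass onto the cheapest children. A hedged sketch (function name and example intervals are illustrative, reusing bounds from the earlier figure):

```python
# Sketch: lower value of a sum node in a (tree-shaped) credal SPN.
# Each weight w_i lies in [lo_i, hi_i] with sum_i w_i = 1.
# Greedy solution: start all weights at their lower bounds, then assign
# the remaining mass to the children with the smallest values first.
def min_sum_node(values, los, his):
    w = list(los)
    slack = 1.0 - sum(los)
    for i in sorted(range(len(values)), key=lambda i: values[i]):
        add = min(his[i] - los[i], slack)
        w[i] += add
        slack -= add
    return sum(wi * vi for wi, vi in zip(w, values))

# children values 0.2 and 0.9, weights in [0.54, 0.64] and [0.36, 0.46]
print(min_sum_node([0.2, 0.9], [0.54, 0.36], [0.64, 0.46]))  # 0.452
```

Propagating such local minimizations bottom-up gives the lower bound on an unconditional probability; maximization is symmetric.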

  7. Credal classification Given configurations c′, c′′ of the class variables C and evidence e, decide: ∀P : P(c′, e) > P(c′′, e) ⟺ min_w [S_w(c′, e) − S_w(c′′, e)] > 0. Credal classification with a single class variable can be done in polynomial time when each internal node has at most one parent. Note: structure learning algorithms may generate sum-product nets of the above form!

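The dominance test min_w [S_w(c′, e) − S_w(c′′, e)] > 0 can be illustrated by brute force on a toy one-node network; the paper's contribution is a polynomial algorithm for tree-shaped networks, so the enumeration below is purely for intuition, and `sw` is a made-up stand-in for S_w.

```python
# Hedged sketch: credal dominance on a toy CSPN with one sum node over
# two class indicators, weights (w4, w5) interval-constrained, w4 + w5 = 1.
def sw(w4, w5, c):
    """Toy stand-in for S_w(c, e): weight of the branch selecting class c."""
    a = 1.0 if c == 0 else 0.0
    return w4 * a + w5 * (1.0 - a)

def dominates(c1, c2, bounds):
    """c1 credally dominates c2 iff min over vertex weights of the difference is > 0."""
    best = float("inf")
    for w4 in bounds[0]:
        for w5 in bounds[1]:
            if abs(w4 + w5 - 1.0) > 1e-9:   # keep only weights summing to 1
                continue
            best = min(best, sw(w4, w5, c1) - sw(w4, w5, c2))
    return best > 0

# with w4 in [0.54, 0.64] and w5 in [0.36, 0.46], class 0 always wins:
print(dominates(0, 1, [(0.54, 0.64), (0.36, 0.46)]))  # True (worst-case margin 0.08)
```

For intervals the minimum of a linear objective is attained at a vertex of the weight set, which is why checking the extreme points suffices in this toy case.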

  9. Credal Sum-Product Networks with mixed variable types Theorem 1. Credal classification with a single class variable can be done in polynomial time when each internal node has at most one parent, in domains with mixed variable types (under mild assumptions). [Figure: the example credal SPN where indicator leaves are replaced by continuous densities d_A alongside the Boolean leaves b, b̄, with the same weight constraints as in the earlier figure: (w1, w2, w3) ∈ CH([0.28, 0.45, 0.27], [0.18, 0.55, 0.27], [0.18, 0.45, 0.37]); 0.54 ≤ w4 ≤ 0.64; 0.36 ≤ w5 ≤ 0.46; 0.09 ≤ w6 ≤ 0.19; 0.81 ≤ w7 ≤ 0.91; 0.27 ≤ w8 ≤ 0.37; 0.63 ≤ w9 ≤ 0.73; 0.72 ≤ w10 ≤ 0.82; 0.18 ≤ w11 ≤ 0.28; w4 + w5 = 1; w6 + w7 = 1; w8 + w9 = 1; w10 + w11 = 1.]

  10. Experiments - bol.com • 36707 orders analysed (51% legit, 49% fraud). • Expert achieves 94% accuracy. • 109 features reduced to 1 continuous (price) and 23 Boolean variables (each with at least a 9:1 split). • Robustness of a given testing instance is defined as the largest possible ε-contamination of local weights from an original sum-product network such that a single class is returned.
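The robustness measure just defined can be computed by searching over ε. A hedged sketch, where `classify_is_unique` is a hypothetical stand-in for the credal dominance test run on the SPN whose local weights w are relaxed to the ε-contaminated set {(1 − ε)w + εv : v any distribution}:

```python
# Sketch: robustness = largest eps such that the eps-contaminated
# credal SPN still returns a single class, found by binary search.
def robustness(classify_is_unique, tol=1e-4):
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if classify_is_unique(mid):
            lo = mid      # still a single class: classification robust at mid
        else:
            hi = mid      # classification becomes indeterminate: shrink
    return lo

# toy stand-in: classification stays unique up to eps = 0.3
print(round(robustness(lambda e: e <= 0.3), 3))
```

Binary search is valid here because contaminated credal sets grow monotonically with ε: once the class becomes indeterminate, it stays indeterminate for every larger ε.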

  11. Preliminary results [Results figure omitted in this transcript.]

  12. Preliminary discussion • If we only issued automatic classification for instances with robustness above 0.1, we would achieve accuracy similar to the expert’s on 15% of all analysed orders. • Robustness seems to work better than the probability value itself for identifying ‘easy-to-classify’ instances. • However, this is not an obvious gain for the company: the 15% of analysed orders that can be automatically classified well are typically easier, and the expert may do better than 94% accuracy on them (there is ongoing work to understand this better).


  14. Gradient decision tree boosting

  15. Thank you for your attention cassiopc@acm.org
