Active Learning for Probabilistic Structured Prediction of Cuts and Matchings
Sima Behpour, University of Pennsylvania; Anqi Liu, California Institute of Technology; Brian D. Ziebart, University of Illinois at Chicago
Motivation
a) Multi-label classification [Behpour et al. 2018] (figure: an image annotated with binary labels, e.g., Mountain, Person, Dog, Horse, Tree = 1; Sea, Ship, Sheep, Wolf = 0)
b) Video tracking (figure)
Labeling can be:
• Time consuming, e.g., document classification
• Expensive, e.g., medical decisions (requiring doctors)
• Sometimes dangerous, e.g., landmine detection
Motivation
Active learning methods such as uncertainty sampling, combined with probabilistic prediction techniques, have been successful [Lewis & Gale, 1994; Settles, 2012].
Previous methods:
➢ CRF ➔ Intractable
➢ SSVM / SVM with Platt scaling [Lambrou et al., 2012; Platt, 1999] ➔ Unreliable; interpretation is complicated for multi-class problems
Our approach
1. Leverage adversarial prediction methods [Behpour et al. 2018]:
- An adversarial approximation of the training data labels, Q̌(ž|y)
- A predictor, Q̂(ẑ|y), that minimizes the expected loss against the worst-case distribution chosen by the adversary
Our approach
2. Compute mutual information to measure the reduction in uncertainty [Guo and Greiner 2007]. The mutual information of two discrete random variables a and b (the amount of information shared between a and b):
I(a; b) = H(a) + H(b) − H(a, b)
where H(a, b) is the joint entropy of a and b, and H(a), H(b) are their marginal entropies.
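As a concrete check of this identity, a minimal sketch (not the paper's code) that computes I(a; b) = H(a) + H(b) − H(a, b) from a discrete joint distribution given as a table:

```python
import math

def entropy(dist):
    """Shannon entropy (bits) of a distribution given as {value: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

def mutual_information(joint):
    """I(a; b) = H(a) + H(b) - H(a, b) for a joint distribution
    given as {(a_val, b_val): probability}."""
    pa, pb = {}, {}
    for (a, b), p in joint.items():
        pa[a] = pa.get(a, 0.0) + p   # marginal over a
        pb[b] = pb.get(b, 0.0) + p   # marginal over b
    return entropy(pa) + entropy(pb) - entropy(joint)

# Perfectly correlated bits share one full bit of information:
print(mutual_information({(0, 0): 0.5, (1, 1): 0.5}))  # 1.0
```

Independent variables give I(a; b) = 0, since then H(a, b) = H(a) + H(b).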
Game Matrix for Multi-label Prediction
y = [Sea, Ship, Sheep, Horse, Dog, Person, Mountain, Wolf, Tree]
Adversary columns ž, each with probability P(ž):
ž = [0 0 1 0 1 1 0 1 1], P = 36%; ž = [0 0 0 0 0 1 1 1 1], P = 43%; ž = [0 0 0 1 1 0 1 1 1], P = 54%
Predictor rows ẑ:
ẑ = [0 1 0 1 0 1 1 0 1]; ẑ = [0 1 0 1 0 0 0 1 1]; ẑ = [1 1 1 0 0 1 1 0 1]
Each matrix entry is the loss L(ẑ, ž) plus a potential term χ(ž) for the adversary's column; a row's expected payoff is the P(ž)-weighted sum of its entries.
Sample selection strategy
The total expected reduction in uncertainty over all variables Z_1, ..., Z_n from observing a particular variable Z_k:
W_k = Σ_i I(Z_i; Z_k) = Σ_i [H(Z_i) − H(Z_i | Z_k)]
where H(Z_i) is the marginal entropy of Z_i.
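A minimal sketch of W_k, assuming the joint label distribution is available as an explicit table (the method derives it from the adversary's distribution; the 3-label joint below is a made-up example):

```python
import math
from collections import defaultdict

def _entropy(dist):
    """Shannon entropy (bits) of a distribution given as {value: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

def value_of_observing(joint, k):
    """W_k = sum_i I(Z_i; Z_k): total expected reduction in uncertainty
    over all label variables from observing Z_k. `joint` maps full label
    tuples z = (z_1, ..., z_n) to probabilities."""
    n = len(next(iter(joint)))
    total = 0.0
    for i in range(n):
        pij, pi, pk = defaultdict(float), defaultdict(float), defaultdict(float)
        for z, p in joint.items():
            pij[(z[i], z[k])] += p   # pairwise marginal of (Z_i, Z_k)
            pi[z[i]] += p            # marginal of Z_i
            pk[z[k]] += p            # marginal of Z_k
        total += _entropy(pi) + _entropy(pk) - _entropy(pij)
    return total

# Hypothetical 3-label distribution: Z_1 copies Z_0, Z_2 is independent.
joint = {(0, 0, 0): 0.25, (0, 0, 1): 0.25, (1, 1, 0): 0.25, (1, 1, 1): 0.25}
scores = [value_of_observing(joint, k) for k in range(3)]
# Observing Z_0 (or Z_1) resolves two variables; observing Z_2 only itself.
```

Note that the i = k term reduces to H(Z_k), so W_k always counts the solicited variable's own entropy.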
Active Learning for Cuts
1. Train a model on the labeled data pool (features φ_i, φ_{i,k})
2. Test the model and analyze the unlabeled data pool
3. Solicit the sample with the highest W_k
4. Add the sample to the labeled pool (or update it); return it to the unlabeled pool if any of its labels remain unannotated, e.g., Y = [? 1 ? ? ? ? ? ? ?]
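The solicitation loop can be sketched structurally; everything here (the score table, the oracle, the item keys) is a stand-in for illustration, and retraining after each solicitation is reduced to a comment:

```python
def active_learning_loop(unlabeled, oracle, score, rounds):
    """Skeleton of the solicitation loop: score every unlabeled variable
    by its value of information W_k and solicit the highest-scoring one.
    `score` and `oracle` are hypothetical interfaces, not the paper's."""
    labeled = {}
    for _ in range(min(rounds, len(unlabeled))):
        # Pick the unlabeled variable with the highest expected
        # reduction in uncertainty.
        k = max(unlabeled, key=score)
        labeled[k] = oracle(k)   # solicit the annotation
        unlabeled.remove(k)      # a real system would retrain here
    return labeled

# Toy run: score is a fixed table, oracle returns a constant "label".
w = {"a": 0.2, "b": 0.9, "c": 0.5}
pool = ["a", "b", "c"]
result = active_learning_loop(pool, oracle=lambda k: 1, score=w.get, rounds=2)
# Solicits "b" first, then "c"; "a" remains unlabeled.
```

In the actual method the scores change after every solicitation, since observing one variable reduces the entropy of the correlated ones, so W_k must be recomputed each round.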
Multi-label Experiments
Datasets (figure panels): a) Bibtex, b) Bookmarks, c) CAL500, d) Corel5K, e) Enron, f) NUS-WIDE, g) TMC2007, h) Yeast
Tracking Experiments
Sequences (figure panels): a) ETH-BAHNHOF, b) TUD-CAMPUS, c) TUD-STADTMITTE, d) ETH-SUN, e) BAHNHOF-PEDCROSS2, f) CAMPUS-STAD, g) SUN-PEDCROSS2, h) BAHNHOF-SUN
Conclusion
Leveraging adversarial structured predictions:
➢ Adversarial robust cut
➢ Adversarial bipartite matching
The adversary's probability distribution captures correlations between unknown label variables, which is useful for estimating the value of information for different annotation solicitation decisions.
Better performance and lower computational complexity.
Thank You! Please visit our poster at Pacific Ballroom #264