Efficient and Consistent Adversarial Bipartite Matching
Rizal Fathony*#, Sima Behpour*, Xinhua Zhang, Brian D. Ziebart
*) equal contribution  #) presenter
Bipartite Matching Tasks

Example: a matching between node sets B and A with n = 4, written as a permutation π = [4, 3, 1, 2].
- Maximum weighted bipartite matching: given pairwise weights, find the permutation π with the largest total weight.
- Machine learning task: learn the appropriate weights from data.
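As a minimal sketch (the weight matrix below is hypothetical), the maximum weighted bipartite matching for n = 4 can be computed with the Hungarian algorithm, e.g. via SciPy:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Hypothetical 4x4 weight matrix: W[i, j] = weight of matching
# node i+1 in set B to node j+1 in set A.
W = np.array([
    [1.0, 2.0, 0.5, 9.0],
    [0.2, 1.0, 8.0, 1.5],
    [7.0, 0.3, 2.0, 0.1],
    [0.4, 6.0, 1.0, 2.0],
])

# linear_sum_assignment minimizes cost, so negate to maximize weight.
rows, cols = linear_sum_assignment(-W)
pi = cols + 1                      # 1-indexed permutation, as on the slide
total = W[rows, cols].sum()
print(pi.tolist(), total)          # [4, 3, 1, 2] 30.0
```

With these weights the optimum is exactly the slide's example permutation π = [4, 3, 1, 2].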
Learning Bipartite Matching | Applications

1. Word alignment (Taskar et al., 2005; Padó & Lapata, 2006; MacCartney et al., 2008), e.g. aligning "natürlich ist das haus klein" with "of course the house is small".
2. Correspondence between images (Belongie et al., 2002; Dellaert et al., 2003).
3. Learning to rank documents (Dwork et al., 2001; Le & Smola, 2007).
Desiderata for a Predictor

- Learning objective: seek the pairwise potentials that are most compatible with the training data.
- Challenge: loss functions (e.g., the Hamming loss) are non-continuous and non-convex.
Desiderata for the predictor:
1. Efficiency: (low-degree) polynomial runtime.
2. Consistency: must also minimize the Hamming loss under ideal conditions (given the true distribution and fully expressive model parameters).
Exponential Family Random Field Approach (Petterson et al., 2009; Volkovs & Zemel, 2012)

- Probabilistic model over matchings.
- Consistent? Yes: produces the Bayes optimal prediction under ideal conditions.
- Efficient? No: the normalization term requires computing a matrix permanent, which is #P-hard (Valiant, 1979); impractical even for a modestly sized n = 20.
Maximum Margin Approach (Tsochantaridis et al., 2005)

- Max-margin (structured SVM) model.
- Efficient? Yes: polynomial-time algorithm for computing the maximally violated constraint (Hungarian algorithm).
- Consistent? No: based on the Crammer & Singer multiclass SVM formulation, which is not consistent for distributions with no majority label (Liu, 2007).
Adversarial Bipartite Matching (our approach)

Seek a predictor that robustly minimizes the Hamming loss against the worst-case mixture of permutations.
- Predictor: makes a probabilistic prediction P̂(π̂ | x); aims to minimize the loss; is pitted against an adversary instead of the empirical distribution.
- Adversary: makes a probabilistic prediction P̌(π̌ | x); aims to maximize the loss; constrained to select a distribution that matches the statistics of the empirical distribution P̃ via moment matching on the features φ(x, π) = Σ_{i=1}^{n} φ_i(x, π_i).
Adversarial Bipartite Matching | Dual

Lagrangian dual formulation of the adversarial bipartite matching objective (method of Lagrange multipliers, Von Neumann & Sion minimax duality), where θ is the dual variable for the moment-matching constraints.
- The inner game is played on the Hamming loss augmented with the Lagrangian potential terms.
- The augmented Hamming loss matrix (shown on the slide for n = 3) has size n! × n!: intractable for modestly sized n.
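The n! × n! blow-up is easy to see in a toy sketch: for n = 3 there are already 3! = 6 permutations, so even the unaugmented Hamming-loss game matrix is 6 × 6 (the dual adds potential terms on top of these entries):

```python
from itertools import permutations

import numpy as np

n = 3
perms = list(permutations(range(n)))   # all n! permutations

def hamming(p, q):
    """Number of positions where two permutations disagree."""
    return sum(pi != qi for pi, qi in zip(p, q))

# n! x n! Hamming loss matrix between every pair of permutations.
L = np.array([[hamming(p, q) for q in perms] for p in perms])
print(L.shape)     # (6, 6)
print(L[0])        # losses of (0, 1, 2) against all six permutations
```

Note that two distinct permutations can never disagree in exactly one position, so the off-diagonal entries are all 2 or 3 here.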
Efficient Algorithms

1. Double oracle method (constraint generation).
2. Marginal distribution formulation.
Double Oracle Method

- Based on the observation that the equilibrium is usually supported by a small number of permutations.
- Iterative procedure: maintain a small set of permutations for each player; repeatedly solve the game restricted to that set, then add each player's best response to the opponent's current mixed strategy (the slide illustrates this on the augmented Hamming loss matrix over π ∈ {123, 213, 312}, with the adversary's best responses 213, 312 and the predictor's best responses 213, 312 joining the sets).
- No formal polynomial bound is known: the runtime cannot be characterized as polynomial.
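A hedged sketch of one best-response step, for the plain Hamming game (the Lagrangian potential term would simply be added to the cost matrix): if the predictor's current mixture has marginal matrix Q with Q[i, j] = P̂(π̂_i = j), the adversary's expected Hamming loss for a permutation π̌ is Σ_i (1 − Q[i, π̌_i]), so maximizing it reduces to a minimum-weight matching under Q:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def adversary_best_response(Q):
    """Permutation maximizing expected Hamming loss against a predictor
    whose marginals are Q (Q[i, j] = prob. the predictor maps i to j).
    Expected loss = sum_i (1 - Q[i, perm[i]]), so maximizing it means
    minimizing sum_i Q[i, perm[i]] -- a min-weight assignment on Q."""
    _, cols = linear_sum_assignment(Q)
    return cols

# Predictor mixing permutations (0, 1, 2) and (1, 0, 2) equally:
Q = np.array([
    [0.5, 0.5, 0.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0],
])
br = adversary_best_response(Q)
expected_loss = 3 - Q[np.arange(3), br].sum()
print(br, expected_loss)
```

The predictor's best response is symmetric: against the adversary's marginals P it picks the *maximum*-weight matching under P, since that maximizes the expected number of agreements.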
Marginal Distribution Formulation

- Marginal distribution matrices: predictor Q with q_{i,j} = P̂(π̂_i = j), adversary P with p_{i,j} = P̌(π̌_i = j).
- Birkhoff–Von Neumann theorem: the doubly stochastic matrices form a convex polytope whose vertices are the permutation matrices (for n = 3: 123, 132, 213, 231, 312, 321).
- Reduces the space of optimization from O(n!) to O(n²).
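A minimal sketch of the reduction (the mixture weights are hypothetical): any mixture over permutations collapses to an n × n marginal matrix, and by Birkhoff–Von Neumann that matrix is doubly stochastic (rows and columns each sum to 1), with the converse also holding:

```python
import numpy as np

n = 3
# Hypothetical adversary mixture over permutations (0-indexed).
mixture = {(0, 1, 2): 0.5, (1, 2, 0): 0.3, (2, 1, 0): 0.2}

# Marginal matrix: P[i, j] = probability that item i is matched to j.
P = np.zeros((n, n))
for perm, prob in mixture.items():
    for i, j in enumerate(perm):
        P[i, j] += prob

# Rows and columns each sum to 1: P lies in the Birkhoff polytope,
# an O(n^2)-dimensional object replacing the O(n!) mixture weights.
print(P)
print(P.sum(axis=0), P.sum(axis=1))
```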
Marginal Formulation | Optimization

Optimization: add regularization and a smoothing penalty.
Techniques:
- Outer (Q): projected Quasi-Newton (Schmidt et al., 2009), with projection to the doubly stochastic matrices.
- Inner (θ): closed-form solution.
- Inner (P): projection to the doubly stochastic matrices.
- Projection to the doubly stochastic matrices: ADMM.
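As a stand-in for the ADMM step (the slide does not spell it out), here is a Dykstra-style alternating-projection sketch of Euclidean projection onto the Birkhoff polytope, cycling through the three constraint sets (row sums 1, column sums 1, nonnegativity); the input matrix is hypothetical:

```python
import numpy as np

def project_rows(X):
    # Euclidean projection onto {X : each row sums to 1}.
    return X - (X.sum(axis=1, keepdims=True) - 1.0) / X.shape[1]

def project_cols(X):
    # Euclidean projection onto {X : each column sums to 1}.
    return X - (X.sum(axis=0, keepdims=True) - 1.0) / X.shape[0]

def project_birkhoff(X, iters=500):
    """Dykstra's alternating projections onto the doubly stochastic
    matrices: row sums 1, column sums 1, nonnegative entries."""
    projections = [project_rows, project_cols,
                   lambda Z: np.maximum(Z, 0.0)]
    increments = [np.zeros_like(X) for _ in projections]
    Y = X.astype(float)
    for _ in range(iters):
        for k, proj in enumerate(projections):
            Z = proj(Y + increments[k])
            increments[k] = Y + increments[k] - Z
            Y = Z
    return Y

B = project_birkhoff(np.array([[2.0, -0.5], [0.1, 0.8]]))
print(B)
```

This is only a sketch of the projection sub-step, not the paper's ADMM solver; Dykstra's increments are what make the iterates converge to the true Euclidean projection rather than just a feasible point.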
Consistency

- Empirical risk perspective of adversarial bipartite matching.
- Consistency: our method also minimizes the Hamming loss in the ideal case; the arg-max of Q lies in the set of Bayes optimal responses.
Experiment Setup

- Application: video tracking.
- Empirical runtime (until convergence): the adversarial marginal formulation grows (roughly) quadratically in n, while the CRF (Petterson et al., 2009) is impractical even for n = 20.
- (Figure: table of relative runtimes, normalized to 1.0.)
Experiment Results

- Significantly outperforms SSVM on 6 dataset pairs; competitive with SSVM on the other 2.
- Adversarial double oracle: the equilibrium is supported by a small number of permutations.
Conclusions

- Exponential Family Random Field (Petterson et al., 2009; Volkovs & Zemel, 2012): consistent, but not efficient.
- Maximum Margin (Tsochantaridis et al., 2005): efficient, but not consistent.
- Adversarial Bipartite Matching (our approach): efficient, consistent, and performs well empirically.
THANK YOU