MELODI M achin E L earning, O ptimization, & D ata I - PowerPoint PPT Presentation

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary The Lov´ asz-Bregman Divergence and Connections to Rank Aggregation, Clustering, and Web Ranking Rishabh Iyer Jeff Bilmes University of Washington, Seattle UAI-2013 MELODI M achin E L earning, O ptimization, & D ata I nterpretation @ UW Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 1 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Outline Ranking and Machine Learning 1 The Lov´ asz-Bregman divergences 2 Properties of the Lov´ asz-Bregman 3 Applications 4 Summary 5 Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 2 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Occur in a number of Machine Learning applications: Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 3 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Occur in a number of Machine Learning applications: Combining Classifiers (Lebanon & Lafferty, 2002) Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 3 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Occur in a number of Machine Learning applications: 1) Munich 1) Seattle 1) Munich 2) Paris 2) Munich 2) Seattle 3) London 3) London 3) London 4) Seattle 4) Atlanta 4) Paris 5) Atlanta 5) Paris 5) Atlanta Aggregating Preferences Combining Classifiers (Murphy & Martin, (Lebanon & Lafferty, 2002) 2003) Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 3 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Occur in a number of Machine Learning applications: 1) Munich 1) Seattle 1) Munich 2) Paris 2) Munich 2) Seattle 3) London 3) London 3) London 4) Seattle 4) Atlanta 4) Paris 5) Atlanta 5) Paris 5) Atlanta Aggregating Preferences Combining Classifiers (Murphy & Martin, (Lebanon & Lafferty, 2002) Web Ranking (Liu, 2009) 2003) Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 3 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Denote { σ 1 , σ 2 , . . . , σ k } as a set of k permutations. Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Denote { σ 1 , σ 2 , . . . , σ k } as a set of k permutations. Some important problems concerning rankings: Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Denote { σ 1 , σ 2 , . . . , σ k } as a set of k permutations. Some important problems concerning rankings: Combining Permutations: Given permutations σ 1 , σ 2 , · · · , σ k , find 1 a representative σ , which is “close“ to σ 1 , σ 2 , · · · , σ k . Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Denote { σ 1 , σ 2 , . . . , σ k } as a set of k permutations. Some important problems concerning rankings: Combining Permutations: Given permutations σ 1 , σ 2 , · · · , σ k , find 1 a representative σ , which is “close“ to σ 1 , σ 2 , · · · , σ k . Combining Scores: Given a set of score vectors x 1 , x 2 , · · · , x k , find 2 a representative σ , which is “close“ to x 1 , x 2 , · · · , x k . Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Combining Scores and Rankings Denote σ as a permutation of { 1 , 2 , · · · , n } such that σ ( i ) denotes the item at rank i and σ − 1 ( i ) as the rank of item i . Denote { σ 1 , σ 2 , . . . , σ k } as a set of k permutations. Some important problems concerning rankings: Combining Permutations: Given permutations σ 1 , σ 2 , · · · , σ k , find 1 a representative σ , which is “close“ to σ 1 , σ 2 , · · · , σ k . Combining Scores: Given a set of score vectors x 1 , x 2 , · · · , x k , find 2 a representative σ , which is “close“ to x 1 , x 2 , · · · , x k . Clustering: Cluster the set of permutations σ 1 , σ 2 , · · · , σ k (or 3 equivalently score vectors x 1 , x 2 , · · · , x k ). Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 4 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Rank aggregation Combine a set of rankings σ 1 , σ 2 , · · · , σ k . Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 5 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Rank aggregation Combine a set of rankings σ 1 , σ 2 , · · · , σ k . Rank Aggregation . . . Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 5 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Rank aggregation Combine a set of rankings σ 1 , σ 2 , · · · , σ k . Rank Aggregation . . . Often done using permutation based distance metrics. Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 5 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Permutation based Distance Metrics d ( σ, π ) Metric on the space of permutations. Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 6 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Permutation based Distance Metrics d ( σ, π ) Metric on the space of permutations. Kendall τ , � I ( σ − 1 π ( i ) > σ − 1 π ( j )) d T ( σ, π ) = i , j , i < j and Spearman’s footrule: n � | σ − 1 ( i ) − π − 1 ( i ) | d S ( σ, π ) = i =1 Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 6 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Permutation based Distance Metrics d ( σ, π ) Metric on the space of permutations. Kendall τ , � I ( σ − 1 π ( i ) > σ − 1 π ( j )) d T ( σ, π ) = i , j , i < j and Spearman’s footrule: n � | σ − 1 ( i ) − π − 1 ( i ) | d S ( σ, π ) = i =1 Invariance with respect to re-orderings – i.e d ( πσ, πτ ) = d ( σ, τ ). Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 6 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Permutation based Distance Metrics d ( σ, π ) Metric on the space of permutations. Kendall τ , � I ( σ − 1 π ( i ) > σ − 1 π ( j )) d T ( σ, π ) = i , j , i < j and Spearman’s footrule: n � | σ − 1 ( i ) − π − 1 ( i ) | d S ( σ, π ) = i =1 Invariance with respect to re-orderings – i.e d ( πσ, πτ ) = d ( σ, τ ). Given a set of permutations σ 1 , σ 2 , · · · , σ k , find a permutation σ : k � σ = argmin d ( σ i , π ) (1) π i =1 Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 6 / 24

Ranking and Machine Learning The Lov´ asz-Bregman divergences Properties of the Lov´ asz-Bregman Applications Summary Score Aggregation What if one has scores instead of just the orderings? For example, Iyer & Bilmes, 2013 Lov´ asz Bregman Divergences page 7 / 24

MELODI M achin E L earning, O ptimization, & D ata I - PowerPoint PPT Presentation

Ranking and Machine Learning The Lov asz-Bregman divergences Properties of the Lov asz-Bregman Applications Summary The Lov asz-Bregman Divergence and Connections to Rank Aggregation, Clustering, and Web Ranking Rishabh Iyer Jeff

Article 31 RIHSS Report on Individual Radiosensitivity Dr Patrick Smeesters EC Art31, RP Advisor

28 AUGUST 2018 PRESENTED BY ABIAH MELODI WHAT IS THE OFO The OFO is a skills-based coded

UNSCEAR White Paper: Biological Mechanisms of Radiation Actions at Low Doses Simon Bouffler 8

MELODI M achin E L earning, O ptimization, & D ata I nterpretation @ UW Iyer & Bilmes,

Roger Harrison Chair, EURADOS Working Group 9: Radiation Dosimetry in Radiotherapy Tuesday, 10

Radiological Risk from Low Dose and Low Dose-Rate Exposures: An Epidemiologic Perspective

Deep Canonical Correlation Analysis Galen Andrew 1 Raman Arora 2 Jeff Bilmes 1 Karen Livescu 2 1

Distributional Semantics The unsupervised modeling of meaning on a large scale Tim Van de Cruys

The Vocal Joystick: Voice-based Continuous Control of Electro-mechanical Devices Jeff Bilmes

Preferences in college applications A non-parametric Bayesian analysis of top-10 rankings Alnur

06.03.2015 20:33:15 These could be pictures of another planet or the set of a science fiction

EECS 4314 Advanced Software Engineering Topic 05: Design Pattern Review Zhen Ming (Jack) Jiang

Transportation in the Future November 23, 2012 UDLS, November 23, 2012 Future Transportation

Strong Consistency of the AIC, BIC, C p and KOO Methods in High-Dimensional-Response Regression

Day 5: Model Selection I Lucas Leemann Essex Summer School Introduction to Statistical Learning

The Problem of Overfitting The Problem of Overfitting BR data: neural network with 20%

Unit 7: Multiple linear regression 1. Introduction to multiple linear regression Sta 101 - Fall

Selection for Feature-Based Image Registration F. Brunet 1,2 , A. Bartoli 1 , N. Navab 2 , and R.

Variable selection STAT 401 - Statistical Methods for Research Workers Jarad Niemi Iowa State

Aspects of Group Theory in Stochastic Problems Dr. Marconi Barbosa NICTA/ANU, Canberra, Australia

Devavrat Shah Laboratory for Information and Decision Systems

4. Model evaluatjon & selectjon Chlo-Agathe Azencot Centre for Computatjonal Biology, Mines

STAT 213 Model Selection II Colin Reimer Dawson Oberlin College March 30, 2018 1 / 13 Outline

STAT 213 ANOVA as Multiple Regression Colin Reimer Dawson Oberlin College 5 April 2016 Outline

Sambuz

Useful Links

Newsletter

Mail Us

MELODI M achin E L earning, O ptimization, & D ata I - PowerPoint PPT Presentation

Ranking and Machine Learning The Lov asz-Bregman divergences Properties of the Lov asz-Bregman Applications Summary The Lov asz-Bregman Divergence and Connections to Rank Aggregation, Clustering, and Web Ranking Rishabh Iyer Jeff

Article 31 RIHSS Report on Individual Radiosensitivity Dr Patrick Smeesters EC Art31, RP Advisor

28 AUGUST 2018 PRESENTED BY ABIAH MELODI WHAT IS THE OFO The OFO is a skills-based coded

UNSCEAR White Paper: Biological Mechanisms of Radiation Actions at Low Doses Simon Bouffler 8

MELODI M achin E L earning, O ptimization, &amp; D ata I nterpretation @ UW Iyer &amp; Bilmes,

Roger Harrison Chair, EURADOS Working Group 9: Radiation Dosimetry in Radiotherapy Tuesday, 10

Radiological Risk from Low Dose and Low Dose-Rate Exposures: An Epidemiologic Perspective

Deep Canonical Correlation Analysis Galen Andrew 1 Raman Arora 2 Jeff Bilmes 1 Karen Livescu 2 1

Distributional Semantics The unsupervised modeling of meaning on a large scale Tim Van de Cruys

The Vocal Joystick: Voice-based Continuous Control of Electro-mechanical Devices Jeff Bilmes

Preferences in college applications A non-parametric Bayesian analysis of top-10 rankings Alnur

06.03.2015 20:33:15 These could be pictures of another planet or the set of a science fiction

EECS 4314 Advanced Software Engineering Topic 05: Design Pattern Review Zhen Ming (Jack) Jiang

Transportation in the Future November 23, 2012 UDLS, November 23, 2012 Future Transportation

Strong Consistency of the AIC, BIC, C p and KOO Methods in High-Dimensional-Response Regression

Day 5: Model Selection I Lucas Leemann Essex Summer School Introduction to Statistical Learning

The Problem of Overfitting The Problem of Overfitting BR data: neural network with 20%

Unit 7: Multiple linear regression 1. Introduction to multiple linear regression Sta 101 - Fall

Selection for Feature-Based Image Registration F. Brunet 1,2 , A. Bartoli 1 , N. Navab 2 , and R.

Variable selection STAT 401 - Statistical Methods for Research Workers Jarad Niemi Iowa State

Aspects of Group Theory in Stochastic Problems Dr. Marconi Barbosa NICTA/ANU, Canberra, Australia

Devavrat Shah Laboratory for Information and Decision Systems

4. Model evaluatjon &amp; selectjon Chlo-Agathe Azencot Centre for Computatjonal Biology, Mines

STAT 213 Model Selection II Colin Reimer Dawson Oberlin College March 30, 2018 1 / 13 Outline

STAT 213 ANOVA as Multiple Regression Colin Reimer Dawson Oberlin College 5 April 2016 Outline

Sambuz

Useful Links

Newsletter

Mail Us

MELODI M achin E L earning, O ptimization, & D ata I nterpretation @ UW Iyer & Bilmes,

4. Model evaluatjon & selectjon Chlo-Agathe Azencot Centre for Computatjonal Biology, Mines