Minimum Bayes Risk SFPLODD September 24, 2013 Some - PowerPoint PPT Presentation

Minimum ¡Bayes ¡Risk ¡ SFPLODD ¡ September ¡24, ¡2013 ¡

Some ¡Things ¡You ¡Know ¡ • How ¡to ¡decode ¡by ¡finding ¡the ¡single ¡best ¡ global ¡structure ¡ – Lots ¡of ¡ways ¡to ¡think ¡about ¡the ¡algorithms ¡ • How ¡to ¡find ¡posterior ¡marginals ¡for ¡ “parts” ¡(a.k.a. ¡“cliques”), ¡if ¡we ¡interpret ¡ scoring ¡probabilisQcally ¡

A ¡Different ¡View ¡of ¡Decoding ¡ • Cost ¡(someQmes ¡called ¡“loss”): ¡ ¡a ¡funcQon ¡that ¡ tells ¡how ¡bad ¡every ¡guess ¡y ¡is, ¡given ¡every ¡correct ¡ answer ¡y*: ¡ cost ¡: ¡Val(Y) ¡× ¡Val(Y) ¡→ ¡[0, ¡∞) ¡ • Risk : ¡ ¡pretend ¡Y* ¡is ¡random ¡and ¡distributed ¡ according ¡to ¡your ¡model ¡distribuQon; ¡risk ¡is ¡the ¡ expectaQon ¡of ¡cost, ¡for ¡a ¡given ¡y: ¡ risk: ¡Val(Y) ¡→ ¡[0, ¡∞) ¡ • MBR ¡decoding : ¡ ¡pick ¡the ¡y ¡that ¡minimizes ¡risk. ¡ p ( y ∗ | x ) × cost( y , y ∗ ) X arg min y y ∗ ∈ Y

DerivaQon ¡ X y E p ( x , Y ∗ ) [cost( y , Y ∗ )] = min p ( x , y ∗ ) × cost( y , y ∗ ) min y y ∗ ∈ Y p ( x ) × p ( y ∗ | x ) × cost( y , y ∗ ) X = min y y ∗ ∈ Y p ( y ∗ | x ) × cost( y , y ∗ ) X = p ( x ) × min y y ∗ ∈ Y

Example ¡1: ¡ ¡Posterior ¡Decoding ¡ • model: ¡ ¡sequence ¡labeling ¡with ¡bigram ¡label ¡factors ¡ • cost(y, ¡y*): ¡ ¡number ¡of ¡tokens ¡you ¡mislabeled ¡ (someQmes ¡called ¡“Hamming” ¡cost) ¡ • risk(y): ¡ ¡expected ¡number ¡of ¡mislabeled ¡tokens ¡in ¡y ¡ ¡ " n ¡ # n p ( y ∗ | x ) X X X 1 { y i 6 = y ∗ i } = E p ( Y ∗ | x ) 1 { y i 6 = Y ∗ i } ¡ ¡ y ∗ i =1 i =1 n X = E p ( Y ∗ | x ) [ 1 { y i 6 = Y ∗ i } ] i =1 n X � � = 1 � E p ( Y ∗ | x ) [ 1 { y i = Y ∗ i } ] i =1

Example ¡2: ¡ ¡0-‑1 ¡cost ¡ • model: ¡ ¡anything ¡ • cost(y, ¡y*): ¡ ¡0 ¡if ¡y ¡= ¡y*, ¡1 ¡otherwise ¡ • risk(y): ¡ ¡1 ¡– ¡p(y ¡| ¡x) ¡ ¡

Example ¡3: ¡ ¡Maximum ¡Expected ¡Recall ¡ (Goodman, ¡1996) ¡ • model: ¡ ¡PCFG ¡ • cost(y, ¡y*) ¡= ¡number ¡of ¡labeled ¡spans ¡in ¡y* ¡ that ¡are ¡not ¡in ¡y ¡ • risk(y) ¡= ¡sum ¡of ¡ ¡ (1 ¡-‑ ¡posterior ¡probability ¡of ¡a ¡labeled ¡span) ¡

Example ¡4: ¡ ¡WeighQng ¡Different ¡BIO ¡ Errors ¡ • model: ¡ ¡BIO ¡ • cost: ¡ ¡different ¡costs ¡for ¡recall, ¡precision, ¡and ¡ boundary ¡errors: ¡ correct: ¡ B-‑B ¡ B-‑I ¡ B-‑O ¡ I-‑B ¡ I-‑I ¡ I-‑O ¡ O-‑B ¡ O-‑O ¡ B-‑B ¡ split ¡ prec. ¡ split ¡ prec. ¡ prec. ¡ B-‑I ¡ merge ¡ bound. ¡ merge ¡ bound. ¡ bound. ¡ bound. ¡ B-‑O ¡ recall ¡ recall ¡ recall ¡ bound. ¡ recall ¡ I-‑B ¡ split ¡ prec. ¡ split ¡ prec. ¡ prec. ¡ I-‑I ¡ merge ¡ bound. ¡ merge ¡ bound. ¡ bound. ¡ bound. ¡ I-‑O ¡ recall ¡ recall ¡ recall ¡ bound. ¡ recall ¡ O-‑B ¡ prec. ¡ prec. ¡ bound. ¡ prec. ¡ prec. ¡ O-‑O ¡ recall ¡ recall ¡ recall ¡ recall ¡

General ¡MBR ¡Algorithm ¡ Assump4on : ¡ ¡cost ¡factors ¡locally ¡into ¡parts ¡ 1. Calculate ¡posterior ¡distribuQon ¡for ¡each ¡part ¡ (generalized ¡inside ¡algorithm) ¡ 2. If ¡parts ¡don’t ¡overlap, ¡pick ¡local ¡argmax ¡for ¡ each ¡part. ¡ 3. Otherwise, ¡decode ¡with ¡a ¡model ¡that ¡ defines: ¡ ¯ f j, π ( π 0 ) = − localcost( π , π 0 ) w j, π = p (part j = π | x ) ¯

Pop ¡Quiz ¡ Can ¡you ¡think ¡of ¡a ¡cost ¡funcQon ¡such ¡that ¡ minimum ¡Bayes ¡risk ¡decoding ¡ can’t ¡be ¡done ¡in ¡ polynomial ¡Qme? ¡

Minimum Bayes Risk SFPLODD September 24, 2013 Some - PowerPoint PPT Presentation

Minimum Bayes Risk SFPLODD September 24, 2013 Some Things You Know How to decode by finding the single best global structure Lots of ways

Naive Bayes and Gaussian Bayes Classifier Ladislav Rampasek slides by Mengye Ren and others

The Nave Bayes Classifier Machine Learning 1 Todays lecture The nave Bayes Classifier

Bayes Theorem Thomas Bayes (1701-1761) Simple form of Bayes Theorem, for

Minimum Bayes-Risk Methods in Automatic Speech Recognition Vaibhava Goel IBM William Byrne

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

DATA MINING: NAVE BAYES 1 Nave Bayes Classifier Thomas Bayes 1702 - 1761 We will start off

Cognitive Modeling Unseen Examples 2 Bayes Classifiers Lecture 14: Naive Bayes Classifiers

STAT 339 Naive Bayes Classification 8-10 March 2017 Colin Reimer Dawson Outline Naive Bayes

Bayes Classifiers Nave Bayes Classification Patrick Mair Bayes Classifiers Weather data

I ntroduction to Mobile Robotics Bayes Filter Kalm an Filter Wolfram Burgard 1 Bayes

Bayesian Learning Bayes Theorem MAP, ML hypotheses MAP learners Minimum description

Risk Management Workshop 1 Risk management workshop Why do we Risk Risk and need risk

Formal Modeling in Cognitive Science Independence Lecture 23: Conditional Probability; Bayes

Nave Bayes Classification Nickolai Riabov, Kenneth Tiong Brown University Fall 2013 Nickolai

BAYES FORMULA a two-stage experiment Xingru Chen xingru.chen.gr@dartmouth.edu XC 2020

Another Walkthrough of Variational Bayes Bevan Jones ML for NLP Reading Group The University of

Physics 2D Lecture Slides Lecture 13: Feb 2 nd 2004 Vivek Sharma UCSD Physics Quiz 3 14 12

Model-Based Software Engineering and Certification: Some Open Issues Stefano Russo, Fabio

Pediatrician & Chief of Digital Innovation Seattle Mama Doc Seattle Childrens Hospital

City of Oakland Economic Recovery Advisory Council May 18, 2020 June 1, 2020 Monday, June 1,

Recent developments in the area of SoftQCD and Diffractive Physics at the ATLAS Experiment

Asynchronous Timed Session Types & Processes Laura Bocchi University of Kent Maurizio

What about Retrofit Design of Heat Exchanger Networks ? Process, Energy and System Optimal

Lecture 15: Sequential Networks Finite State Machines Moore and Mealy (contd) CSE 140:

Minimum Bayes Risk SFPLODD September 24, 2013 Some - PowerPoint PPT Presentation

Minimum Bayes Risk SFPLODD September 24, 2013 Some Things You Know How to decode by finding the single best global structure Lots of ways

Naive Bayes and Gaussian Bayes Classifier Ladislav Rampasek slides by Mengye Ren and others

The Nave Bayes Classifier Machine Learning 1 Todays lecture The nave Bayes Classifier

Bayes Theorem Thomas Bayes (1701-1761) Simple form of Bayes Theorem, for

Minimum Bayes-Risk Methods in Automatic Speech Recognition Vaibhava Goel IBM William Byrne

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

DATA MINING: NAVE BAYES 1 Nave Bayes Classifier Thomas Bayes 1702 - 1761 We will start off

Cognitive Modeling Unseen Examples 2 Bayes Classifiers Lecture 14: Naive Bayes Classifiers

STAT 339 Naive Bayes Classification 8-10 March 2017 Colin Reimer Dawson Outline Naive Bayes

Bayes Classifiers Nave Bayes Classification Patrick Mair Bayes Classifiers Weather data

I ntroduction to Mobile Robotics Bayes Filter Kalm an Filter Wolfram Burgard 1 Bayes

Bayesian Learning Bayes Theorem MAP, ML hypotheses MAP learners Minimum description

Risk Management Workshop 1 Risk management workshop Why do we Risk Risk and need risk

Formal Modeling in Cognitive Science Independence Lecture 23: Conditional Probability; Bayes

Nave Bayes Classification Nickolai Riabov, Kenneth Tiong Brown University Fall 2013 Nickolai

BAYES FORMULA a two-stage experiment Xingru Chen xingru.chen.gr@dartmouth.edu XC 2020

Another Walkthrough of Variational Bayes Bevan Jones ML for NLP Reading Group The University of

Physics 2D Lecture Slides Lecture 13: Feb 2 nd 2004 Vivek Sharma UCSD Physics Quiz 3 14 12

Model-Based Software Engineering and Certification: Some Open Issues Stefano Russo, Fabio

Pediatrician &amp; Chief of Digital Innovation Seattle Mama Doc Seattle Childrens Hospital

City of Oakland Economic Recovery Advisory Council May 18, 2020 June 1, 2020 Monday, June 1,

Recent developments in the area of SoftQCD and Diffractive Physics at the ATLAS Experiment

Asynchronous Timed Session Types &amp; Processes Laura Bocchi University of Kent Maurizio

What about Retrofit Design of Heat Exchanger Networks ? Process, Energy and System Optimal

Lecture 15: Sequential Networks Finite State Machines Moore and Mealy (contd) CSE 140:

Pediatrician & Chief of Digital Innovation Seattle Mama Doc Seattle Childrens Hospital

Asynchronous Timed Session Types & Processes Laura Bocchi University of Kent Maurizio