Naïve Bayes

  1. Classification, Linear Models, Naïve Bayes CMSC 470 Marine Carpuat Slides credit: Dan Jurafsky & James Martin, Jacob Eisenstein

  2. Today • Text classification problems • and their evaluation • Linear classifiers • Features & Weights • Bag of words • Naïve Bayes

  3. Classification problems

  4. Multiclass Classification [Diagram: at training time, labeled documents (label 1 through label 4) are mapped by feature functions into inputs for a supervised machine learning algorithm, which produces a classifier; at test time, the classifier predicts a label for each unlabeled document.]

  5. Is this spam? From: "Fabian Starr" <Patrick_Freeman@pamietaniepeerelu.pl> Subject: Hey! Sofware for the funny prices! Get the great discounts on popular software today for PC and Macintosh http://iiled.org/Cj4Lmx 70-90% Discounts from retail price!!! All sofware is instantly available to download - No Need Wait!

  6. What is the subject of this article? [Diagram: a MEDLINE article to be assigned a category from the MeSH subject category hierarchy] • Antagonists and Inhibitors • Blood Supply • Chemistry • Drug Therapy • Embryology • Epidemiology • …

  7. Text Classification • Assigning subject categories, topics, or genres • Spam detection • Authorship identification • Age/gender identification • Language Identification • Sentiment analysis • …

  8. Text Classification: definition • Input: a document d and a fixed set of classes Y = {y1, y2, …, yJ} • Output: a predicted class y ∈ Y
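For instance, in the spam example above, Y = {spam, not spam} and d is the email itself.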

  9. Classification Methods: Supervised Machine Learning • Input: a document d; a fixed set of classes Y = {y1, y2, …, yJ}; a training set of m hand-labeled documents (d1, y1), …, (dm, ym) • Output: a learned classifier d → y

  10. Aside: getting examples for supervised learning • Human annotation • By experts or non-experts (crowdsourcing) • Found data • How do we know how good a classifier is? • Compare classifier predictions with human annotation • On held out test examples • Evaluation metrics: accuracy, precision, recall

  11. The 2-by-2 contingency table

                     correct   not correct
      selected       tp        fp
      not selected   fn        tn

  12. Precision and recall • Precision: % of selected items that are correct • Recall: % of correct items that are selected

                     correct   not correct
      selected       tp        fp
      not selected   fn        tn
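In terms of the table cells, these definitions become Precision = tp / (tp + fp) and Recall = tp / (tp + fn).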

  13. A combined measure: F • A combined measure that assesses the P/R tradeoff is the F measure (weighted harmonic mean):

      F = 1 / (α(1/P) + (1 − α)(1/R)) = (β² + 1)PR / (β²P + R)

      • People usually use the balanced F1 measure, i.e., with β = 1 (that is, α = ½): F = 2PR / (P + R)
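As a quick check (numbers invented for illustration): with P = 0.5 and R = 0.25, F1 = 2 · 0.5 · 0.25 / (0.5 + 0.25) ≈ 0.33, which is lower than the arithmetic mean of 0.375; the harmonic mean penalizes imbalance between precision and recall.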

  14. Linear Models for Multiclass Classification

  15. Linear Models for Classification • Feature function representation • Weights
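The equations from this slide were lost in extraction; in a standard formulation of this setup (my reconstruction, not verbatim from the slide), each candidate label y is scored with a weight vector θ and a feature function f:

      ŷ = argmax_y θ · f(x, y)

where f(x, y) maps an input/label pair to a feature vector and θ holds one learned weight per feature.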

  16. Defining features: Bag of words
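As a concrete illustration (a minimal sketch, not code from the slides), a bag-of-words representation keeps only how often each word occurs, discarding word order:

```python
from collections import Counter

def bag_of_words(document):
    """Map a document to word-count features, ignoring word order."""
    tokens = document.lower().split()   # naive whitespace tokenization
    return Counter(tokens)

print(bag_of_words("Get the great discounts on popular software today"))
# Counter({'get': 1, 'the': 1, 'great': 1, 'discounts': 1, ...})
```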

  17. Defining features

  18. Linear Classification

  19. Linear Models for Classification • Feature function representation • Weights

  20. How can we learn weights? • By hand • Probability: e.g., Naïve Bayes • Discriminative training: e.g., perceptron, support vector machines

  21. Naïve Bayes Models for Text Classification

  22. Generative Story for Multinomial Naïve Bayes • A hypothetical stochastic process describing how training examples are generated
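A sketch of that process (the class priors and word distributions below are made up purely for illustration): first draw a label y from the class prior P(Y), then draw each word of the document independently from the class-conditional distribution P(W | Y = y):

```python
import random

# Hypothetical parameters, for illustration only
prior = {"spam": 0.4, "ham": 0.6}                       # P(Y)
word_dist = {                                           # P(W | Y)
    "spam": {"discount": 0.5, "software": 0.3, "meeting": 0.2},
    "ham":  {"discount": 0.1, "software": 0.3, "meeting": 0.6},
}

def generate_document(length=5):
    """Sample one (label, words) pair from the Naive Bayes generative story."""
    y = random.choices(list(prior), weights=list(prior.values()))[0]
    dist = word_dist[y]
    words = random.choices(list(dist), weights=list(dist.values()), k=length)
    return y, words
```

Training reverses this story: given observed (label, words) pairs, estimate the prior and the per-class word distributions.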

  23. Prediction with Naïve Bayes • Score(x, y) • Definition of conditional probability • Generative story assumptions • This is a linear model!
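The equations on this slide did not survive extraction; the standard derivation its labels refer to runs as follows (reconstructed, not verbatim). By the definition of conditional probability (Bayes rule, dropping the constant P(x)):

      ŷ = argmax_y P(y | x) = argmax_y P(y) P(x | y)

The generative story's independence assumption factors P(x | y) into a product over word positions, so taking logs:

      Score(x, y) = log P(y) + Σ_w count(w, x) · log P(w | y)

which is linear in the bag-of-words counts; hence "this is a linear model!"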

  26. Parameter Estimation • “count and normalize” • Parameters of a multinomial distribution • Relative frequency estimator • Formally: this is the maximum likelihood estimate • See CIML for derivation
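Concretely, the relative-frequency ("count and normalize") estimates for multinomial Naïve Bayes are:

      P̂(y) = count(y) / m
      P̂(w | y) = count(w, y) / Σ_w′ count(w′, y)

where m is the number of training documents, count(y) is the number of documents with label y, and count(w, y) is the number of occurrences of word w in documents labeled y.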

  27. Smoothing (add alpha)
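The smoothed estimate itself did not survive extraction; the usual add-α version adds a pseudo-count of α to every word:

      P̂(w | y) = (count(w, y) + α) / (Σ_w′ count(w′, y) + α|V|)

where |V| is the vocabulary size, so words never seen with class y get a small but nonzero probability. A minimal sketch putting estimation, smoothing, and prediction together (illustrative code, not from the slides; function names are my own):

```python
import math
from collections import Counter, defaultdict

def train_nb(docs, labels, alpha=1.0):
    """'Count and normalize' with add-alpha smoothing.

    docs: list of token lists; labels: parallel list of class labels.
    """
    label_counts = Counter(labels)
    word_counts = defaultdict(Counter)   # per-class word counts
    vocab = set()
    for doc, y in zip(docs, labels):
        word_counts[y].update(doc)
        vocab.update(doc)
    m, V = len(docs), len(vocab)
    log_prior = {y: math.log(c / m) for y, c in label_counts.items()}
    log_lik = {}
    for y in label_counts:
        total = sum(word_counts[y].values())
        log_lik[y] = {w: math.log((word_counts[y][w] + alpha) / (total + alpha * V))
                      for w in vocab}
    return log_prior, log_lik

def predict(doc, log_prior, log_lik):
    """argmax_y of log P(y) + sum over in-vocabulary words of log P(w | y)."""
    def score(y):
        return log_prior[y] + sum(log_lik[y][w] for w in doc if w in log_lik[y])
    return max(log_prior, key=score)
```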

  28. Naïve Bayes recap

  29. Why is this model called "Naïve Bayes"? • Another view of the same model:

      ŷ = argmax_y P(Y = y | X = x)
        = argmax_y P(Y = y) P(X = x | Y = y)
        = argmax_y P(Y = y) ∏_{j=1..d} P(X_j = x_j | Y = y)

      • Bayes rule (the "Bayes") + conditional independence assumption (the "naïve")

  30. Today • Text classification problems • and their evaluation • Linear classifiers • Features & Weights • Bag of words • Naïve Bayes
