INF4820 Algorithms for AI and NLP: Evaluating Classifiers


  1. INF4820: Algorithms for AI and NLP. Evaluating Classifiers; Clustering. Erik Velldal & Stephan Oepen, Language Technology Group (LTG), September 23, 2015

  2. Agenda
     Last week
     ◮ Supervised vs. unsupervised learning.
     ◮ Vector space classification.
     ◮ How to represent classes and class membership.
     ◮ Rocchio + kNN.
     ◮ Linear vs. non-linear decision boundaries.
     Today
     ◮ Evaluation of classifiers.
     ◮ Unsupervised machine learning for class discovery: clustering.
     ◮ Flat vs. hierarchical clustering.
     ◮ k-means clustering.
     ◮ Vector space quiz.

  3. Testing a classifier
     ◮ Vector space classification amounts to computing the boundaries in the space that separate the class regions: the decision boundaries.
     ◮ To evaluate a boundary, we measure the number of correct classification predictions on unseen test items.
     ◮ There are many ways to do this.
     ◮ We want to test how well a model generalizes on a held-out test set.
     ◮ Labeled test data is sometimes referred to as the gold standard.
     ◮ Why can't we test on the training data?
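The slides do not spell out how the held-out set is produced; below is a minimal sketch in Python (used for all examples here purely for illustration, not prescribed by the course) of splitting labeled data into a training portion and a held-out test portion. The word/label pairs are invented toy data.

```python
import random

def train_test_split(labeled_items, test_fraction=0.2, seed=42):
    """Shuffle the labeled data and split off a held-out test portion.

    The classifier is trained on the first list only; its quality is then
    measured on the second list, which it has never seen during training.
    """
    items = list(labeled_items)
    random.Random(seed).shuffle(items)
    cut = int(len(items) * (1 - test_fraction))
    return items[:cut], items[cut:]

# Invented toy (word, gold label) pairs.
data = [("potato", "food"), ("embassy", "institution"), ("paris", "place_name"),
        ("professor", "title"), ("bread", "food"), ("college", "institution")]
train, test = train_test_split(data, test_fraction=0.33)
print(len(train), len(test))   # 4 2
```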

  4. Example: Evaluating classifier decisions
     ◮ Predictions for a given class can be wrong or correct in two ways:

                                 gold = positive         gold = negative
       prediction = positive     true positive (TP)      false positive (FP)
       prediction = negative     false negative (FN)     true negative (TN)

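As a sketch (not part of the slides), the four counts can be tallied from parallel lists of gold and predicted labels; the label sequences below are invented so that they reproduce the counts used in the worked example on the next slide (1 TP, 1 FP, 2 FN, 6 TN over N = 10).

```python
def confusion_counts(gold, predicted, target_class):
    """Tally TP, FP, FN, TN for one class from parallel lists of labels."""
    tp = fp = fn = tn = 0
    for g, p in zip(gold, predicted):
        if p == target_class and g == target_class:
            tp += 1
        elif p == target_class and g != target_class:
            fp += 1
        elif p != target_class and g == target_class:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn

# Invented label sequences matching the worked example on the next slide.
gold      = ["A", "A", "A", "B", "B", "B", "B", "B", "B", "B"]
predicted = ["A", "B", "B", "A", "B", "B", "B", "B", "B", "B"]
print(confusion_counts(gold, predicted, "A"))   # (1, 1, 2, 6)
```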

  5. Example: Evaluating classifier decisions
     accuracy = (TP + TN) / N = (1 + 6) / 10 = 0.7
     precision = TP / (TP + FP) = 1 / (1 + 1) = 0.5
     recall = TP / (TP + FN) = 1 / (1 + 2) = 0.33
     F-score = (2 × precision × recall) / (precision + recall) = 0.4

  6. Evaluation measures
     ◮ accuracy = (TP + TN) / N = (TP + TN) / (TP + TN + FP + FN)
       ◮ The ratio of correct predictions.
       ◮ Not suitable for unbalanced numbers of positive / negative examples.
     ◮ precision = TP / (TP + FP)
       ◮ The fraction of detected class members that were correct.
     ◮ recall = TP / (TP + FN)
       ◮ The fraction of actual class members that were detected.
       ◮ Trade-off: predicting positive for all examples would give 100% recall but (typically) terrible precision.
     ◮ F-score = (2 × precision × recall) / (precision + recall)
       ◮ Balanced measure of precision and recall (harmonic mean).
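A small sketch of the four measures as defined above, applied to the counts from the worked example (TP = 1, TN = 6, FP = 1, FN = 2); the zero-division guards are an addition not discussed on the slides.

```python
def evaluate(tp, tn, fp, fn):
    """Accuracy, precision, recall and F-score from the four counts."""
    n = tp + tn + fp + fn
    accuracy  = (tp + tn) / n
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall    = tp / (tp + fn) if tp + fn else 0.0
    f_score   = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return accuracy, precision, recall, f_score

# The worked example above: TP=1, TN=6, FP=1, FN=2.
print(evaluate(1, 6, 1, 2))   # (0.7, 0.5, 0.333..., 0.4)
```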

  7. Evaluating multi-class predictions
     Macro-averaging
     ◮ Compute precision and recall separately for each class, then average these values across classes.
     ◮ The macro average will be highly influenced by the small classes.
     Micro-averaging
     ◮ Sum TPs, FPs, and FNs for all points/objects across all classes, then compute global precision and recall from these totals.
     ◮ The micro average will be highly influenced by the large classes.
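A sketch of the two averaging schemes, assuming per-class (TP, FP, FN) counts are already available; the two-class counts at the bottom are invented to show how the large class dominates the micro average while the small class pulls down the macro average.

```python
def macro_micro(per_class):
    """per_class: dict mapping class name -> (TP, FP, FN).

    Macro: average the per-class precision/recall values.
    Micro: sum the counts first, then compute precision/recall once.
    """
    precisions, recalls = [], []
    tp_sum = fp_sum = fn_sum = 0
    for tp, fp, fn in per_class.values():
        precisions.append(tp / (tp + fp) if tp + fp else 0.0)
        recalls.append(tp / (tp + fn) if tp + fn else 0.0)
        tp_sum += tp
        fp_sum += fp
        fn_sum += fn
    macro_p = sum(precisions) / len(precisions)
    macro_r = sum(recalls) / len(recalls)
    micro_p = tp_sum / (tp_sum + fp_sum)
    micro_r = tp_sum / (tp_sum + fn_sum)
    return (macro_p, macro_r), (micro_p, micro_r)

# Invented counts: one large class and one small class.
counts = {"large": (90, 10, 10), "small": (1, 4, 4)}
print(macro_micro(counts))   # macro ~ (0.55, 0.55), micro ~ (0.87, 0.87)
```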

  8. A note on obligatory assignment 2b
     ◮ Builds on oblig 2a: vector space representation of a set of words based on BoW features extracted from a sample of the Brown corpus.
     ◮ For 2b we'll provide class labels for most of the words.
     ◮ Train a Rocchio classifier to predict labels for a set of unlabeled words (see the sketch below).

       Label          Examples
       food           potato, food, bread, fish, eggs, ...
       institution    embassy, institute, college, government, school, ...
       title          president, professor, dr, governor, doctor, ...
       place_name     italy, dallas, france, america, england, ...
       person_name    lizzie, david, bill, howard, john, ...
       unknown        department, egypt, robert, butter, senator, ...
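The Rocchio classifier itself was covered last week; a minimal sketch is included here for reference, assuming the BoW feature vectors from oblig 2a have already been built (the toy 2-d vectors and labels below are invented).

```python
import numpy as np

def train_rocchio(vectors, labels):
    """Rocchio training: one centroid (mean vector) per class."""
    centroids = {}
    for c in set(labels):
        members = np.array([v for v, l in zip(vectors, labels) if l == c])
        centroids[c] = members.mean(axis=0)
    return centroids

def classify(vector, centroids):
    """Assign the class whose centroid is most similar (cosine) to the vector."""
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    return max(centroids, key=lambda c: cos(vector, centroids[c]))

# Toy 2-d "feature vectors" standing in for the BoW vectors of oblig 2a.
X = [np.array([1.0, 0.0]), np.array([0.9, 0.1]), np.array([0.0, 1.0])]
y = ["food", "food", "title"]
model = train_rocchio(X, y)
print(classify(np.array([0.8, 0.2]), model))   # "food"
```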

  9. A note on obligatory assignment 2b
     ◮ For a given set of objects {o_1, ..., o_m}, the proximity matrix R is a square m × m matrix where R_ij stores the proximity of o_i and o_j.
     ◮ For our word space, R_ij would give the dot product of the normalized feature vectors x_i and x_j representing the words o_i and o_j.
     ◮ Note that if our similarity measure sim is symmetric, i.e. sim(x, y) = sim(y, x), then R will also be symmetric, i.e. R_ij = R_ji.
     ◮ Computing all the pairwise similarities once and then storing them in R can save time in many applications.
     ◮ R will provide the input to many clustering methods.
     ◮ By sorting the row elements of R, we get access to an important type of similarity relation: nearest neighbors.
     ◮ For 2b we will implement a proximity matrix for retrieving kNN relations (see the sketch below).
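A minimal sketch of such a proximity matrix of normalized dot products, plus kNN retrieval by sorting a row; the toy vectors are invented, and the data structures and interface will differ in the actual 2b code.

```python
import numpy as np

def proximity_matrix(vectors):
    """R[i][j] = dot product of the length-normalized vectors i and j."""
    X = np.array(vectors, dtype=float)
    X /= np.linalg.norm(X, axis=1, keepdims=True)   # normalize each row
    return X @ X.T                                   # symmetric m x m matrix

def nearest_neighbours(R, i, k):
    """Indices of the k objects most similar to object i (excluding i itself)."""
    order = np.argsort(-R[i])          # sort row i by decreasing proximity
    return [j for j in order if j != i][:k]

# Toy vectors standing in for the word-space vectors of the assignment.
R = proximity_matrix([[1, 0, 1], [1, 0, 0.8], [0, 1, 0]])
print(nearest_neighbours(R, 0, 2))     # [1, 2]
```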

  10. Two categorization tasks in machine learning
      Classification
      ◮ Supervised learning, requiring labeled training data.
      ◮ Given a training set of examples with class labels, train a classifier to predict the class labels of new objects.
      Clustering
      ◮ Unsupervised learning from unlabeled data.
      ◮ Automatically group similar objects together.
      ◮ No pre-defined classes: we only specify the similarity measure.
      ◮ "The search for structure in data" (Bezdek, 1981)
      ◮ General objective: partition the data into subsets so that the similarity among members of the same group is high (homogeneity) while the similarity between the groups themselves is low (heterogeneity).
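k-means, listed on today's agenda, is one concrete way to pursue this objective; the plain Euclidean version sketched below is a generic textbook formulation, not necessarily the exact variant presented in the lectures.

```python
import random
import numpy as np

def kmeans(X, k, iterations=20, seed=0):
    """Plain k-means: alternate between assigning points to the nearest
    centroid and recomputing each centroid as the mean of its cluster."""
    X = np.array(X, dtype=float)
    rng = random.Random(seed)
    centroids = X[rng.sample(range(len(X)), k)]      # random initial centroids
    for _ in range(iterations):
        # Assignment step: each point goes to its closest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        assignment = dists.argmin(axis=1)
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = X[assignment == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return assignment, centroids

labels, centres = kmeans([[0, 0], [0, 1], [10, 10], [10, 11]], k=2)
print(labels)    # two clusters, e.g. [0 0 1 1]
```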

  11. Example applications of cluster analysis
      ◮ Visualization and exploratory data analysis.
      ◮ Many applications within IR. Examples:
        ◮ Speeding up search: first retrieve the most relevant cluster, then retrieve documents from within that cluster.
        ◮ Presenting search results: instead of ranked lists, organize the results as clusters.
      ◮ Dimensionality reduction / class-based features.
