Support Vector Machines (SVMs)
Lecture 3
David Sontag, New York University
Slides adapted from Luke Zettlemoyer, Vibhav Gogate, and Carlos Guestrin
Geometry of linear separators (see blackboard)

A plane can be specified as the set of all points given by:
• a vector from the origin to a point in the plane, plus
• two non-parallel directions in the plane.

Alternatively, it can be specified by:
• a normal vector (we will call this w)
• the dot product with that normal, a scalar; only this needs to be specified (we will call it the offset, b)

Barber, Section A.1.1-4
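As an illustrative aside (not on the original slide), the two specifications can be written out as follows, where p is a point in the plane and u_1, u_2 are the two non-parallel directions (these symbol names are assumptions), using the offset convention w . x + b = 0 that appears later in the lecture:

```latex
% Parametric form: a point in the plane plus two non-parallel directions
\{\, p + \alpha u_1 + \beta u_2 \;:\; \alpha, \beta \in \mathbb{R} \,\}
% Normal form: only the normal vector w and the scalar offset b are needed
\{\, x \;:\; w \cdot x + b = 0 \,\}
```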
Linear Separators
• If the training data is linearly separable, the perceptron is guaranteed to find some linear separator.
• Which of these is optimal?
Support Vector Machine (SVM)
• SVMs (Vapnik, 1990s) choose the linear separator with the largest margin. Robust to outliers!
• Good according to intuition, theory, and practice.
• SVMs became famous when, using images as input, they gave accuracy comparable to neural networks with hand-designed features on a handwriting recognition task.
Support vector machines: 3 key ideas
1. Use optimization to find a solution (i.e., a hyperplane) with few errors
2. Seek a large margin separator to improve generalization
3. Use the kernel trick to make large feature spaces computationally efficient
Finding a perfect classifier (when one exists) using linear programming

[Figure: separating hyperplane w . x + b = 0, with margin boundaries w . x + b = +1 and w . x + b = -1]

For every data point (x_t, y_t), enforce the constraint w . x_t + b ≥ 1 for y_t = +1, and w . x_t + b ≤ -1 for y_t = -1.

Equivalently, we want to satisfy all of the linear constraints y_t (w . x_t + b) ≥ 1.

This linear program can be efficiently solved using algorithms such as simplex, interior point, or ellipsoid.
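As a minimal sketch of the feasibility problem above (not the lecture's code; the function name and the use of scipy's LP solver are assumptions), one way to search for any perfect linear classifier:

```python
# Feasibility LP: find (w, b) with  y_t (w . x_t + b) >= 1  for all t.
# Variable vector is z = [w_1, ..., w_d, b]; the objective is zero (pure feasibility).
import numpy as np
from scipy.optimize import linprog

def separating_hyperplane(X, y):
    """X: (n, d) data matrix, y: (n,) labels in {-1, +1}. Returns (w, b) or None."""
    n, d = X.shape
    # y_t (w . x_t + b) >= 1   <=>   -y_t * [x_t, 1] . z <= -1
    A_ub = -y[:, None] * np.hstack([X, np.ones((n, 1))])
    b_ub = -np.ones(n)
    res = linprog(c=np.zeros(d + 1), A_ub=A_ub, b_ub=b_ub,
                  bounds=[(None, None)] * (d + 1), method="highs")
    if not res.success:
        return None  # no feasible point: data not linearly separable
    return res.x[:d], res.x[d]
```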
Finding a perfect classifier (when one exists) using linear programming

[Figure: weight space; example of a 2-dimensional linear programming (feasibility) problem]

For SVMs, each data point gives one inequality: y_t (w . x_t + b) ≥ 1.

What happens if the data set is not linearly separable?
Minimizing the number of errors (0-1 loss)
• Try to find weights that violate as few constraints as possible, i.e. minimize #(mistakes).
• Formalize this using the 0-1 loss: minimize over (w, b) the sum Σ_t ℓ_{0-1}( y_t (w . x_t + b) ), where ℓ_{0-1}(z) = 1 if z ≤ 0 and 0 otherwise.
• Unfortunately, minimizing the 0-1 loss is NP-hard in the worst case. Non-starter: we need another approach.
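A small illustrative snippet (the names X, y, w, b are assumptions, carried over from the sketch above) showing how #(mistakes), i.e. the 0-1 loss summed over the training set, would be counted:

```python
# Count mistakes of the linear classifier sign(w . x + b) on labeled data.
import numpy as np

def zero_one_loss(X, y, w, b):
    """Number of points on the wrong side of (or on) the hyperplane."""
    margins = y * (X @ w + b)         # functional margins y_t (w . x_t + b)
    return int(np.sum(margins <= 0))  # a mistake whenever the margin is not positive
```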
Key idea #1: Allow for slack

[Figure: hyperplane w . x + b = 0 with margin boundaries w . x + b = ±1; points violating the margin are labeled with their slacks ξ_1, ξ_2, ξ_3, ξ_4]

minimize over (w, b, ξ) the sum Σ_j ξ_j, subject to y_j (w . x_j + b) ≥ 1 - ξ_j and ξ_j ≥ 0 for all j ("slack variables")

We now have a linear program again, and can efficiently find its optimum.

For each data point:
• If the functional margin is ≥ 1, don't care
• If the functional margin is < 1, pay a linear penalty
Key idea #1: Allow for slack

[Same figure and linear program as the previous slide]

What is the optimal value ξ_j* as a function of w* and b*?
• If y_j (w* . x_j + b*) ≥ 1, then ξ_j = 0
• If y_j (w* . x_j + b*) < 1, then ξ_j = 1 - y_j (w* . x_j + b*)

Sometimes written as ξ_j = max(0, 1 - y_j (w* . x_j + b*)) = [1 - y_j (w* . x_j + b*)]_+
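A minimal sketch of the slack-variable linear program above, again assuming scipy's LP solver (the function name and the variable layout are illustrative choices, not from the lecture):

```python
# Slack-variable LP:  min_{w,b,xi} sum_j xi_j
#                     s.t. y_j (w . x_j + b) >= 1 - xi_j,  xi_j >= 0.
# Variable vector is z = [w_1, ..., w_d, b, xi_1, ..., xi_n].
import numpy as np
from scipy.optimize import linprog

def slack_lp(X, y):
    n, d = X.shape
    # Constraint rows:  -y_j * [x_j, 1] . [w, b] - xi_j <= -1
    A_ub = np.hstack([-y[:, None] * np.hstack([X, np.ones((n, 1))]), -np.eye(n)])
    b_ub = -np.ones(n)
    c = np.concatenate([np.zeros(d + 1), np.ones(n)])    # objective: sum of slacks
    bounds = [(None, None)] * (d + 1) + [(0, None)] * n  # xi_j >= 0, w and b free
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        return None
    return res.x[:d], res.x[d], res.x[d + 1:]  # (w, b, xi)
```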
Equivalent hinge loss formulation

Substituting the optimal slack ξ_j = max(0, 1 - y_j (w . x_j + b)) into the objective Σ_j ξ_j, we get:

minimize over (w, b) the sum Σ_j max(0, 1 - y_j (w . x_j + b))

The hinge loss is defined as ℓ_hinge(z) = max(0, 1 - z).

This is empirical risk minimization, using the hinge loss.
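A short sketch of the resulting empirical risk (function and variable names are illustrative, not from the lecture):

```python
# Hinge-loss objective obtained by substituting out the slacks.
import numpy as np

def hinge_loss(z):
    """l_hinge(z) = max(0, 1 - z), applied elementwise to margins z."""
    return np.maximum(0.0, 1.0 - z)

def empirical_hinge_risk(X, y, w, b):
    """Sum of hinge losses over the training set, i.e. the LP objective sum_j xi_j*."""
    return float(np.sum(hinge_loss(y * (X @ w + b))))
```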
Hinge loss vs. 0/1 loss

[Figure: hinge loss max(0, 1 - z) and 0-1 loss plotted against the margin z = y (w . x + b)]

Hinge loss upper bounds the 0/1 loss! It is the tightest convex upper bound on the 0/1 loss.
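A tiny numerical check of the upper-bound claim (illustrative, not from the slides):

```python
# At every margin value z, the hinge loss max(0, 1 - z) is at least
# the 0-1 loss 1[z <= 0].
import numpy as np

z = np.linspace(-3.0, 3.0, 601)         # margins y (w . x + b)
hinge = np.maximum(0.0, 1.0 - z)
zero_one = (z <= 0).astype(float)
assert np.all(hinge >= zero_one)        # hinge is an upper bound everywhere
```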
Key idea #2: seek large margin