The Perceptron CMSC 422 Marine Carpuat marine@cs.umd.edu Credit: figures by Piyush Rai and Hal Daume III
This week • Project 1 posted – Form teams! – Due Wed March 2nd by 2:59pm • A new model/algorithm – the perceptron – and its variants: voted, averaged • Fundamental Machine Learning Concepts – Online vs. batch learning – Error-driven learning
Geometry concept: Hyperplane • Separates a D-dimensional space into two half-spaces • Defined by an outward pointing normal vector 𝑤 ∈ ℝ^𝐷 – 𝑤 is orthogonal to any vector lying on the hyperplane • Hyperplane passes through the origin, unless we also define a bias term b
Binary classification via hyperplanes • Let’s assume that the decision boundary is a hyperplane • Then, training consists in finding a hyperplane with normal vector 𝑤 that separates positive from negative examples
Binary classification via hyperplanes • At test time, we check on what side of the hyperplane examples fall: 𝑦 = sign(𝑤^𝑇 𝑥 + 𝑏)
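The test-time rule on this slide can be sketched in plain Python (the function name `predict` is illustrative, not from the slides):

```python
def predict(w, b, x):
    """Classify x by which side of the hyperplane w·x + b = 0 it falls on.

    w: weight (normal) vector, b: bias term, x: feature vector.
    Returns +1 or -1, i.e. sign(w·x + b).
    """
    activation = sum(w_d * x_d for w_d, x_d in zip(w, x)) + b
    return 1 if activation >= 0 else -1
```

Points with positive activation lie on the side the normal vector 𝑤 points toward; the bias b shifts the hyperplane away from the origin.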
Function Approximation with Perceptron Problem setting • Set of possible instances 𝑋 – Each instance 𝑥 ∈ 𝑋 is a feature vector 𝑥 = [𝑥_1, … , 𝑥_𝐷] • Unknown target function 𝑓: 𝑋 → 𝑌 – 𝑌 is binary valued {-1; +1} • Set of function hypotheses 𝐻 = {ℎ | ℎ: 𝑋 → 𝑌} – Each hypothesis ℎ is a hyperplane in D-dimensional space Input • Training examples {(𝑥_1, 𝑦_1), … , (𝑥_𝑁, 𝑦_𝑁)} of unknown target function 𝑓 Output • Hypothesis ℎ ∈ 𝐻 that best approximates target function 𝑓
Perceptron: Prediction Algorithm
Aside: biological inspiration Analogy: the perceptron as a neuron
Perceptron Training Algorithm
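The training algorithm from this slide's figure is not reproduced in this extraction; a minimal sketch of the standard perceptron loop (function and variable names are illustrative):

```python
def perceptron_train(data, max_iter=10):
    """Perceptron training: data is a list of (x, y) pairs with y in {-1, +1}.

    Online: examples are processed one at a time.
    Error-driven: w and b change only when the current model errs.
    """
    D = len(data[0][0])
    w, b = [0.0] * D, 0.0
    for _ in range(max_iter):
        for x, y in data:
            activation = sum(w_d * x_d for w_d, x_d in zip(w, x)) + b
            if y * activation <= 0:  # mistake (wrong side, or on the boundary)
                w = [w_d + y * x_d for w_d, x_d in zip(w, x)]
                b += y
    return w, b
```

Each mistake nudges the hyperplane toward the misclassified example: adding 𝑦·𝑥 to 𝑤 increases 𝑦(𝑤·𝑥 + 𝑏) for that example.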
Properties of the Perceptron training algorithm • Online – We look at one example at a time, and update the model as soon as we make an error – As opposed to batch algorithms that update parameters after seeing the entire training set • Error-driven – We only update parameters/model if we make an error
Perceptron update: geometric interpretation
Practical considerations • The order of training examples matters! – Random is better • Early stopping – Good strategy to avoid overfitting • Simple modifications dramatically improve performance – voting or averaging
Predicting with • The voted perceptron • The averaged perceptron • Both require keeping track of the “survival time” of each weight vector
How would you modify this algorithm for voted perceptron?
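One possible answer, sketched in plain Python (names are illustrative): instead of discarding a weight vector when it is updated, retire it along with its survival count c, and let every retired vector cast c votes at test time.

```python
def voted_perceptron_train(data, max_iter=10):
    """Voted perceptron: keep every weight vector with its survival count c."""
    D = len(data[0][0])
    w, b, c = [0.0] * D, 0.0, 0
    vectors = []
    for _ in range(max_iter):
        for x, y in data:
            activation = sum(w_d * x_d for w_d, x_d in zip(w, x)) + b
            if y * activation <= 0:
                vectors.append((w, b, c))  # retire the current vector
                w = [w_d + y * x_d for w_d, x_d in zip(w, x)]
                b, c = b + y, 1
            else:
                c += 1  # current vector survived one more example
    vectors.append((w, b, c))
    return vectors

def voted_predict(vectors, x):
    """Each stored vector casts c votes for its own sign prediction."""
    total = 0
    for w, b, c in vectors:
        s = sum(w_d * x_d for w_d, x_d in zip(w, x)) + b
        total += c * (1 if s >= 0 else -1)
    return 1 if total >= 0 else -1
```

The cost is that prediction now iterates over all stored vectors, which motivates the averaged variant.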
How would you modify this algorithm for averaged perceptron?
Averaged perceptron decision rule can be rewritten as
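The equation on this slide is not in the extraction; the standard rewrite, with $c_k$ the survival count of weight vector $\boldsymbol{w}_k$, is:

$$\hat{y} = \operatorname{sign}\Big(\sum_{k=1}^{K} c_k \,(\boldsymbol{w}_k \cdot \boldsymbol{x} + b_k)\Big) = \operatorname{sign}\Big(\Big(\sum_{k=1}^{K} c_k \boldsymbol{w}_k\Big) \cdot \boldsymbol{x} + \sum_{k=1}^{K} c_k b_k\Big)$$

Because the sum moves inside the sign, the weighted sum of weight vectors can be precomputed once, so prediction costs the same as a single perceptron.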
Averaged Perceptron Training
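The training figure is not in this extraction; a minimal sketch (names illustrative) that accumulates the running sum of weight vectors, which is equivalent to weighting each vector by its survival time:

```python
def averaged_perceptron_train(data, max_iter=10):
    """Averaged perceptron: return the survival-time-weighted sum of all
    weight vectors seen during training (scale does not affect sign())."""
    D = len(data[0][0])
    w, b = [0.0] * D, 0.0
    w_sum, b_sum = [0.0] * D, 0.0
    for _ in range(max_iter):
        for x, y in data:
            activation = sum(w_d * x_d for w_d, x_d in zip(w, x)) + b
            if y * activation <= 0:  # mistake: same update as the plain perceptron
                w = [w_d + y * x_d for w_d, x_d in zip(w, x)]
                b += y
            # adding w at every step weights each vector by how long it survives
            w_sum = [s_d + w_d for s_d, w_d in zip(w_sum, w)]
            b_sum += b
    return w_sum, b_sum
```

Unlike the voted perceptron, only one vector is stored, and prediction is a single dot product.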
Can the perceptron always find a hyperplane to separate positive from negative examples?