The Perceptron Algorithm

Perceptron (Frank Rosenblatt, 1957)
• First learning algorithm for neural networks;
• Originally introduced for character classification, where each character is represented as an image.
Perceptron (contd.)

Total input to the output node: Σ_{j=1}^{n} w_j x_j

The output unit applies the Heaviside step function as its activation function:

H(x) = 1 if x ≥ 0
H(x) = 0 if x < 0

Perceptron: Learning Algorithm

• Goal: define a learning algorithm for the weights in order to compute a mapping from the inputs to the outputs.
• Example: a two-class character recognition problem.
  – Training set: a set of images, each representing either the character 'a' or the character 'b' (supervised learning);
  – Learning task: learn the weights so that when a new unlabelled image comes in, the network can predict its label;
  – Settings: class 'a' maps to 1 (class C1) and class 'b' maps to 0 (class C2); there are n input units (one per pixel intensity level) and 1 output unit, so the perceptron needs to learn a function f : ℝ^n → {0, 1}.
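To make the computation above concrete, here is a minimal sketch in Python of the perceptron's forward pass (not from the original slides; the function names are illustrative):

```python
# Minimal sketch of the perceptron forward pass described above:
# total input sum_j w_j * x_j, passed through the Heaviside function H.
# Function names (heaviside, predict) are illustrative.

def heaviside(x):
    """H(x) = 1 if x >= 0, else 0."""
    return 1 if x >= 0 else 0

def predict(weights, inputs):
    """Compute the perceptron output for one input vector."""
    total = sum(w * x for w, x in zip(weights, inputs))
    return heaviside(total)

print(predict([0.5, -0.3], [1.0, 1.0]))  # prints 1, since 0.2 >= 0
```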
Perceptron: Learning Algorithm

The algorithm proceeds as follows:
• Initial random setting of the weights;
• The input is a random sequence {x_k}, k ∈ ℕ;
• For each element of class C1: if the output is 1 (correct), do nothing; otherwise update the weights;
• For each element of class C2: if the output is 0 (correct), do nothing; otherwise update the weights.

Perceptron: Learning Algorithm

A bit more formally:

x = (x_1, x_2, ..., x_n)    w = (w_1, w_2, ..., w_n)
θ: threshold of the output unit
w^T x = w_1 x_1 + w_2 x_2 + ... + w_n x_n

The output is 1 if w^T x − θ ≥ 0.

To eliminate the explicit dependence on θ, augment the vectors as x̂ = (x_1, ..., x_n, 1) and ŵ = (w_1, ..., w_n, −θ). The output is then 1 if:

ŵ^T x̂ = Σ_{i=1}^{n+1} ŵ_i x̂_i ≥ 0
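As a quick sketch of this threshold-elimination trick (the helper names are hypothetical, not from the slides):

```python
# Sketch of absorbing the threshold theta into the weight vector:
# x_hat = (x, 1) and w_hat = (w, -theta), so that
# w_hat . x_hat >= 0  iff  w . x - theta >= 0.

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def augment_input(x):
    return list(x) + [1.0]           # x_hat = (x_1, ..., x_n, 1)

def augment_weights(w, theta):
    return list(w) + [-theta]        # w_hat = (w_1, ..., w_n, -theta)

w, theta, x = [2.0, -1.0], 0.5, [1.0, 3.0]
assert dot(augment_weights(w, theta), augment_input(x)) == dot(w, x) - theta
```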
Perceptron: Learning Algorithm

• We want to learn values of the weights so that the perceptron correctly discriminates elements of C1 from elements of C2.
• Given an input x, if x is classified correctly the weights are unchanged; otherwise:

w' = w + x   if an element of class C1 (label 1) was classified as in C2
w' = w − x   if an element of class C2 (label 0) was classified as in C1

Perceptron: Learning Algorithm

• 1st case: x ∈ C1 was classified as in C2.
The correct answer is 1, which corresponds to: ŵ^T x̂ ≥ 0
We have instead: ŵ^T x̂ < 0
We want to get closer to the correct answer, i.e. we want w^T x < w'^T x:

w^T x < w'^T x   iff   w^T x < (w + x)^T x
(w + x)^T x = w^T x + x^T x = w^T x + ‖x‖²

Because ‖x‖² ≥ 0, the condition is verified.
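A quick numeric check of this derivation (the example vectors are arbitrary, chosen only for illustration):

```python
# Verify that the update w' = w + x raises the score w.x by ||x||^2,
# exactly as derived above. Example vectors are arbitrary.

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

w = [1.0, -2.0]                      # current weights
x = [1.0, 1.0]                       # an element of C1, but dot(w, x) = -1 < 0

w_new = [wi + xi for wi, xi in zip(w, x)]
assert dot(w_new, x) == dot(w, x) + dot(x, x)   # score grows by ||x||^2 = 2
print(dot(w, x), "->", dot(w_new, x))           # -1.0 -> 1.0
```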
Perceptron: Learning Algorithm

w' = w + x   if an element of class C1 (label 1) was classified as in C2
w' = w − x   if an element of class C2 (label 0) was classified as in C1

• 2nd case: x ∈ C2 was classified as in C1.
The correct answer is 0, which corresponds to: ŵ^T x̂ < 0
We have instead: ŵ^T x̂ ≥ 0
We want to get closer to the correct answer, i.e. we want w^T x > w'^T x:

w^T x > w'^T x   iff   w^T x > (w − x)^T x
(w − x)^T x = w^T x − x^T x = w^T x − ‖x‖²

Because ‖x‖² ≥ 0, the condition is verified.

This update rule therefore moves the network closer to the correct answer whenever it makes an error.

Perceptron: Learning Algorithm

• In summary:
1. A random sequence x_1, x_2, ..., x_k, ... is generated, such that x_i ∈ C1 ∪ C2;
2. If x_k is correctly classified, then w_{k+1} = w_k; otherwise:

w_{k+1} = w_k + x_k   if x_k ∈ C1
w_{k+1} = w_k − x_k   if x_k ∈ C2
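Putting the pieces together, here is a minimal sketch of the full learning loop on augmented vectors (a sketch under the slides' conventions; the function name `train`, the epoch cap, and the stopping rule are illustrative choices, not from the slides):

```python
# Sketch of the perceptron learning rule summarized above:
# w <- w + x for a misclassified C1 example (label 1),
# w <- w - x for a misclassified C2 example (label 0),
# applied to inputs already augmented with a constant 1.

import random

def heaviside(x):
    return 1 if x >= 0 else 0

def train(samples, labels, max_epochs=100, seed=0):
    rng = random.Random(seed)
    w = [rng.uniform(-1, 1) for _ in range(len(samples[0]))]  # random init
    for _ in range(max_epochs):
        errors = 0
        for x, y in zip(samples, labels):
            out = heaviside(sum(wi * xi for wi, xi in zip(w, x)))
            if out == y:
                continue                         # correct: do nothing
            sign = 1 if y == 1 else -1           # +x for C1, -x for C2
            w = [wi + sign * xi for wi, xi in zip(w, x)]
            errors += 1
        if errors == 0:                          # no mistakes: stop
            break
    return w
```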
Perceptron: Learning Algorithm

Does the learning algorithm converge?

Convergence theorem: Regardless of the initial choice of weights, if the two classes are linearly separable, i.e. there exists ŵ such that

ŵ^T x̂ ≥ 0 if x ∈ C1
ŵ^T x̂ < 0 if x ∈ C2

then the learning rule will find such a solution after a finite number of steps.

Representational Power of Perceptrons

• Marvin Minsky and Seymour Papert, "Perceptrons", 1969: "The perceptron can solve only problems with linearly separable classes."
• Examples of linearly separable Boolean functions: AND and OR (see the perceptrons below).
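To illustrate the theorem, the training sketch above converges on a linearly separable problem such as the AND truth table (this usage example reuses the `train` and `heaviside` sketches from the previous block; inputs carry a trailing 1 for the threshold):

```python
# AND is linearly separable, so by the convergence theorem the
# learning rule finds a separating weight vector in finitely many steps.

X = [[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]]   # inputs, augmented with 1
y = [0, 0, 0, 1]                                    # AND labels (C1 = 1)

w = train(X, y)
assert [heaviside(sum(wi * xi for wi, xi in zip(w, x))) for x in X] == y
```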
Representational Power of Perceptrons

[Figure: two perceptrons with two inputs, each with input weights 1 and 1. With bias weight −1.5 the perceptron computes the AND function; with bias weight −0.5 it computes the OR function.]

Representational Power of Perceptrons

• Example of a Boolean function that is not linearly separable: EX-OR.
The EX-OR function cannot be computed by a perceptron.
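The figure's weights can be checked directly, and a brute-force search (illustrative, not a proof) finds no comparable weight choice that computes EX-OR:

```python
# Check the fixed perceptrons from the figure (weights 1, 1 and bias
# -1.5 for AND, bias -0.5 for OR), then search a coarse weight grid
# for an EX-OR perceptron; none exists, since EX-OR is not linearly
# separable.

from itertools import product

def perceptron(w1, w2, bias, x1, x2):
    return 1 if w1 * x1 + w2 * x2 + bias >= 0 else 0

for x1, x2 in product([0, 1], repeat=2):
    assert perceptron(1, 1, -1.5, x1, x2) == (x1 and x2)   # AND
    assert perceptron(1, 1, -0.5, x1, x2) == (x1 or x2)    # OR

grid = [i / 2 for i in range(-8, 9)]                        # -4.0 .. 4.0
assert not any(
    all(perceptron(w1, w2, b, x1, x2) == (x1 ^ x2)
        for x1, x2 in product([0, 1], repeat=2))
    for w1, w2, b in product(grid, grid, grid)
)
```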