Lecture 3: Linear Classification
Fei-Fei Li & Andrej Karpathy (CS231n), 7 Jan 2015


  1. Lecture 3: Linear Classification

  2. Last time: Image Classification. Assume a given set of discrete labels {dog, cat, truck, plane, ...}; the task is to assign one of them to the image (here: cat).

  3. k-Nearest Neighbor: compare test images against the training set.

  4. Linear Classification. 1. Define a score function mapping an image to class scores.

  5. Linear Classification. 1. Define a score function f(x, W, b) = Wx + b mapping the data x (an image) to class scores; W are the “weights”, b is the “bias vector”, and together they are the “parameters”.

  6. Linear Classification. 1. Define a score function. The data (image) x is [3072 x 1] (assume the CIFAR-10 example, so 32 x 32 x 3 images and 10 classes); f(x, W, b) = Wx + b gives the class scores.

  7. Linear Classification. 1. Define a score function. With the CIFAR-10 shapes: class scores [10 x 1] = weights W [10 x 3072] times data x [3072 x 1] plus bias vector b [10 x 1].
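A minimal numpy sketch of this score function with the CIFAR-10 shapes (the function and variable names are mine, not from the slides):

```python
import numpy as np

def scores(W, x, b):
    """Linear score function f(x, W, b) = W x + b.

    CIFAR-10 shapes: x is [3072 x 1] (a 32x32x3 image stretched
    into a column), W is [10 x 3072], b is [10 x 1], so the
    output is [10 x 1]: one score per class.
    """
    return W.dot(x) + b

# Shape check with random CIFAR-10-sized inputs:
rng = np.random.default_rng(0)
W = rng.standard_normal((10, 3072))
x = rng.standard_normal((3072, 1))
b = rng.standard_normal((10, 1))
assert scores(W, x, b).shape == (10, 1)
```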

  8. Linear Classification

  9. Interpreting a Linear Classifier. Question: what can a linear classifier do?

  10. Interpreting a Linear Classifier. Example: classifiers trained on CIFAR-10.

  11. Interpreting a Linear Classifier

  12. Bias trick
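A small sketch of the bias trick (variable names are mine): append a constant 1 to x and the bias column to W, so a single matrix multiply computes Wx + b.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((10, 3072))   # weights
b = rng.standard_normal((10, 1))      # bias vector
x = rng.standard_normal((3072, 1))    # image

# Fold the bias into the weight matrix...
W_ext = np.hstack([W, b])             # [10 x 3073]
# ...and append a constant 1 to the data vector.
x_ext = np.vstack([x, [[1.0]]])       # [3073 x 1]

# One matrix multiply now does the work of W.dot(x) + b:
assert np.allclose(W_ext.dot(x_ext), W.dot(x) + b)
```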

  13. So far: we defined a (linear) score function f(x, W, b) = Wx + b.

  14. 2. Define a loss function (or cost function, or objective).

  15. 2. Define a loss function (or cost function, or objective): a function mapping (scores, label) to a loss.

  16. 2. Define a loss function (or cost function, or objective): a function mapping (scores, label) to a loss. Example:

  17. 2. Define a loss function (or cost function, or objective): a function mapping (scores, label) to a loss. Example. Question: if you were to assign a single number to how “unhappy” you are with these scores, what would you do?

  18. 2. Define a loss function (or cost function, or objective). One (of many) ways to do it: the Multiclass SVM loss.

  19. 2. Define a loss function (or cost function, or objective). One (of many) ways to do it: the Multiclass SVM loss (one possible generalization of the binary Support Vector Machine to multiple classes).

  20. 2. Define a loss function (or cost function, or objective). One (of many) ways to do it: the Multiclass SVM loss, L_i = Σ_{j ≠ y_i} max(0, s_j - s_{y_i} + Δ): for example i, sum over all incorrect labels j the loss due to the difference between the correct class score and the incorrect class score.

  21. The same loss, annotated term by term: the outer sum runs over all incorrect labels for example i; each term is the loss due to the difference between the correct class score and an incorrect class score.
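A numpy sketch of the per-example Multiclass SVM loss (the function name and the worked numbers are mine, and Δ defaults to 1, an assumption):

```python
import numpy as np

def svm_loss_i(s, y, delta=1.0):
    """L_i = sum over j != y of max(0, s_j - s_y + delta)."""
    margins = np.maximum(0, s - s[y] + delta)
    margins[y] = 0.0  # the correct class contributes no loss
    return margins.sum()

# Hypothetical scores with correct class y = 0:
# j=1: max(0, 5.1 - 3.2 + 1) = 2.9, j=2: max(0, -1.7 - 3.2 + 1) = 0
assert np.isclose(svm_loss_i(np.array([3.2, 5.1, -1.7]), 0), 2.9)
```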

  22. Example: e.g. 10. Loss = ?

  23. Example: e.g. 10

  24. (figure only)

  25. There is a bug with the objective…

  26. L2 Regularization (with a hyperparameter λ, the regularization strength).

  27. L2 regularization: motivation
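One way to see the motivation (a standard illustration; these particular numbers are my assumption, not taken from the slide): two weight vectors that produce identical scores but different L2 penalties.

```python
import numpy as np

x  = np.array([1.0, 1.0, 1.0, 1.0])
w1 = np.array([1.0, 0.0, 0.0, 0.0])      # concentrated weights
w2 = np.array([0.25, 0.25, 0.25, 0.25])  # diffuse weights

# Both weight vectors give the same score on x...
assert np.isclose(w1.dot(x), w2.dot(x))
# ...but the L2 penalty R(w) = sum(w**2) prefers the diffuse w2,
# which uses every input dimension a little rather than one a lot.
assert np.sum(w2 ** 2) < np.sum(w1 ** 2)
```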

  28. (figure only)

  29. Do we have to cross-validate both Δ and λ?

  30. (figure only)

  31. So far… 1. Score function. 2. Loss function.

  32. (figure only)

  33. Softmax Classifier: the score function is the same (an extension of Logistic Regression to multiple classes).

  34. Softmax Classifier: the score function is the same.

  35. Softmax Classifier: the score function is the same; the softmax function P(y = k | x) = e^{s_k} / Σ_j e^{s_j} turns scores into probabilities, i.e. we’re minimizing the negative log likelihood.

  36. Softmax Classifier: the score function is the same, plus the softmax function.

  37. Softmax Classifier: the score function is the same; i.e. we’re minimizing the negative log likelihood.
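A numpy sketch of the softmax loss for one example (the function name is mine, and the max-subtraction is a standard numeric-stability step, not shown on the slide):

```python
import numpy as np

def softmax_loss_i(s, y):
    """-log of the normalized probability of the correct class,
    i.e. the negative log likelihood of label y under the model."""
    shifted = s - np.max(s)  # stabilizes exp() without changing probs
    probs = np.exp(shifted) / np.sum(np.exp(shifted))
    return -np.log(probs[y])

# Uniform scores give probability 1/3 to each class: loss = log(3)
assert np.isclose(softmax_loss_i(np.array([1.0, 1.0, 1.0]), 0), np.log(3))
```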

  38. (figure only)

  39. Softmax vs. SVM - interpreting the probabilities from the Softmax.

  40. Softmax vs. SVM - interpreting the probabilities from the Softmax: suppose the weights W were only half as large (i.e. we used a higher regularization strength).

  41. Softmax vs. SVM - interpreting the probabilities from the Softmax: suppose the weights W were only half as large:

  42. Softmax vs. SVM - interpreting the probabilities from the Softmax: suppose the weights W were only half as large. What happens in the limit, as the regularization strength goes to infinity?

  43. Softmax vs. SVM 1. Scores: [10, -2, 3], [10, 9, 9], [10, -100, -100]
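To make the contrast concrete, here is a sketch that evaluates both losses on the three score vectors above, assuming the correct class is the first one and Δ = 1 (both assumptions, not stated in the transcript):

```python
import numpy as np

def svm_loss_i(s, y, delta=1.0):
    margins = np.maximum(0, s - s[y] + delta)
    margins[y] = 0.0
    return margins.sum()

def softmax_loss_i(s, y):
    shifted = s - np.max(s)
    probs = np.exp(shifted) / np.sum(np.exp(shifted))
    return -np.log(probs[y])

cases = [np.array([10.0, -2.0, 3.0]),
         np.array([10.0, 9.0, 9.0]),
         np.array([10.0, -100.0, -100.0])]

# The SVM is already satisfied in all three cases: every incorrect
# score is at least delta below the correct one, so the loss is 0...
for s in cases:
    assert svm_loss_i(s, 0) == 0.0

# ...while the softmax always wants the correct class to be more
# probable, so its loss still distinguishes the three cases.
sm = [softmax_loss_i(s, 0) for s in cases]
assert sm[1] > sm[0] > sm[2]
```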

  44. Softmax vs. SVM 1

  45. Interactive Web Demo time: http://vision.stanford.edu/teaching/cs231n/linear-classify-demo/
