CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks Review - PowerPoint PPT Presentation

CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks

Review of Filters: From Linear to Non-linear

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 90 0 90 90 90 0 0 0 0 0 90 0 90 90 90 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0 0 0 0 0 0 0 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 0 0 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 20 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 0 0 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 20 30 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 0 0 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 10 20 30 30 0 0 0 0 0 0 0 0 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 90 0 90 90 90 0 0 0 0 0 90 90 90 90 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 20 30 30 30 20 10 0 0 0 90 90 90 90 90 0 0 0 20 40 60 60 60 40 20 0 0 0 90 90 90 90 90 0 0 0 30 60 90 90 90 60 30 0 0 0 90 90 90 90 90 0 0 0 30 50 80 80 90 60 30 0 0 0 90 0 90 90 90 0 0 0 30 50 80 80 90 60 30 0 0 0 90 90 90 90 90 0 0 0 20 30 50 50 60 40 20 10 20 30 30 30 30 20 10 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0 0 0 0 0 10 10 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Credit: S. Seitz

Reducing salt-and-pepper noise 3x3 5x5 7x7 • What’s wrong with the results?

Median filter (Non-linear) •What advantage does median filtering have over box filtering? • Robustness to outliers Source: K. Grauman

Median filter (Non-linear) 3x3 5x5 7x7 Gaussian Median Source: M. Hebert

Gaussian vs. median filtering 3x3 5x5 7x7 Gaussian Median

Neural Networks A General Framework from Linear to Non-linear

Image Classification : A core task in Computer Vision (assume given set of discrete labels) {dog, cat, truck, plane, ...} cat This image by Nikita is licensed under CC-BY 2.0 Lecture 2 - 14

The Problem : Semantic Gap What the computer sees An image is just a big grid of numbers between [0, 255]: e.g. 800 x 600 x 3 (3 channels RGB) This image by Nikita is licensed under CC-BY 2.0 Lecture 2 - 15

Challenges : Viewpoint variation All pixels change when the camera moves! This image by Nikita is licensed under CC-BY 2.0 Lecture 2 - 16

17 Challenges : Illumination This image is CC0 1.0 public domain This image is CC0 1.0 public domain This image is CC0 1.0 public domain This image is CC0 1.0 public domain Lecture 2 -

Challenges : Deformation This image by Tom Thai is This image by sare bear is This image by Umberto Salvagnin This image by Umberto Salvagnin licensed under CC-BY 2.0 licensed under CC-BY 2.0 is licensed under CC-BY 2.0 is licensed under CC-BY 2.0 Lecture 2 -

Challenges : Occlusion 19 This image by jonsson is licensed This image is CC0 1.0 public domain This image is CC0 1.0 public domain under CC-BY 2.0 Lecture 2 -

Challenges : Background Clutter 20 This image is CC0 1.0 public domain This image is CC0 1.0 public domain Lecture 2 -

Challenges : Intraclass variation This image is CC0 1.0 public domain Lecture 2 -

Linear Classification Lecture 2 -

Recall CIFAR10 50,000 training images each image is 32x32x3 10,000 test images. Lecture 2 -

Parametric Approach Image 10 numbers giving f( x , W ) class scores Array of 32x32x3 numbers W (3072 numbers total) parameters or weights Lecture 2 -

Parametric Approach: Linear Classifier f(x,W) = Wx Image 10 numbers giving f( x , W ) class scores Array of 32x32x3 numbers W (3072 numbers total) parameters or weights Lecture 2 -

Parametric Approach: Linear Classifier 3072x1 f(x,W) = Wx Image 10x1 10x3072 10 numbers giving f( x , W ) class scores Array of 32x32x3 numbers W (3072 numbers total) parameters or weights Lecture 2 -

Parametric Approach: Linear Classifier 3072x1 f(x,W) = Wx + b 10x1 Image 10x1 10x3072 10 numbers giving f( x , W ) class scores Array of 32x32x3 numbers W (3072 numbers total) parameters or weights Lecture 2 -

Example with an image with 4 pixels, and 3 classes (cat/dog/ship) Stretch pixels into column 56 0.2 -0.5 0.1 2.0 1.1 -96.8 56 231 Cat score 231 1.5 1.3 2.1 0.0 3.2 437.9 + = 24 2 Dog score 24 0 0.25 0.2 -0.3 -1.2 61.95 Ship score 2 Input image W b Lecture 2 -

Example with an image with 4 pixels, and 3 classes (cat/dog/ship) Algebraic Viewpoint f(x,W) = Wx Lecture 2 -

Example with an image with 4 pixels, and 3 classes (cat/dog/ship) Input image Algebraic Viewpoint f(x,W) = Wx 0.2 -0.5 1.5 1.3 0 .25 W 0.1 2.0 2.1 0.0 0.2 -0.3 b 1.1 3.2 -1.2 -96.8 437.9 61.95 Score Lecture 2 -

Interpreting a Linear Classifier Lecture 2 -

Interpreting a Linear Classifier: Geometric Viewpoint f(x,W) = Wx + b Array of 32x32x3 numbers (3072 numbers total) Plot created using Wolfram Cloud Cat image by Nikita is licensed under CC-BY 2.0 Lecture 2 -

Hard cases for a linear classifier Class 1 : Class 1 : Class 1 : First and third quadrants 1 <= L2 norm <= 2 Three modes Class 2 : Class 2 : Class 2 : Everything else Everything else Second and fourth quadrants Lecture 2 -

Linear Classifier: Three Viewpoints Visual Viewpoint Algebraic Viewpoint Geometric Viewpoint One template Hyperplanes f(x,W) = Wx per class cutting up space Lecture 2 -

How the Human Brain learns In the human brain, a typical neuron collects signals from others through a host of fine structures called dendrites . • • The neuron sends out spikes of electrical activity through a long, thin strand known as an axon , which splits into thousands of branches. • At the end of each branch, a structure called a synapse converts the activity from the axon into electrical effects that inhibit or excite activity in the connected neurons.

A Simple Neuron • An artificial neuron is a device with many inputs and one output.

Element of Neural Network 𝑔 : 𝑆 𝐿 → 𝑆 Neuron a z a w a w a w b w = + + + + 1 � 1 1 2 2 K K 1 w a 2 z 2 a ( ) z + σ … w Activation a K K function b weights bias

Neural Network neuron Input Layer 1 Layer 2 Layer L Output y 1 …… x 1 x …… y 2 2 …… …… …… …… …… …… y M x N Input Output Hidden Layers Layer Layer Deep means many hidden layers

Example of Neural Network 0.98 4 1 1 -2 1 0.12 -1 -2 -1 1 0 Sigmoid Function ( ) z σ 1 ( ) z = 1 σ z e − + z

Activation functions Leaky ReLU Sigmoid tanh Maxout ELU ReLU

CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks Review - PowerPoint PPT Presentation

CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks Review of Filters: From Linear to Non-linear Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Computer Vision Neurobio 230 Bill Lotter Exciting time: Neuroscience computer vision

Introductions Computer Vision Automatic understanding of images and video Instructor :

A Computer Vision Sampler COMPSCI 527 Today: Introduction to computer vision Course

Computer Vision Introduction Historical context Connections to other disciplines Vision and

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

CS4495/6495 Introduction to Computer Vision 1A-L1 Introduction Outline What is computer

Camera Calibration COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Camera

Computer Vision/Graphics -- Dr. Chandra Kambhamettu for SIGNEWGRAD 11/24/04 Computer Vision :

COMPUTER VISION FOR ROBOT NAVIGATION Sanketh Shetty Computer Vision and Robotics Laboratory

CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques John Magee 1 Computer

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden 2020-05-22

Image Pyramids COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Image Pyramids 1

Stereo Vision I Introduction to Computer Vision CSE 152 Lecture 13 CSE152, Spr 07 Intro

Learning for Computer Vision Ramprasaath Lecture Outline Computer Vision Before

Computer Vision: from Recognition to Geometry Shao-Yi Chien

Tracking Feature Windows COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision

Training Neural Nets COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Training

Dont Use Computer Vision For Web Security Florian Tramr CV-COPS August 28 th 2020 Computer

Visual Cognition Computer Vision 3D Vision Mobile multimedia 3D TV Model-based

University of Cambridge Engineering Part IIB Module 4F12: Computer Vision Handout 1:

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden 2020-05-29

EVC: Image Processing & Computer Vision http://

Image Motion COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Image Motion 1 /

CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks Review - PowerPoint PPT Presentation

CSE 152: Computer Vision Hao Su Lecture 7: Neural Networks Review of Filters: From Linear to Non-linear Image filtering (Linear case) 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Computer Vision Neurobio 230 Bill Lotter Exciting time: Neuroscience computer vision

Introductions Computer Vision Automatic understanding of images and video Instructor :

A Computer Vision Sampler COMPSCI 527 Today: Introduction to computer vision Course

Computer Vision Introduction Historical context Connections to other disciplines Vision and

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

CS4495/6495 Introduction to Computer Vision 1A-L1 Introduction Outline What is computer

Camera Calibration COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Camera

Computer Vision/Graphics -- Dr. Chandra Kambhamettu for SIGNEWGRAD 11/24/04 Computer Vision :

COMPUTER VISION FOR ROBOT NAVIGATION Sanketh Shetty Computer Vision and Robotics Laboratory

CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques John Magee 1 Computer

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden 2020-05-22

Image Pyramids COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Image Pyramids 1

Stereo Vision I Introduction to Computer Vision CSE 152 Lecture 13 CSE152, Spr 07 Intro

Learning for Computer Vision Ramprasaath Lecture Outline Computer Vision Before

Computer Vision: from Recognition to Geometry Shao-Yi Chien

Tracking Feature Windows COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision

Training Neural Nets COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Training

Dont Use Computer Vision For Web Security Florian Tramr CV-COPS August 28 th 2020 Computer

Visual Cognition Computer Vision 3D Vision Mobile multimedia 3D TV Model-based

University of Cambridge Engineering Part IIB Module 4F12: Computer Vision Handout 1:

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden 2020-05-29

EVC: Image Processing &amp; Computer Vision http://

Image Motion COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Image Motion 1 /

EVC: Image Processing & Computer Vision http://