Lecture 8: Spatial Localization and Detection Fei-Fei Li & - PowerPoint PPT Presentation

Lecture 8: Spatial Localization and Detection Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 1

Administrative - Project Proposals were due on Saturday - Homework 2 due Friday 2/5 - Homework 1 grades out this week - Midterm will be in-class on Wednesday 2/10 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 2

Convolution 32 32 3 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 3

Pooling 1 1 2 4 5 6 7 8 3 2 1 0 1 2 3 4 2x2 max pooling 6 8 3 4 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 4

Case Studies LeNet (1998) AlexNet (2012) ZFNet (2013) Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 5

Case Studies VGG GoogLeNet ResNet (2014) (2014) (2015) Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 6

Localization and Detection Results from Faster R-CNN, Ren et al 2015 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 7

Computer Vision Tasks Classification Instance Object Detection Classification + Localization Segmentation CAT, DOG, DUCK CAT CAT CAT, DOG, DUCK Single object Multiple objects Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 8

Computer Vision Tasks Classification Instance Object Detection Classification + Localization Segmentation Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 9

Classification + Localization: Task Classification : C classes Input: Image CAT Output: Class label Evaluation metric: Accuracy Localization : Input: Image (x, y, w, h) Output : Box in the image (x, y, w, h) Evaluation metric: Intersection over Union Classification + Localization : Do both Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 10

Classification + Localization: ImageNet 1000 classes (same as classification) Each image has 1 class, at least one bounding box ~800 training images per class Algorithm produces 5 (class, box) guesses Example is correct if at least one one guess has correct class AND bounding box at least 0.5 intersection over union (IoU) Krizhevsky et. al. 2012 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 11

Idea #1: Localization as Regression Input : image Neural Net Output : Box coordinates (4 numbers) Loss : L2 distance Correct output : box coordinates Only one object, (4 numbers) simpler than detection Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 12

Simple Recipe for Classification + Localization Step 1 : Train (or download) a classification model (AlexNet, VGG, GoogLeNet) Convolution Fully-connected and Pooling layers Softmax loss Final conv Class scores feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 13

Simple Recipe for Classification + Localization Step 2 : Attach new fully-connected “regression head” to the network Fully-connected layers “Classification head” Convolution Class scores and Pooling Fully-connected layers “Regression head” Final conv Box coordinates feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 14

Simple Recipe for Classification + Localization Step 3 : Train the regression head only with SGD and L2 loss Fully-connected layers Convolution Class scores and Pooling Fully-connected layers L2 loss Final conv Box coordinates feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 15

Simple Recipe for Classification + Localization Step 4 : At test time use both heads Fully-connected layers Convolution Class scores and Pooling Fully-connected layers Final conv Box coordinates feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 16

Per-class vs class agnostic regression Assume classification Fully-connected over C classes: layers Classification head : C numbers (one per class) Convolution Class scores and Pooling Class agnostic: 4 numbers Fully-connected layers (one box) Class specific: C x 4 numbers Final conv Box coordinates feature map (one box per class) Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 17

Where to attach the regression head? After conv layers : After last FC layer : Overfeat, VGG DeepPose, R-CNN Convolution Fully-connected and Pooling layers Softmax loss Final conv Class scores feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 18

Aside: Localizing multiple objects Want to localize exactly K objects in each image Fully-connected layers (e.g. whole cat, cat head, cat left ear, cat right ear for K=4) Convolution Class scores and Pooling Fully-connected layers K x 4 numbers (one box per object) Final conv Box coordinates feature map Image Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 19

Aside: Human Pose Estimation Represent a person by K joints Regress (x, y) for each joint from last fully-connected layer of AlexNet (Details: Normalized coordinates, iterative refinement) Toshev and Szegedy, “DeepPose: Human Pose Estimation via Deep Neural Networks”, CVPR 2014 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 20

Localization as Regression Very simple Think if you can use this for projects Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 21

Idea #2: Sliding Window ● Run classification + regression network at multiple locations on a high- resolution image ● Convert fully-connected layers into convolutional layers for efficient computation ● Combine classifier and regressor predictions across all scales for final prediction Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 22

Sliding Window: Overfeat Class scores: 4096 4096 Winner of ILSVRC 2013 1000 localization challenge FC FC Softmax Convolution loss + pooling FC FC FC FC Feature map: Euclidean 1024 x 5 x 5 Image: loss 3 x 221 x 221 Boxes: 1024 4096 Sermanet et al, “Integrated Recognition, Localization and 1000 x 4 Detection using Convolutional Networks”, ICLR 2014 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 23

Sliding Window: Overfeat Network input: Larger image: 3 x 221 x 221 3 x 257 x 257 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 24

Sliding Window: Overfeat 0.5 Network input: Larger image: Classification scores: 3 x 221 x 221 P(cat) 3 x 257 x 257 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 25

Sliding Window: Overfeat 0.5 0.75 Network input: Larger image: Classification scores: 3 x 221 x 221 P(cat) 3 x 257 x 257 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 26

Sliding Window: Overfeat 0.5 0.75 0.6 Network input: Larger image: Classification scores: 3 x 221 x 221 P(cat) 3 x 257 x 257 Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 27

Lecture 8: Spatial Localization and Detection Fei-Fei Li & - PowerPoint PPT Presentation

Lecture 8: Spatial Localization and Detection Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 1 Administrative - Project Proposals

Category-level localization Cordelia Schmid Category-level localization Localization of

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Resource 1: What is spatial? presentation notes Section Section text Notes 1. Spatial

Broadening the Study of Spatial Intelligence Mary Hegarty University of California, Santa

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

Lecture 18: Localization Lecture 18: Localization algorithms algorithms Mythili Vutukuru CS

Localization Nischal K N System Overview Mapping Hector Mapping Localization Path Planning

Category-level localization Cordelia Schmid Category-level localization Localization up to a

Creating a Science of Spatial Learning Nora S. Newcombe Temple University PI, Spatial

Spatial Digitech Keep it s im ple Make it spatial About US Spatial Digitech is a provider of

UCSB is Spatial ! http://www.spatial.ucsb.edu Specialist Meeting on Spatial Thinking across the

STAT 209 Spatial Data I April 30, 2018 Colin Reimer Dawson 1 / 26 Spatial Data Projections

Lecture 19 Spatial GLM + Point Reference Spatial Data Colin Rundel 11/09/2017 1 Spatial GLM

Lecture 19 Spatial GLM + Point Reference Spatial Data Colin Rundel 04/03/2017 1 Spatial GLM

EE E6820: Speech & Audio Processing & Recognition Lecture 8: Spatial sound 1 Spatial

Sensitivity to color variations & Spatial Localization Setareh Rafatirad Behzad Sajadi 1

Localized Structured Prediction Carlo Ciliberto 1 , Francis Bach 2 , 3 , Alessandro Rudi 2 , 3 1

Why Patient Centered Care Matters on the Path to Value GENA COOK CEO, NAVIGATING CANCER October

Home sweet home! Greenbrier River West Virginia Attualit in tema di Pneumoconiosi Siena, Italy

Mind & Brain in the What Is Neuroscience? 21 st Century Multidisciplinary research attempts

Introduction to Mobile Robotics SLAM: Simultaneous Localization and Mapping Wolfram Burgard,

First, parse the title ... Eigenvector localization : Eigenvectors are usually global

Weaving localization issues into a content strategy W3C Multilingual Web Workshop Limerick,

Murhaf Hossari University College Dublin Murhaf.hossari@gmail.com Right-to-Left Languages

Lecture 8: Spatial Localization and Detection Fei-Fei Li & - PowerPoint PPT Presentation

Lecture 8: Spatial Localization and Detection Fei-Fei Li & Andrej Karpathy & Justin Johnson Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 8 - Lecture 8 - 1 Feb 2016 1 Feb 2016 1 Administrative - Project Proposals

Category-level localization Cordelia Schmid Category-level localization Localization of

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Resource 1: What is spatial? presentation notes Section Section text Notes 1. Spatial

Broadening the Study of Spatial Intelligence Mary Hegarty University of California, Santa

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

Lecture 18: Localization Lecture 18: Localization algorithms algorithms Mythili Vutukuru CS

Localization Nischal K N System Overview Mapping Hector Mapping Localization Path Planning

Category-level localization Cordelia Schmid Category-level localization Localization up to a

Creating a Science of Spatial Learning Nora S. Newcombe Temple University PI, Spatial

Spatial Digitech Keep it s im ple Make it spatial About US Spatial Digitech is a provider of

UCSB is Spatial ! http://www.spatial.ucsb.edu Specialist Meeting on Spatial Thinking across the

STAT 209 Spatial Data I April 30, 2018 Colin Reimer Dawson 1 / 26 Spatial Data Projections

Lecture 19 Spatial GLM + Point Reference Spatial Data Colin Rundel 11/09/2017 1 Spatial GLM

Lecture 19 Spatial GLM + Point Reference Spatial Data Colin Rundel 04/03/2017 1 Spatial GLM

EE E6820: Speech &amp; Audio Processing &amp; Recognition Lecture 8: Spatial sound 1 Spatial

Sensitivity to color variations &amp; Spatial Localization Setareh Rafatirad Behzad Sajadi 1

Localized Structured Prediction Carlo Ciliberto 1 , Francis Bach 2 , 3 , Alessandro Rudi 2 , 3 1

Why Patient Centered Care Matters on the Path to Value GENA COOK CEO, NAVIGATING CANCER October

Home sweet home! Greenbrier River West Virginia Attualit in tema di Pneumoconiosi Siena, Italy

Mind &amp; Brain in the What Is Neuroscience? 21 st Century Multidisciplinary research attempts

Introduction to Mobile Robotics SLAM: Simultaneous Localization and Mapping Wolfram Burgard,

First, parse the title ... Eigenvector localization : Eigenvectors are usually global

Weaving localization issues into a content strategy W3C Multilingual Web Workshop Limerick,

Murhaf Hossari University College Dublin Murhaf.hossari@gmail.com Right-to-Left Languages

EE E6820: Speech & Audio Processing & Recognition Lecture 8: Spatial sound 1 Spatial

Sensitivity to color variations & Spatial Localization Setareh Rafatirad Behzad Sajadi 1

Mind & Brain in the What Is Neuroscience? 21 st Century Multidisciplinary research attempts