CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick - PowerPoint PPT Presentation

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors CS 4495 Computer Vision Features 2 – SIFT descriptor Aaron Bobick School of Interactive Computing

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Administrivia • PS 3: Out – due Oct 6 th . • Features recap: • Goal is to find corresponding locations in two images. • Last time: find locations that can be accurately located and likely to be found in both images even if photometric or slight geometric changes. • This time (and next?) – find possible (likely?) correspondences between points • Later: which of guessed, plausible correspondences are correct • Today’s part on matching done really well in Szeliski section 4.1

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Matching with Features • Detect feature points in both images

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors An introductory example: Harris corner detector C.Harris, M.Stephens. “A Combined Corner and Edge Detector”. 1988

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Harris Detector: Mathematics ∑ ∑   I I I I = =  x x x y T  M A A ∑ ∑   I I I I   x y y y Measure of corner response: ( ) 2 = − R det M k trace M = λ λ det M 1 2 = λ + λ trace M 1 2 ( k – empirical constant, k = 0.04-0.06)

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Harris Detector: Mathematics “Edge” “Corner” • R depends only on R < 0 eigenvalues of M • R is large for a corner R > 0 • R is negative with large magnitude for an edge • | R | is small for a flat region “Flat” “Edge” |R| small R < 0 λ 1

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detection • Consider regions (e.g. circles) of different sizes around a point • Regions of corresponding sizes will look the same in both images

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detection • The problem: how do we choose corresponding circles independently in each image?

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detection • Common approach: Take a local maximum of this function Observation : region size, for which the maximum is achieved, should be invariant to image scale. Important: this scale invariant region size is found in each image independently! Image 1 f f Image 2 scale = 1/2 s 1 s 2 region size region size

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale sensitive response The top row shows two images taken with different focal lengths. The bottom row shows the response over scales of the normalized LoG . The ratio of scales corresponds to the scale factor (2.5) between the two images.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Key point localization • General idea: find robust extremum (maximum or minimum) both in space and in scale. Resample Blur Subtract Each point is compared to its 8 neighbors in the current image and 9 neighbors each in the scales above and below.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Key point localization • General idea: find robust extremum (maximum or minimum) both in space and in scale. • SIFT specific suggestion: use DoG pyramid to find maximum Resample Blur Subtract values (remember edge detection?) – then eliminate “edges” and pick only corners. Each point is compared to its 8 neighbors in the current image and 9 neighbors each in the scales above and below.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detection f = ∗ Kernel Image • Functions for determining scale Kernels: ( ) = σ σ + σ 2 L G ( , , x y ) G ( , , x y ) xx yy (Laplacian) = σ − σ DoG G x y k ( , , ) G x y ( , , ) (Difference of Gaussians) where Gaussian 2 + 2 x y − Note: both kernels are invariant to σ = σ 2 G x y ( , , ) 1 e 2 scale and rotation πσ 2

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale space processed one octave at a time

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Extrema at different scales

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Key point localization • General idea: find robust extremum (maximum or minimum) both in space and in scale. • SIFT specific suggestion: use DoG pyramid to find maximum values (remember edge Resample Blur Subtract detection?) – then eliminate “edges” and pick only corners. • More recent: use Harris detector to find maximums in space and then look at the Laplacian for maximum in scale. Each point is compared to its 8 neighbors in the current image and 9 neighbors each in the scales above and below.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detectors ← Laplacian → scale • Harris-Laplacian 1 Find local maximum of: • Harris corner detector in y space (image coordinates) • Laplacian in scale ← Harris → x • Method(s) • Find strong Harris corners at different scales • Keep those that are at maxima in the LoG (DoG) 1 K.Mikolajczyk, C.Schmid. “Indexing Based on Scale Invariant Interest Points”. ICCV 2001

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detectors ← Laplacian → scale • Harris-Laplacian 1 Find local maximum of: • Harris corner detector in space (image y coordinates) • Laplacian in scale ← Harris → x scale • SIFT (Lowe) 2 ← DoG → Find local maximum of: – Difference of Gaussians in y space and scale ← DoG → x 1 K.Mikolajczyk, C.Schmid. “Indexing Based on Scale Invariant Interest Points”. ICCV 2001 2 D.Lowe. “Distinctive Image Features from Scale-Invariant Keypoints”. IJCV 2004

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Scale Invariant Detectors • Experimental evaluation of detectors w.r.t. scale change Repeatability rate: # correspondences # possible correspondences K.Mikolajczyk, C.Schmid. “Indexing Based on Scale Invariant Interest Points”. ICCV 2001

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Some references…

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Point Descriptors • We know how to detect points • Next question: How to match them? ? Point descriptor should be: 1. Invariant 2. Distinctive

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Harris detector Interest points extracted with Harris (~ 500 points)

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Simple solution? • Harris gives good detection – and we also know the scale. • Why not just use correlation to check the match of the window around the feature in image1 with every feature in image 2? • Main reasons: Correlation is not rotation invariant - why do we want this? 1. Correlation is sensitive to photometric changes. 2. Normalized correlation is sensitive to non-linear photometric 3. changes and even slight geometric ones. Could be slow. 4.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors SIFT: Motivation • The Harris operator is not invariant to scale and correlation is not invariant to rotation. • For better image matching, Lowe’s goal was to develop an interest operator – a detecto r – that is invariant to scale and rotation. • Also, Lowe aimed to create a descriptor that was robust to the variations corresponding to typical viewing conditions. The descriptor is the most-used part of SIFT.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Idea of SIFT • Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters SIFT Features

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Another version of the problem… … in here Want to find

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Overall Procedure at a High Level • Scale-space extrema detection • Search over multiple scales and image locations Use Harris- • Keypoint localization Laplace or other method • Define a model to determine location and scale. Select keypoints based on a measure of stability. • Orientation assignment • Compute best orientation(s) for each keypoint region. • Keypoint description • Use local image gradients at selected scale and rotation • to describe each keypoint region.

Features 2: SIFT and CS 4495 Computer Vision – A. Bobick other descriptors Example of keypoint detection (a) 233x189 image (b) 832 DOG extrema 28

CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick - PowerPoint PPT Presentation

Features 2: SIFT and CS 4495 Computer Vision A. Bobick other descriptors CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick School of Interactive Computing Features 2: SIFT and CS 4495 Computer Vision A. Bobick

CS 4495 Computer Vision 3D Perception Kelsey Hawkins Robotics 3D Perception CS 4495 Computer

CS 4495 Computer Vision Features 1 Harris and other corners Aaron Bobick School of

CS 4495 Computer Vision Features 1 Harris and other corners Aaron Bobick School of

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University SIFT (Scale Invariant

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Binary images and Morphology Aaron Bobick School of Interactive

CS 4495 Computer Vision Camera Model Aaron Bobick School of Interactive Computing Camera Model

CS 4495 Computer Vision Linear Filtering 2: Templates, Edges Aaron Bobick School of Interactive

CS 4495 Computer Vision Motion Models Aaron Bobick School of Interactive Computing Motion

CS 4495 Computer Vision Classification 2 Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision Hidden Markov Models Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision Frequency2: Sampling and Aliasing Aaron Bobick School of Interactive

CS 4495 Computer Vision Frequency and Fourier Transforms Aaron Bobick School of Interactive

CS 4495 Computer Vision Tracking 1- Kalman,Gaussian Aaron Bobick School of Interactive

CS 4495 Computer Vision Camera Model Aaron Bobick School of Interactive Computing Camera Model

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

In Sandra Duncan, EdD & Jody Martin The Best Job of All The Best Job of All 1 10/9/2018

Integrated Assessment Modeling of Coupled Natural and Human Systems in LCB Asim Zia

Role of AIR in supporting locust monitoring and assessment using aerospace technologies Wenjiang

CS4495/6495 Introduction to Computer Vision 4B-L1 SIFT descriptor Point Descriptors Last

SIFT Features and Hough Transform Various slides from previous courses by: D.A. Forsyth (Berkeley

Local Feature Descriptors SIFT Various slides from previous courses by: D.A. Forsyth (Berkeley /

SIFT keypoint detection D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV

O b j e c t R e c o g n i t i o n S I F T v s C o n v o l u t i o

CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick - PowerPoint PPT Presentation

Features 2: SIFT and CS 4495 Computer Vision A. Bobick other descriptors CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick School of Interactive Computing Features 2: SIFT and CS 4495 Computer Vision A. Bobick

CS 4495 Computer Vision 3D Perception Kelsey Hawkins Robotics 3D Perception CS 4495 Computer

CS 4495 Computer Vision Features 1 Harris and other corners Aaron Bobick School of

CS 4495 Computer Vision Features 1 Harris and other corners Aaron Bobick School of

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University SIFT (Scale Invariant

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Binary images and Morphology Aaron Bobick School of Interactive

CS 4495 Computer Vision Camera Model Aaron Bobick School of Interactive Computing Camera Model

CS 4495 Computer Vision Linear Filtering 2: Templates, Edges Aaron Bobick School of Interactive

CS 4495 Computer Vision Motion Models Aaron Bobick School of Interactive Computing Motion

CS 4495 Computer Vision Classification 2 Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision Hidden Markov Models Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision Frequency2: Sampling and Aliasing Aaron Bobick School of Interactive

CS 4495 Computer Vision Frequency and Fourier Transforms Aaron Bobick School of Interactive

CS 4495 Computer Vision Tracking 1- Kalman,Gaussian Aaron Bobick School of Interactive

CS 4495 Computer Vision Camera Model Aaron Bobick School of Interactive Computing Camera Model

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

In Sandra Duncan, EdD &amp; Jody Martin The Best Job of All The Best Job of All 1 10/9/2018

Integrated Assessment Modeling of Coupled Natural and Human Systems in LCB Asim Zia

Role of AIR in supporting locust monitoring and assessment using aerospace technologies Wenjiang

CS4495/6495 Introduction to Computer Vision 4B-L1 SIFT descriptor Point Descriptors Last

SIFT Features and Hough Transform Various slides from previous courses by: D.A. Forsyth (Berkeley

Local Feature Descriptors SIFT Various slides from previous courses by: D.A. Forsyth (Berkeley /

SIFT keypoint detection D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV

O b j e c t R e c o g n i t i o n S I F T v s C o n v o l u t i o

In Sandra Duncan, EdD & Jody Martin The Best Job of All The Best Job of All 1 10/9/2018