sift
play

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon - PowerPoint PPT Presentation

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University SIFT (Scale Invariant Feature Transform) SIFT describes both a detector and descriptor 1. Multi-scale extrema detection 2. Keypoint localization 3. Orientation assignment


  1. SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University

  2. SIFT (Scale Invariant Feature Transform) SIFT describes both a detector and descriptor 1. Multi-scale extrema detection 2. Keypoint localization 3. Orientation assignment 4. Keypoint descriptor

  3. 1. Multi-scale extrema detection Second octave First octave Gaussian Difference of Gaussian (DoG)

  4. Gaussian Laplacian

  5. Scale-space extrema Scale of Gaussian variance Selected if larger than all 26 neighbors Difference of Gaussian (DoG)

  6. 2. Keypoint localization 2nd order Taylor series approximation of DoG scale-space x = { x, y, σ } Take the derivative and solve for extrema Additional tests to retain only strong features

  7. 3. Orientation assignment For a keypoint, L is the Gaussian-smoothed image with the closest scale, x-derivative y-derivative Detection process returns { x, y, σ , θ } location scale orientation

  8. 4. Keypoint descriptor Image Gradients SIFT descriptor (4 x 4 pixel per cell, 4 x 4 cells) (16 cells x 8 directions = 128 dims) Gaussian weighting (sigma = half width)

  9. �������������������� �������������������� Raw pixels Sampled Locally orderless Global histogram

Recommend


More recommend