feature descriptors
play

Feature Descriptors 16-385 Computer Vision (Kris Kitani) Carnegie - PowerPoint PPT Presentation

Feature Descriptors 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University Tiny Images Just downsample it Simple Fast Robust to small affine transformation What are the problems? Multi-Scale Oriented Patches (MOPS) Multi-Image


  1. Feature Descriptors 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University

  2. Tiny Images

  3. Just downsample it

  4. Simple Fast Robust to small affine transformation What are the problems?

  5. Multi-Scale Oriented Patches (MOPS) Multi-Image Matching using Multi-Scale Oriented Patches. M. Brown, R. Szeliski and S. Winder. 
 International Conference on Computer Vision and Pattern Recognition (CVPR2005). pages 510-517

  6. Multi-Scale Oriented Patches (MOPS) Multi-Image Matching using Multi-Scale Oriented Patches. M. Brown, R. Szeliski and S. Winder. 
 International Conference on Computer Vision and Pattern Recognition (CVPR2005). pages 510-517 ( x, y, s, θ ) Given a feature Get 40 x 40 image patch, subsample every 5th pixel ( what’s the purpose of this step? ) Subtract the mean, divide by standard deviation ( what’s the purpose of this step? ) Haar Wavelet Transform ( what’s the purpose of this step? )

  7. Multi-Scale Oriented Patches (MOPS) Multi-Image Matching using Multi-Scale Oriented Patches. M. Brown, R. Szeliski and S. Winder. 
 International Conference on Computer Vision and Pattern Recognition (CVPR2005). pages 510-517 ( x, y, s, θ ) Given a feature Get 40 x 40 image patch, subsample every 5th pixel (low frequency filtering, absorbs localization errors) Subtract the mean, divide by standard deviation ( what’s the purpose of this step? ) Haar Wavelet Transform ( what’s the purpose of this step? )

  8. Multi-Scale Oriented Patches (MOPS) Multi-Image Matching using Multi-Scale Oriented Patches. M. Brown, R. Szeliski and S. Winder. 
 International Conference on Computer Vision and Pattern Recognition (CVPR2005). pages 510-517 ( x, y, s, θ ) Given a feature Get 40 x 40 image patch, subsample every 5th pixel (low frequency filtering, absorbs localization errors) Subtract the mean, divide by standard deviation (removes bias and gain) Haar Wavelet Transform ( what’s the purpose of this step? )

  9. Multi-Scale Oriented Patches (MOPS) Multi-Image Matching using Multi-Scale Oriented Patches. M. Brown, R. Szeliski and S. Winder. 
 International Conference on Computer Vision and Pattern Recognition (CVPR2005). pages 510-517 ( x, y, s, θ ) Given a feature Get 40 x 40 image patch, subsample every 5th pixel (low frequency filtering, absorbs localization errors) Subtract the mean, divide by standard deviation (removes bias and gain) Haar Wavelet Transform (low frequency projection)

  10. Haar Wavelets (actually, Haar-like features) Use responses of a bank of filters as a descriptor

  11. Haar wavelet responses can be computed with filtering image patch Haar wavelets filters -1 -1 +1 +1 d y d x Haar wavelet responses can be computed efficiently (in constant time) with integral images

  12. Integral Image I ( x, y ) A ( x, y ) 1 5 2 1 6 8 original integral 2 4 1 3 12 15 image image 2 1 1 5 15 19 X I ( x 0 , y 0 ) A ( x, y ) = x 0  x,y 0  y

  13. Integral Image I ( x, y ) A ( x, y ) 1 5 2 1 6 8 original integral 2 4 1 3 12 15 image image 2 1 1 5 15 19 X I ( x 0 , y 0 ) A ( x, y ) = x 0  x,y 0  y Can find the sum of any block using 3 operations A ( x 1 , y 1 , x 2 , y 2 ) = A ( x 2 , y 2 ) − A ( x 1 , y 2 ) − A ( x 2 , y 1 ) + A ( x 1 , y 1 )

  14. What is the sum of the bottom right 2x2 square? A ( x 1 , y 1 , x 2 , y 2 ) = A ( x 2 , y 2 ) − A ( x 1 , y 2 ) − A ( x 2 , y 1 ) + A ( x 1 , y 1 ) I ( x, y ) A ( x, y ) 1 5 2 1 6 8 2 4 1 3 12 15 2 1 1 5 15 19 image integral image A (1 , 1 , 3 , 3) = A (3 , 3) − A (1 , 3) − A (3 , 1) + A (1 , 1) = 19 8 5 + 1 − − = 7

  15. Given an image patch, compute filter responses filter bank (20 Haar wavelet filters) vector of filter responses Responses are usually computed at specified location as a face patch descriptor

  16. AdaBoost AdaBoost AdaBoost accept accept Σ Σ Σ FACE! average average average reject reject reject filter response filter response filter response

  17. Given an image patch, compute filter responses filter bank (20 Haar wavelet filters) vector of filter responses Responses are usually computed at specified location as a face patch descriptor When will this feature descriptor fail?

Recommend


More recommend