beyond bags of features
play

Beyond bags of Features Spatial Pyramid Matching for Recognizing - PowerPoint PPT Presentation

Beyond bags of Features Spatial Pyramid Matching for Recognizing Natural Scene Categories Camille Schreck, Romain Vavassori Ensimag December 14, 2012 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 1 / 32 State of art


  1. Beyond bags of Features Spatial Pyramid Matching for Recognizing Natural Scene Categories Camille Schreck, Romain Vavassori Ensimag December 14, 2012 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 1 / 32

  2. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 2 / 32

  3. Introduction Overall objective : semantic categorisation. Use spatial information. Global representation. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 3 / 32

  4. State of art 1 Method 2 Results 3 Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 4 / 32

  5. State of art Bags of feature Images described as an orderless collection of features. Good performances. Do not use the information about the spatial layout of the features. Multiresolution histograms Subsampling an image and compute a global histogram at each level. Generalized histograms to locally orderless images For each Gaussian aperture at a given location and scale, the locally orderless image returns the histogram of image features aggregated over that aperture. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 5 / 32

  6. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 6 / 32

  7. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 7 / 32

  8. Pyramid Match Kernel Goal Find the approximate correspondance between 2 set of vectors, X and Y, in a d -dimentional feature space. Idea correpondances are computed at different levels of resolution. at each level, a finer grid is set on the space. 2 vectors are said to match if they are on the same cell. take the weighted sum of all the matches. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 8 / 32

  9. Subdivisions of the feature space We compute matches at different level of resolution 0,..,L. At the level of resolution l the grid is divided in 2 l along each dimension. the grid has D = 2 dl cells where d is the number of dimensions. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 9 / 32

  10. Histograms intersection Histograms of X and Y H l X and H l Y are the histograms of X and Y at level l where H l X ( i ) is the number of vectors of X in the ith cell of the grid. Histogram Intersection function Give the number of matches at level l : D I ( H l X , H l � min ( H l X ( i ) , H l Y ) = Y ( i )) i =1 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 10 / 32

  11. Histograms intersection Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 11 / 32

  12. Histograms intersection Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 12 / 32

  13. Histograms intersection Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 13 / 32

  14. Computation of the kernel All matches found at level l + 1 are found also at level l . → Number of new matches at level l is given by I l − I l +1 . We sum all 1 the matches weighted by 2 L − l . The more the grid is coarse, the less the matches are weighted. pyramid match kernel Mercer kernel : L − 1 1 κ l ( X , Y ) = I L + 2 L − l ( I l − I l − 1 ) � l =0 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 14 / 32

  15. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 15 / 32

  16. “Orthogonal” approach Matching of two collection of features in a high-dimensional appearance space quantize all feature vectors into M discrete types, giving M channels. perform pyramid matching in the 2-dimensional image space for each channel m = 1 .. M . Assumption Only features of the same type m can be matched to one another. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 16 / 32

  17. Final kernel is the sum of separate channel kernels M K L ( X , Y ) = � κ L ( X m , Y m ) m =1 where X m and Y m are the coordinates of features of type m found in the respective image. K L can be computed as the intersection of the histograms obtain by concatenating the histograms of each channel. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 17 / 32

  18. Example Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 18 / 32

  19. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 19 / 32

  20. Two kinds of features for the experiments Weak features Edge points at two scales and eight orientations. → M = 16 channels. Strong features SIFT descriptors of 16 x 16 pixels. k -mean clustering to get a visual vocabulary. In the experiments vocalubary size is M = 200 or M = 400. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 20 / 32

  21. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 21 / 32

  22. Summary of the method Clustering features from a training set. Computation of the “description” of a query image. Comparison with the description of each image in test set. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 22 / 32

  23. State of art 1 Method 2 Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 23 / 32

  24. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 24 / 32

  25. Scene Category Recognition Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 25 / 32

  26. Example Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 26 / 32

  27. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 27 / 32

  28. Caltech-101 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 28 / 32

  29. State of art 1 Method 2 Pyramids Spatial Matching Scheme Features extraction Summary Results 3 Scene Category Recognition Caltech-101 Graz Dataset Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 29 / 32

  30. Graz Dataset Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 30 / 32

  31. State of art 1 Method 2 Results 3 Conclusion 4 Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 31 / 32

  32. Conclusion “holistic” approach for categorisation. Simple method. Gives better results than bag-of-features. Schreck, Vavassori (Ensimag) Beyond bags of Features December 14, 2012 32 / 32

Recommend


More recommend