scene recognition scene recognition
play

Scene Recognition Scene Recognition Adriana Kovashka Adriana - PowerPoint PPT Presentation

Scene Recognition Scene Recognition Adriana Kovashka Adriana Kovashka UTCS, PhD student UTCS, PhD student Problem Problem Problem Problem Statement Statement Distinguish between different types of scenes Applications


  1. Scene Recognition Scene Recognition Adriana Kovashka Adriana Kovashka UTCS, PhD student UTCS, PhD student

  2. Problem Problem Problem Problem • Statement Statement – Distinguish between different types of scenes • Applications Applications – Matching human perception – Understanding the environment • Indexing of images / video • Robotics R b ti – Graphics • In painting • In-painting

  3. Background Background Background Background • Definition of a scene Definition of a scene – “[A] scene is mainly characterized as a place in which we can move“ [Oliva 2001] in which we can move [Oliva 2001] • Assumptions – Human categorization Human categorization • Approaches – Parsing of the scene – as a whole, or in parts

  4. Coast [Oliva 2001] Mountain [Oliva 2001]

  5. Inside City [Oliva 2001] Street [Oliva 2001]

  6. Kitchen [Lazebnik 2006] Industrial [Lazebnik 2006]

  7. Scene? Scene? Scene? Scene?

  8. Scene? Scene? Scene? Scene?

  9. Urban or natural? Urban or natural? Urban or natural? Urban or natural?

  10. Urban or natural? Urban or natural? Urban or natural? Urban or natural?

  11. Spatial Envelope [Oliva 2001] Spatial Envelope [Oliva 2001] Spatial Envelope [Oliva 2001] Spatial Envelope [Oliva 2001] • Inspiration from human perception Inspiration from human perception – Naturalness, openness, roughness – Expansion, ruggedness Expansion ruggedness • Holistic, no recognition of objects • Three levels – “cars and people” vs. “street” vs. “urban environment”

  12. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) • Scene modeling Scene modeling – Discrete Fourier Transform [Oliva 2001] – Windowed Fourier Transform [Oliva 2001]

  13. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) – Principal Components Analysis Principal Components Analysis [Oliva 2001]

  14. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) [Oliva 2001]

  15. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) • Properties of the spatial envelope Properties of the spatial envelope – Discriminant spectral template (DST) • Relates spectral components to properties of the • Relates spectral components to properties of the spatial envelope • Parameter d learned through matching of feature vectors and property values – Windowed discriminant spectral template (WDST) (WDST)

  16. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) [Oliva 2001]

  17. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) [Oliva 2001]

  18. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) • Results Results – Scene properties [Oliva 2001]

  19. Spatial Spatial Envelope Envelope (cont’d) (cont’d) [Oliva 2001]

  20. Spatial Spatial Envelope Envelope (cont’d) (cont’d) [Oliva 2001]

  21. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) – Classification Classification • K-nn • 4 out of 7 neighbors g picked by humans [Oliva 2001]

  22. Spatial Envelope (cont’d) Spatial Envelope (cont’d) Spatial Envelope (cont d) Spatial Envelope (cont d) • Strengths Strengths – Higher-level descriptions – Low dimensionality Low dimensionality – Invariance to object composition – Weak local information Weak local information • Weaknesses – Significant number of human labels

  23. [Oliva 2001]

  24. Spatial Pyramid [Lazebnik 2006] Spatial Pyramid [Lazebnik 2006] Spatial Pyramid [Lazebnik 2006] Spatial Pyramid [Lazebnik 2006] • Global locally orderless Global, locally orderless • Bag-of-features • Extension of Pyramid Match Kernel in 2-d E t i f P id M t h K l i 2 d • Regular clustering of features

  25. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) [Grauman 2005] as quoted in [Lazebnik 2006] [Lazebnik 2006] [Lazebnik 2006] [Lazebnik 2006]

  26. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) [Lazebnik 2006]

  27. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) [Lazebnik 2006]

  28. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) • Results Results – SVM classification – Scene recognition Scene recognition [Lazebnik 2006] 65.2% for [Fei-Fei 2005]

  29. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) [Lazebnik 2006]

  30. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) – Object recognition Object recognition [Lazebnik 2006]

  31. Spatial Pyramid (cont’d) Spatial Pyramid (cont’d) Spatial Pyramid (cont d) Spatial Pyramid (cont d) • Strengths Strengths – Reasonable dimensionality – Locally orderless “Locally orderless” – Dense representation – “Robust to failures at individual levels” “Robust to failures at individual levels” • Weaknesses – No invariability to composition of image – Not robust to clutter

  32. Scene Completion Scene Completion http://graphics.cs.cmu.edu/ [Hays 2007] projects/scene-completion/ … Input image Scene Descriptor Image Collection … Context matching 200 matches 20 completions + blending Hays and Efros, SIGGRAPH 2007

  33. [Oliva 2001]

  34. Topic Models [Fei Topic Models [Fei-Fei 2005] Topic Models [Fei Topic Models [Fei Fei 2005] Fei 2005] Fei 2005] • Bayesian hierarchical model Bayesian hierarchical model • Intermediate representations • Bag-of-features B f f t – 4 ways to extract regions – 2 types of features

  35. Topic Models (cont’d) Topic Models (cont’d) Topic Models (cont d) Topic Models (cont d) [Fei-Fei 2005]

  36. Hierarchical Bayesian text models Hierarchical Bayesian text models [Fei-Fei 2005] “beach” Latent Dirichlet Allocation (LDA) π z c w N N D Fei-Fei et al. ICCV 2005

  37. Topic Models (cont’d) Topic Models (cont’d) Topic Models (cont d) Topic Models (cont d) • η – distribution of class labels l l b l • θ – parameter (estimated by EM) • c – class label • π – distribution of themes for image • z – theme • x – patch x patch • β – parameter (estimated by EM) [Fei-Fei 2005]

  38. Topic Models (cont’d) Topic Models (cont’d) Topic Models (cont d) Topic Models (cont d) • Codebook Codebook – 174 codewords [Fei-Fei 2005] [Fei-Fei 2005]

  39. Topic Models (cont’d) Topic Models (cont’d) [Fei-Fei 2005]

  40. Topic Topic Topic Topic Models Models (cont’d) (cont’d) (cont d) (cont d) • Results [Fei-Fei 2005]

  41. Topic Models (cont’d) Topic Models (cont’d) Topic Models (cont d) Topic Models (cont d) • Strengths Strengths – Unsupervised – Invariant to composition Invariant to composition • Weaknesses – No geometry – Matches of themes to categories – No correspondence to semantic categories

  42. Comparison Comparison Comparison Comparison • Global vs local Global vs. local – Spatial Envelope, Spatial Pyramid – Topic Models Topic Models • Viewpoint / location biases vs. invariability – Spatial Pyramid S – Topic Models, Spatial Envelope

  43. Comparison (cont’d) Comparison (cont’d) Comparison (cont d) Comparison (cont d) • Intermediate representations Intermediate representations – Spatial Envelope, Topic Models • Supervision vs. no supervision Supervision vs no supervision – Spatial Envelope – Topic Models, Spatial Pyramid S • Object recognition?

  44. Discussion Discussion Discussion Discussion • Object recognition vs scene recognition Object recognition vs. scene recognition – Global approaches • Spatial Pyramid scenes vs objects results • Spatial Pyramid, scenes vs. objects results – Bag-of-features • Use of scene recognition • Use of scene recognition • Ambiguous scenes • Human recognition of scenes – Importance

  45. References References References References [Fei-Fei 2005] L. Fei-Fei and P. Perona. A Bayesian Hi Hierarchical Model for Learning Natural Scene hi l M d l f L i N t l S Categories. CVPR 2005. [Grauman 2005] K. Grauman and T. Darrell. The Pyramid Match Kernel: Discriminative Classification with Sets of M t h K l Di i i ti Cl ifi ti ith S t f Image Features. ICCV 2005. [Hays 2007] J. Hays and A.A. Efros. Scene completion using millions of photographs. SIGGRAPH 2007. using millions of photographs SIGGRAPH 2007 [Lazebnik 2006] S. Lazebnik, C. Schmid, and J. Ponce. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories CVPR 2006 Recognizing Natural Scene Categories. CVPR 2006 . [Oliva 2001] A. Oliva and A. Torralba. Modeling the Shape of the Scene: a Holistic Representation of the Spatial Envelope IJCV 2001 Envelope. IJCV 2001.

Recommend


More recommend