cs325 artificial intelligence ch 24 computer vision i
play

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object - PowerPoint PPT Presentation

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object Recognition Cengiz Gnay, Emory Univ. Spring 2013 Gnay Ch. 24, Computer Vision I Object Recognition Spring 2013 1 / 27 Computer Vision Done with games, except


  1. CS325 Artificial Intelligence Ch. 24, Computer Vision I – Object Recognition Cengiz Günay, Emory Univ. Spring 2013 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 1 / 27

  2. Computer Vision Done with games, except homework :) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  3. Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  4. Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding 3 lectures: 1 Object recognition (today) 2 3D reconstruction 3 Motion analysis Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  5. Entry/Exit Surveys Exit survey: Advanced Planning Why isn’t classical planning schema adequate for resource planning? What is the advantage gained in abstract plans by having surely-reachable versus potentially-reachable states? Entry survey: Computer Vision I – Image Processing (0.25 points) List three specific tasks where computer vision would be desirable. What do you think are the major hurdles in computer vision? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 3 / 27

  6. How Machines See: Cameras A charge-coupled device (CCD) photo sensor array: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 4 / 27

  7. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  8. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  9. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Object’s distance ( Z ) & height ( X ) based on projection height ( x ) and focal distance ( f ) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  10. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  11. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  12. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  13. We All See a Perspective Projection Vanishing points from parallel lines: Giant panda, or just close? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  14. Object Recognition: How Hard Can It Be? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  15. Object Recognition: How Hard Can It Be? Problems? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  16. Object Recognition: How Hard Can It Be? Problems? Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  17. Not Hard for Us Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  18. Not Hard for Us Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  19. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  20. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  21. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Will have examples later. Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  22. Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

  23. Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Let’s start by simplifying: 1 Greyscale (monochrome) images 2 Pixels can have values: 0. . . 255 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

  24. Even Terminator Has Monochrome Vision Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 10 / 27

  25. Extracting Features: Edge Detection Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  26. Extracting Features: Edge Detection Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  27. Extracting Features: Edge Detection How to detect the vertical edge? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  28. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  29. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? 2 Filter with mask: + 1 − 1 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  30. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? 2 Filter with mask: + 1 − 1 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  31. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27 2 Filter with mask: + 1 − 1

  32. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27 2 Filter with mask: + 1 − 1

  33. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  34. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  35. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  36. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  37. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  38. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  39. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  40. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Works in parallel! Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  41. Neurons Can Do It Faster? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 13 / 27

  42. Detect Only Vertical Edges? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 14 / 27

  43. Detect Only Vertical Edges? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 14 / 27

  44. Horizontal and Vertical Gradients Original: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 15 / 27

Recommend


More recommend