cs325 artificial intelligence computer vision ii 3d
play

CS325 Artificial Intelligence Computer Vision II 3D Vision (Ch. 24) - PowerPoint PPT Presentation

CS325 Artificial Intelligence Computer Vision II 3D Vision (Ch. 24) Dr. Cengiz Gnay, Emory Univ. Spring 2013 Gnay () Computer Vision II 3D Vision (Ch. 24) Spring 2013 1 / 22 Limits of 2D Projection 3D world is projected onto 2D


  1. CS325 Artificial Intelligence Computer Vision II – 3D Vision (Ch. 24) Dr. Cengiz Günay, Emory Univ. Spring 2013 Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 1 / 22

  2. Limits of 2D Projection 3D world is projected onto 2D image What happens to depth information? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 2 / 22

  3. Getting the Depth from Perspective Projection Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 3 / 22

  4. Getting the Depth from Perspective Projection Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 3 / 22

  5. Getting the Depth from Perspective Projection Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 3 / 22

  6. Getting the Depth from Perspective Projection Giant panda, or just close? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 3 / 22

  7. Getting the Depth from Perspective Projection Giant panda, or just close? Can only tell if we know exactly the size. Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 3 / 22

  8. Alternative? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 4 / 22

  9. Alternative? Use your two eyes: Stereo Vision Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 4 / 22

  10. Entry/Exit Surveys Exit survey: Computer Vision I – Object Recognition List some problematic states of objects for which an object recognition algorithm must be invariant for. What kind of a filter mask would you convolve with an image to detect diagonal lines? Entry survey: Computer Vision II – 3D Vision (0.25 points) What tasks would you find difficult if you had only one eye open? How do you think stereograms are made? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 5 / 22

  11. Stereo Vision P : Target object. Z : Distance to object. B : Baseline; separation between eyes. x 1 , x 2 : Disparity or parallax; different offsets at each eye. Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 8 / 22

  12. Stereo Vision P : Target object. Z : Distance to object. B : Baseline; separation between eyes. x 1 , x 2 : Disparity or parallax; different offsets at each eye. Can we always find depth of P ? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 8 / 22

  13. Stereo Vision P : Target object. Z : Distance to object. B : Baseline; separation between eyes. x 1 , x 2 : Disparity or parallax; different offsets at each eye. Can we always find depth of P ? No, only sometimes. Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 8 / 22

  14. Stereo Vision: Which One is Easier? L R Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 9 / 22

  15. Stereo Vision: Which One is Easier? L R Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 9 / 22

  16. Stereo Vision: How to Find Depth? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  17. Stereo Vision: How to Find Depth? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  18. Stereo Vision: How to Find Depth? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  19. Stereo Vision: How to Find Depth? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  20. Stereo Vision: How to Find Depth? What’s different here? What don’t we need to find depth? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  21. Stereo Vision: How to Find Depth? What’s different here? What don’t we need to find depth? Original size . Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 10 / 22

  22. Finding Correspondence Between Left and Right Images P ? Right Left Where do we search on the right image? 1 2D: everywhere 2 1D: on a line 3 0D: we know the point Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 11 / 22

  23. Finding Correspondence Between Left and Right Images P ? Right Left Where do we search on the right image? 1 2D: everywhere 2 1D: on a line 3 0D: we know the point Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 11 / 22

  24. Finding Correspondence Between Left and Right Images P ? Right Left Where do we search on the right image? 1 2D: everywhere 2 1D: on a line 3 0D: we know the point Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 11 / 22

  25. Correspondence Problems Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 12 / 22

  26. Correspondence Problems Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 12 / 22

  27. Correspondence Problems Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 12 / 22

  28. Correspondence Problems Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 12 / 22

  29. Correspondence Problems You get Phantom Points if you get the correspondence wrong. Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 12 / 22

  30. A Real Correspondence Example Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 13 / 22

  31. A Real Correspondence Example Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 13 / 22

  32. A Real Correspondence Example Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 13 / 22

  33. A Real Correspondence Example Can we find correspondence with any of: 1 Texture match? 2 Feature match? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 13 / 22

  34. A Real Correspondence Example Can we find correspondence with any of: 1 Texture match? 2 Feature match? Both, actually. Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 13 / 22

  35. Texture Match with SSD SSD is not solid state drive , but it is sum of squared distance Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 14 / 22

  36. Sum of Squared Distance (SSD) Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 15 / 22

  37. Sum of Squared Distance (SSD) Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 15 / 22

  38. The Result: Disparity Maps Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 16 / 22

  39. How About Occlusions? Left Right Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 17 / 22

  40. How About Occlusions? Left Right Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 17 / 22

  41. How About Occlusions? Left Right Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 17 / 22

  42. Using Cost to Optimize Correspondence Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 18 / 22

  43. Using Cost to Optimize Correspondence Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 18 / 22

  44. Using Cost to Optimize Correspondence Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 18 / 22

  45. Using Cost to Optimize Correspondence Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 18 / 22

  46. Using Cost to Optimize Correspondence Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 18 / 22

  47. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

  48. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Does this look familiar? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

  49. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Does this look familiar? Can we use MDP? Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

  50. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Does this look familiar? Can we use MDP? V ( i , j ) =  match ( i , j ) + V ( i − 1 , j − 1 )   max occl ( i , j ) + V ( i − 1 , j )  occl ( i , j ) + V ( i , j − 1 )  Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

  51. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Does this look familiar? Can we use MDP? V ( i , j ) =  match ( i , j ) + V ( i − 1 , j − 1 )   max occl ( i , j ) + V ( i − 1 , j )  occl ( i , j ) + V ( i , j − 1 )  Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

  52. So How to Compute Best Alignment? Use dynamic programming : calculate correspondence matrix with O ( n 2 ) : Does this look familiar? Can we use MDP? V ( i , j ) =  match ( i , j ) + V ( i − 1 , j − 1 )   max occl ( i , j ) + V ( i − 1 , j )  occl ( i , j ) + V ( i , j − 1 )  State-of-the-art in computer vision! Günay () Computer Vision II – 3D Vision (Ch. 24) Spring 2013 19 / 22

Recommend


More recommend