Stereo Vision Reading: Chapter 11 Stereo matching computes depth - PowerPoint PPT Presentation

Stereo Vision Reading: Chapter 11 • Stereo matching computes depth from two or more images • Subproblems: – Calibrating camera positions. – Finding all corresponding points (hardest part) – Computing depth or surfaces. Slide credits for this chapter: David Jacobs, Frank Dellaert, Octavia Camps, Steve Seitz

Stereo vision Triangulate on two images of the same point to recover depth. – Feature matching across views depth – Calibrated cameras baseline Left Right ��

The epipolar constraint epipolar line epipolar line epipolar line epipolar line epipolar plane • Epipolar Constraint – Matching points lie along corresponding epipolar lines – Reduces correspondence problem to 1D search along conjugate epipolar lines – Greatly reduces cost and ambiguity of matching Slide credit: Steve Seitz

Simplest Case: Rectified Images • Image planes of cameras are parallel. • Focal points are at same height. • Focal lengths same. • Then, epipolar lines fall along the horizontal scan lines of the images • We will assume images have been rectified so that epipolar lines correspond to scan lines – Simplifies algorithms – Improves efficiency

We can always achieve this geometry with image rectification • Image Reprojection – reproject image planes onto common plane parallel to line between optical centers • Notice, only focal point of camera really matters (Seitz)

Basic Stereo Derivations B e n z O L i (u L ,v L ) P L = (X,Y,Z) l e s a b x y (u R ,v R ) z O R x Disparity: y

Correspondence • It is fundamentally ambiguous, even with stereo constraints ��

Correspondence: What should we match? • Objects? • Edges? • Pixels? • Collections of pixels?

Julesz: showed that recognition is not needed for stereo.

Correspondence: Epipolar constraint. The epipolar constraint helps, but much ambiguity remains.

Correspondence: Photometric constraint • Same world point has same intensity in both images. – True for Lambertian surfaces • A Lambertian surface has a brightness that is independent of viewing angle – Violations: • Noise • Specularity • Non-Lambertian materials • Pixels that contain multiple surfaces

Pixel matching For each epipolar line For each pixel in the left image • compare with every pixel on same epipolar line in right image • pick pixel with minimum match cost This leaves too much ambiguity, so: Improvement: match windows (Seitz)

Correspondence Using Correlation Left Right scanline SSD error disparity

Sum of Squared (Pixel) Differences Left Right w w L R m w w m L R x − x L y d y ( , ) ( , ) L L L I I L R w w m m and are correspond ing by windows of pixels. L R We define the window function : = − ≤ ≤ + − ≤ ≤ + W x y u v x u x y v y m m m m ( , ) { , | , } m 2 2 2 2 The SSD cost measures the intensity difference as a function of disparity : ∑ = − − C x y d I u v I u d v 2 ( , , ) [ ( , ) ( , )] r L R ∈ u v W x y ( , ) ( , ) m

Image Normalization • Even when the cameras are identical models, there can be differences in gain and sensitivity. • For these reason and more, it is a good idea to normalize the pixels in each window: ∑ = I I u v 1 ( , ) Average pixel W x y ( , ) m ∈ u v W x y ( , ) ( , ) m ∑ = I I u v 2 [ ( , )] Window magnitude W x y ( , ) m u v ∈ W x y ( , ) ( , ) m − I x y I ( , ) ˆ = I x y ( , ) Normalized pixel − I I W x y ( , ) m

Images as Vectors Left Right “Unwrap” image to form vector, using raster scan order w L w m row 1 R w m L m m row 2 Each window is a vector in an m 2 dimensional w vector space. L m row 3 Normalization makes them unit length.

Image Metrics (Normalized) Sum of Squared Differences w R ( d ) ∑ = ˆ − ˆ − C d I u v I u d v 2 ( ) [ ( , ) ( , )] L R SSD w ∈ u v W x y L ( , ) ( , ) m 2 = − w w d ( ) L R Normalized Correlation ∑ = ˆ ˆ − C d I u v I u d v ( ) ( , ) ( , ) L R NC ∈ u v W x y ( , ) ( , ) m = ⋅ = θ w w d ( ) cos L R 2 = − = ⋅ d w w d w w d * arg min ( ) arg max ( ) d L R d L R

Stereo Results Images courtesy of Point Grey Research

Window size W = 3 W = 20 • Effect of window size • Some approaches have been developed to use an adaptive window size (try multiple sizes and select best match) (Seitz)

Stereo testing and comparisons D. Scharstein and R. Szeliski. "A Taxonomy and Evaluation of Dense Two- Frame Stereo Correspondence Algorithms," International Journal of Computer Vision, 47 (2002), pp. 7-42. Scene Ground truth

Scharstein and Szeliski

Results with window correlation Window-based matching Ground truth (best window size) (Seitz)

Results with better method State of the art method: Graph cuts Ground truth (Seitz)

Stereo Correspondences Left scanline Right scanline … …

Stereo Correspondences Left scanline Right scanline … … Match Match Match Occlusion Disocclusion

Search Over Correspondences Occluded Pixels Left scanline Right scanline Disoccluded Pixels Three cases: – Sequential – add cost of match (small if intensities agree) – Occluded – add cost of no match (large cost) – Disoccluded – add cost of no match (large cost)

Stereo Matching with Dynamic Programming Occluded Pixels Left scanline Start Dynamic programming yields the optimal path through grid. This is Dis-occluded Pixels the best set of matches that satisfy the ordering constraint Right scanline End

Dynamic Programming • Efficient algorithm for solving sequential decision (optimal path) problems. = i 1 1 1 1 1 … = i 2 2 2 2 2 = i 3 3 3 3 3 = = = t t t t = T 1 2 3 T 3 How many paths through this trellis?

Dynamic Programming = i Π 1 1 1 1 12 Π 22 = i 2 2 2 2 States: Π 32 = i 3 3 3 3 C C C − + t t t 1 1 Suppose cost can be decomposed into stages: Π = i j Cost of going from state to state ij

Dynamic Programming = i Π 1 1 1 1 12 Π = 22 j = i 2 2 2 2 2 Π 32 = i 3 3 3 3 C C C − + t t t 1 1 Principle of Optimality for an n-stage assignment problem: = Π + C j C 1 i ( ) min ( ( )) − t i ij t

Dynamic Programming = i 1 1 1 1 = b ( 2 ) 2 t = j = i 2 2 2 2 2 = i 3 3 3 3 C C C − + t t t 1 1 = Π + C j C i ( ) min ( ( )) − t i ij t 1 = Π + b j C i ( ) arg min ( ( )) − t i ij t 1

Stereo Matching with Dynamic Programming Occluded Pixels Left scanline Scan across grid computing optimal cost for each node given its Dis-occluded Pixels upper-left neighbors. Backtrack from the terminal to Right scanline get the optimal path. Terminal

Scharstein and Szeliski

Segmentation-based Stereo Hai Tao and Harpreet W. Sawhney

Another Example

Result using a good technique Right Image Left Image Disparity

View Interpolation

Computing Correspondence Another approach is to match edges rather than windows of pixels: • • Which method is better? – Edges tend to fail in dense texture (outdoors) – Correlation tends to fail in smooth featureless areas

Summary of different stereo methods • Constraints: – Geometry, epipolar constraint. – Photometric: Brightness constancy, only partly true. – Ordering: only partly true. – Smoothness of objects: only partly true. • Algorithms: – What you compare: points, regions, features? • How you optimize: – Local greedy matches. – 1D search. – 2D search.

Stereo Vision Reading: Chapter 11 Stereo matching computes depth - PowerPoint PPT Presentation

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: Calibrating camera positions. Finding all corresponding points (hardest part) Computing depth or surfaces. Slide credits

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

1 Basic Stereo Derivations Correspondence It is fundamentally ambiguous, even with stereo

Two-View Stereo Slides from S. Lazebnik, S. Seitz, Y. Furukawa Stereo What cues tell us

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo

Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang (Does not line up well

CSE 152 Section 5 HW2: Stereo Geometry April 29, 2019 Owen Jow Stereo: two views. Why is one

Stereo Vision, Multi-View Object and Scene Reconstruction Veronica SCURTU ARTEMIS Department 1

Video-Rate Stereo Vision on a Reconfigurable Hardware Ahmad Darabiha Department of Electrical

Stereo Thurs Mar 22 Kristen Grauman UT Austin Multiple views Multi-view geometry, matching,

References and Resources Textbook: Simmons and Young, 3 rd edition: Chapter 3, Neuronal

Cognition in Visual Processing 707.031: Evaluation Methodology Winter 2015/16 Eduardo Veas

Lecture 1: Introduction 1 5-Jan-15 Lecture 1 - Fei-Fei Li & Andrej

DO NOT COPY Transcranial Alternating Current Stimulation (tACS) Emiliano Santarnecchi -

Visual Object Recognition Computational Models and Neurophysiological Mechanisms Neurobiology

DiCE Dichoptic Contrast Enhancement for VR and Stereo Displays Fangcheng Zhong University of

iVR Integrated Vision and Radio Localization with Zero Human Effort Jingao Xu, Hengjie Chen,

Sambuz

Useful Links

Newsletter

Mail Us

Stereo Vision Reading: Chapter 11 Stereo matching computes depth - PowerPoint PPT Presentation

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: Calibrating camera positions. Finding all corresponding points (hardest part) Computing depth or surfaces. Slide credits

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

1 Basic Stereo Derivations Correspondence It is fundamentally ambiguous, even with stereo

Two-View Stereo Slides from S. Lazebnik, S. Seitz, Y. Furukawa Stereo What cues tell us

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo

Dense Stereo Some Slides by Forsyth &amp; Ponce, Jim Rehg, Sing Bing Kang (Does not line up well

CSE 152 Section 5 HW2: Stereo Geometry April 29, 2019 Owen Jow Stereo: two views. Why is one

Stereo Vision, Multi-View Object and Scene Reconstruction Veronica SCURTU ARTEMIS Department 1

Video-Rate Stereo Vision on a Reconfigurable Hardware Ahmad Darabiha Department of Electrical

Stereo Thurs Mar 22 Kristen Grauman UT Austin Multiple views Multi-view geometry, matching,

References and Resources Textbook: Simmons and Young, 3 rd edition: Chapter 3, Neuronal

Cognition in Visual Processing 707.031: Evaluation Methodology Winter 2015/16 Eduardo Veas

Lecture 1: Introduction 1 5-Jan-15 Lecture 1 - Fei-Fei Li &amp; Andrej

DO NOT COPY Transcranial Alternating Current Stimulation (tACS) Emiliano Santarnecchi -

Visual Object Recognition Computational Models and Neurophysiological Mechanisms Neurobiology

DiCE Dichoptic Contrast Enhancement for VR and Stereo Displays Fangcheng Zhong University of

iVR Integrated Vision and Radio Localization with Zero Human Effort Jingao Xu*, Hengjie Chen*,

Sambuz

Useful Links

Newsletter

Mail Us

Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang (Does not line up well

Lecture 1: Introduction 1 5-Jan-15 Lecture 1 - Fei-Fei Li & Andrej

iVR Integrated Vision and Radio Localization with Zero Human Effort Jingao Xu, Hengjie Chen,