Stereo II CSE 576 Ali Farhadi Several slides from - PowerPoint PPT Presentation

Stereo ¡II ¡ CSE ¡576 ¡ Ali ¡Farhadi ¡ ¡ ¡ ¡ Several ¡slides ¡from ¡Larry ¡Zitnick ¡and ¡Steve ¡Seitz ¡

Camera parameters A camera is described by several parameters • Translation T of the optical center from the origin of world coords • Rotation R of the image plane • focal length f, principle point (x ’ c , y ’ c ), pixel size (s x , s y ) • blue parameters are called “ extrinsics, ” red are “ intrinsics ” Projection equation • The projection matrix models the cumulative effect of all parameters • Useful to decompose into a series of operations identity matrix intrinsics projection rotation translation • The definitions of these parameters are not completely standardized – especially intrinsics—varies from one book to another

Extrinsics ¡ How ¡do ¡we ¡get ¡the ¡camera ¡to ¡ “ canonical ¡form ” ? ¡ • (Center ¡of ¡projecHon ¡at ¡the ¡origin, ¡x-‑axis ¡points ¡right, ¡y-‑axis ¡points ¡up, ¡z-‑ axis ¡points ¡backwards) ¡ Step ¡1: ¡Translate ¡by ¡-‑ c ¡ 0 ¡

Extrinsics ¡ How ¡do ¡we ¡get ¡the ¡camera ¡to ¡ “ canonical ¡form ” ? ¡ • (Center ¡of ¡projecHon ¡at ¡the ¡origin, ¡x-‑axis ¡points ¡right, ¡y-‑axis ¡points ¡up, ¡z-‑ axis ¡points ¡backwards) ¡ Step ¡1: ¡Translate ¡by ¡-‑ c ¡ ¡ How ¡do ¡we ¡represent ¡ translaHon ¡as ¡a ¡matrix ¡ mulHplicaHon? ¡ 0 ¡

Extrinsics ¡ How ¡do ¡we ¡get ¡the ¡camera ¡to ¡ “ canonical ¡form ” ? ¡ • (Center ¡of ¡projecHon ¡at ¡the ¡origin, ¡x-‑axis ¡points ¡right, ¡y-‑axis ¡points ¡up, ¡z-‑ axis ¡points ¡backwards) ¡ Step ¡1: ¡Translate ¡by ¡-‑ c ¡ Step ¡2: ¡Rotate ¡by ¡ R ¡ 0 ¡ 3x3 ¡rotaHon ¡matrix ¡

Extrinsics ¡ How ¡do ¡we ¡get ¡the ¡camera ¡to ¡ “ canonical ¡form ” ? ¡ • (Center ¡of ¡projecHon ¡at ¡the ¡origin, ¡x-‑axis ¡points ¡right, ¡y-‑axis ¡points ¡up, ¡z-‑ axis ¡points ¡backwards) ¡ Step ¡1: ¡Translate ¡by ¡-‑ c ¡ Step ¡2: ¡Rotate ¡by ¡ R ¡ 0 ¡

PerspecHve ¡projecHon ¡ (converts ¡from ¡3D ¡rays ¡in ¡camera ¡ coordinate ¡system ¡to ¡pixel ¡coordinates) ¡ (intrinsics) ¡ (upper ¡triangular ¡ in ¡general, ¡ ¡ matrix) ¡ : ¡ aspect ¡ra+o ¡ (1 ¡unless ¡pixels ¡are ¡not ¡square) ¡ : ¡ skew ¡(0 ¡unless ¡pixels ¡are ¡shaped ¡like ¡rhombi/parallelograms) ¡ : ¡ principal ¡point ¡((0,0) ¡unless ¡opHcal ¡axis ¡doesn ’ t ¡intersect ¡projecHon ¡plane ¡at ¡origin) ¡

ProjecHon ¡matrix ¡ intrinsics ¡ projecHon ¡ rotaHon ¡ translaHon ¡

ProjecHon ¡matrix ¡ = ¡ 0 ¡ (in ¡homogeneous ¡image ¡coordinates) ¡

Epipolar constraint: Calibrated case X x x’ • Assume that the intrinsic and extrinsic parameters of the cameras are known • We can multiply the projection matrix of each camera (and the image points) by the inverse of the calibration matrix to get normalized image coordinates • We can also set the global coordinate system to the coordinate system of the first camera. Then the projection matrices of the two cameras can be written as [I | 0] and [R | t]

Epipolar constraint: Calibrated case X = ( x, 1) T x x’ = Rx+t t R The vectors Rx , t , and x’ are coplanar

Epipolar constraint: Calibrated case X x x’ ′ = = ′ x T ⋅ × = E x 0 with E [ t ] R x [ t ( R x ) ] 0 × Essential Matrix (Longuet-Higgins, 1981) The vectors Rx , t , and x’ are coplanar

Epipolar constraint: Calibrated case X x x’ ′ = = ′ x T ⋅ × = E x 0 with E [ t ] R x [ t ( R x ) ] 0 × • E x is the epipolar line associated with x ( l ' = E x ) • E T x ' is the epipolar line associated with x' ( l = E T x ' ) • E e = 0 and E T e ' = 0 • E is singular (rank two) • E has five degrees of freedom

Epipolar constraint: Uncalibrated case X x x’ • The calibration matrices K and K ’ of the two cameras are unknown • We can write the epipolar constraint in terms of unknown normalized coordinates: ′ ′ ′ − − = = ′ ˆ 1 ˆ 1 ˆ = x T x K x , x K x ˆ ˆ E x 0

Epipolar constraint: Uncalibrated case X x x’ ′ ′ ′ − − = = = x T ˆ ˆ T T 1 E x 0 x F x 0 with F K E K − = 1 ˆ x K x Fundamental Matrix ′ ′ ′ − = 1 ˆ x K x (Faugeras and Luong, 1992)

Epipolar constraint: Uncalibrated case X x x’ ′ ′ ′ − − = = = x T ˆ ˆ T T 1 E x 0 x F x 0 with F K E K • F x is the epipolar line associated with x ( l ' = F x ) • F T x ' is the epipolar line associated with x' ( l ' = F T x ' ) • F e = 0 and F T e ' = 0 • F is singular (rank two) • F has seven degrees of freedom

The eight-point algorithm ′ ′ ′ ⎡ ⎤ = = f T x ( u , v , 1 ) , x ( u , v , 1 ) 11 ⎢ ⎥ f ⎢ ⎥ 12 ⎢ ⎥ f 13 ⎢ ⎥ ⎡ ⎤ ⎡ ⎤ f f f u f ⎢ ⎥ 11 12 13 21 ⎢ ⎥ ⎢ ⎥ [ ] [ ] ′ ′ = ⎢ ⎥ ′ ′ ′ ′ ′ ′ u v 1 f f f v 0 = u u u v u v u v v v u v 1 f 0 ⎢ ⎥ ⎢ ⎥ 21 22 23 22 ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ f f f f 1 ⎢ ⎥ A ⎣ ⎦ ⎣ ⎦ 23 31 32 33 ⎢ ⎥ f ⎢ ⎥ 31 ⎢ f ⎥ Smallest 32 ⎢ ⎥ f ⎣ ⎦ 33 eigenvalue of Minimize: A T A N ∑ ′ T 2 ( x F x ) i i = i 1 under the constraint || F || 2 =1

The eight-point algorithm N ∑ ′ T 2 • Meaning of error ( x F x ) : i i = i 1 sum of squared algebraic distances between points x ’ i and epipolar lines F x i (or points x i and epipolar lines F T x ’ i ) • Nonlinear approach: minimize sum of squared geometric distances [ ] N ∑ ′ ′ + 2 2 T d ( x , F x ) d ( x , F x ) i i i i = i 1

Problem with eight-point algorithm " $ f 11 & ' f 12 & ' & ' f 13 & ' & ' f 21 " $ u u u v u v u v v v u v = 0 ! ! ! ! ! ! & ' # % f 22 & ' & ' f 23 & ' f 31 & ' & ' f 32 # %

Problem with eight-point algorithm ⎡ ⎤ f 11 ⎢ ⎥ f ⎢ ⎥ 12 ⎢ ⎥ f 13 ⎢ ⎥ f [ ] ⎢ ⎥ ′ ′ ′ ′ ′ ′ 21 = − u u u v u v u v v v u v 1 ⎢ ⎥ f 22 ⎢ ⎥ f ⎢ ⎥ 23 ⎢ ⎥ f ⎢ ⎥ 31 ⎢ ⎥ f ⎣ ⎦ Poor numerical conditioning 32 Can be fixed by rescaling the data

The normalized eight-point algorithm (Hartley, 1995) • Center the image data at the origin, and scale it so the mean squared distance between the origin and the data points is 2 pixels • Use the eight-point algorithm to compute F from the normalized points • Enforce the rank-2 constraint (for example, take SVD of F and throw out the smallest singular value) • Transform fundamental matrix back to original units: if T and T ’ are the normalizing transformations in the two images, then the fundamental matrix in original coordinates is T ’ T F T

Comparison of estimation algorithms 8-point Normalized 8-point Nonlinear least squares Av. Dist. 1 2.33 pixels 0.92 pixel 0.86 pixel Av. Dist. 2 2.18 pixels 0.85 pixel 0.80 pixel

Moving on to stereo … Fuse a calibrated binocular stereo pair to produce a depth image image 1 image 2 Dense depth map Many of these slides adapted from Steve Seitz and Lana Lazebnik

Depth from disparity X ′ − x x f = z ′ − O O z x x’ f f O Baseline O’ B ⋅ B f ′ = − = disparity x x z Disparity is inversely proportional to depth.

Basic stereo matching algorithm • If necessary, rectify the two stereo images to transform epipolar lines into scanlines • For each pixel x in the first image • Find corresponding epipolar scanline in the right image • Search the scanline and pick the best match x’ • Compute disparity x-x’ and set depth(x) = fB/(x-x’)

Basic stereo matching algorithm • For each pixel in the first image • Find corresponding epipolar line in the right image • Search along epipolar line and pick the best match • Triangulate the matches to get depth information • Simplest case: epipolar lines are scanlines • When does this happen?

Simplest Case: Parallel images Epipolar constraint: ′ = = × x T E x 0 , E t R R = I t = ( T , 0, 0) x ⎡ ⎤ 0 0 0 x’ ⎢ ⎥ = × = − E t R 0 0 T ⎢ ⎥ t ⎢ ⎥ 0 T 0 ⎣ ⎦ ′ ⎡ ⎤ ⎛ ⎞ ⎛ ⎞ 0 0 0 u 0 ⎜ ⎟ ⎜ ⎟ ⎢ ⎥ ( ) ( ) ′ ′ − = − = = u v 1 0 0 T v 0 u v 1 T 0 Tv T v ⎜ ⎟ ⎜ ⎟ ⎢ ⎥ ⎜ ⎟ ⎜ ⎟ ′ ⎢ ⎥ 0 T 0 1 T v ⎣ ⎦ ⎝ ⎠ ⎝ ⎠ The y-coordinates of corresponding points are the same

Stereo image rectification

Stereo II CSE 576 Ali Farhadi Several slides from - PowerPoint PPT Presentation

Stereo II CSE 576 Ali Farhadi Several slides from Larry Zitnick and Steve Seitz Camera parameters A camera is described by several parameters Translation T of the

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Two-View Stereo Slides from S. Lazebnik, S. Seitz, Y. Furukawa Stereo What cues tell us

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang (Does not line up well

1 Basic Stereo Derivations Correspondence It is fundamentally ambiguous, even with stereo

CSE 152 Section 5 HW2: Stereo Geometry April 29, 2019 Owen Jow Stereo: two views. Why is one

Standard VGA Standard VGA Presentation Presentation Test Screen Test Screen The WHAT and WHY

Stereo Vision 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University Whats different

Evaluation of Anomaly Detection Method based on Pattern Recognition Romain Fontugne The

Improving Domain-specific Transfer Learning Applications for Image Recognition and Differential

SO YOUR DESIGNER WANTS A MASTHEAD... By ChenHuiJing / @hj_chen GET IMAGE RATIO

Geometry and Structure from Motion Computer Vision Fall 2018 Columbia University Stereo

1 2 CS 532: 3D Computer Vision Lecture 2 Enrique Dunn edunn@stevens.edu Lieb 310 Image

Computer Vision Levente Hajder, Dmitry Chetverikov Etvs Lornd University, Faculty of

Stereo Vision Approaches for Human to Robot Handover Aleksej Logacjov University of Hamburg

Stereo II CSE 576 Ali Farhadi Several slides from - PowerPoint PPT Presentation

Stereo II CSE 576 Ali Farhadi Several slides from Larry Zitnick and Steve Seitz Camera parameters A camera is described by several parameters Translation T of the

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Two-View Stereo Slides from S. Lazebnik, S. Seitz, Y. Furukawa Stereo What cues tell us

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

Dense Stereo Some Slides by Forsyth &amp; Ponce, Jim Rehg, Sing Bing Kang (Does not line up well

1 Basic Stereo Derivations Correspondence It is fundamentally ambiguous, even with stereo

CSE 152 Section 5 HW2: Stereo Geometry April 29, 2019 Owen Jow Stereo: two views. Why is one

Standard VGA Standard VGA Presentation Presentation Test Screen Test Screen The WHAT and WHY

Stereo Vision 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University Whats different

Evaluation of Anomaly Detection Method based on Pattern Recognition Romain Fontugne The

Improving Domain-specific Transfer Learning Applications for Image Recognition and Differential

SO YOUR DESIGNER WANTS A MASTHEAD... By ChenHuiJing / @hj_chen GET IMAGE RATIO

Geometry and Structure from Motion Computer Vision Fall 2018 Columbia University Stereo

1 2 CS 532: 3D Computer Vision Lecture 2 Enrique Dunn edunn@stevens.edu Lieb 310 Image

Computer Vision Levente Hajder, Dmitry Chetverikov Etvs Lornd University, Faculty of

Stereo Vision Approaches for Human to Robot Handover Aleksej Logacjov University of Hamburg

Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang (Does not line up well