COMPUTER VISION Multi-view Geometry Emanuel Aldea < - PowerPoint PPT Presentation

COMPUTER VISION Multi-view Geometry Emanuel Aldea < emanuel.aldea@u-psud.fr > http://hebergement.u-psud.fr/emi/ Computer Science and Multimedia Master - University of Pavia

Context of pose estimation Why do we need anything beside the existing algorithms ? ◮ Generic pose estimation and refinement algorithms fail in some contexts, e.g. : E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (2/19)

Context of pose estimation Why do we need anything beside the existing algorithms ? ◮ Generic pose estimation and refinement algorithms fail in some contexts, e.g. : ◮ Large homogeneous areas (ground, facades) E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (2/19)

Context of pose estimation Why do we need anything beside the existing algorithms ? ◮ Generic pose estimation and refinement algorithms fail in some contexts, e.g. : ◮ Large homogeneous areas (ground, facades) ◮ Repetitive static patterns (arches, window corners etc.) E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (2/19)

Context of pose estimation Why do we need anything beside the existing algorithms ? ◮ Generic pose estimation and refinement algorithms fail in some contexts, e.g. : ◮ Large homogeneous areas (ground, facades) ◮ Repetitive static patterns (arches, window corners etc.) ◮ Similarity of people body parts E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (2/19)

Context of pose estimation Why do we need anything beside the existing algorithms ? ◮ Generic pose estimation and refinement algorithms fail in some contexts, e.g. : ◮ Large homogeneous areas (ground, facades) ◮ Repetitive static patterns (arches, window corners etc.) ◮ Similarity of people body parts ◮ Wide baseline : perspective change, strong occlusions E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (2/19)

Camera-IMU fusion for localization Why is image based localization powerful ? ◮ Affordable in terms of hardware and computational cost E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (3/19)

Camera-IMU fusion for localization Why is image based localization powerful ? ◮ Affordable in terms of hardware and computational cost ◮ Major issue when the scene is not well textured : hard to estimate the reliability of the estimation E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (3/19)

Camera-IMU fusion for localization Why is image based localization powerful ? ◮ Affordable in terms of hardware and computational cost ◮ Major issue when the scene is not well textured : hard to estimate the reliability of the estimation ◮ Minor issue : scale must be estimated separately (i.e. the norm of the translation is unknown) E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (3/19)

Camera-IMU fusion for localization Why is image based localization powerful ? ◮ Affordable in terms of hardware and computational cost ◮ Major issue when the scene is not well textured : hard to estimate the reliability of the estimation ◮ Minor issue : scale must be estimated separately (i.e. the norm of the translation is unknown) ◮ Benefit of coupling with IMU and GPS : avoid faulty results E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (3/19)

Camera-IMU fusion for localization Why is image based localization powerful ? ◮ Affordable in terms of hardware and computational cost ◮ Major issue when the scene is not well textured : hard to estimate the reliability of the estimation ◮ Minor issue : scale must be estimated separately (i.e. the norm of the translation is unknown) ◮ Benefit of coupling with IMU and GPS : avoid faulty results Single image based relative pose estimation ◮ Sensor performance : reliable but mediocre (low cost equipment) ◮ We know that the vision estimation is often very inaccurate E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (3/19)

Camera-IMU fusion for localization The skeleton of an M-Estimator approach Identify a solution close to the sensor pose which is guided by matches from images :       �  + λ ( s ) 2 s = arg min ˆ  c w ( k )(1 − g ( k , s )) (1) s k ∈ Ω  Details regarding the terms : ◮ Ω is the set of potentially correct associations, and w ( k ) measures the visual quality of the association k ◮ g ( k , s ) evaluates the agreement between the current pose s and the association k ◮ λ ( s ) is a measure of the proximity of the solution to the sensor pose ◮ c controls the relative importance of the regularisaton and data attachment terms E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (4/19)

Camera-IMU fusion for localization The skeleton of an M-Estimator approach Identify a solution close to the sensor pose which is guided by matches from images :       �  + λ ( s ) 2 s = arg min ˆ  c w ( k )(1 − g ( k , s )) (1) s  k ∈ Ω Details regarding the terms : ◮ Ω is the set of potentially correct associations, and w ( k ) measures the visual quality of the association k ◮ g ( k , s ) evaluates the agreement between the current pose s and the association k ◮ λ ( s ) is a measure of the proximity of the solution to the sensor pose ◮ c controls the relative importance of the regularisaton and data attachment terms Initialization : ◮ these types of optimizations are non-convex, and thus sensitive to the initialization ◮ stochastic initialization by sampling poses around the prior ◮ aims to draw a candidate in the bassin of attraction of the estimator ◮ problem if the sensor information is not sufficient to build a prior E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (4/19)

Camera-IMU fusion for localization The agreement function g ( k , s ) � � − d ( k , s ) 2 g ( k , s ) = exp (2) 2 σ 2 h The distance d ( k , s ) is an image space error in k when we consider s . The parameter σ h has an important impact on the profile of the energy (the smaller it is, the more sensitive the functional). The visual quality w ( k ) ◮ related to how similar p and p ′ are visually, based on a descriptor distance d ( p , p ′ ) ◮ a robust way to define w ( k ) in terms of the two closest distances between p and any p ′ : w v ( k ) = 1 − d 1 NN ( k ) d 2 NN ( k ) E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (5/19)

Camera-IMU fusion for localization The agreement function g ( k , s ) � � − d ( k , s ) 2 g ( k , s ) = exp (2) 2 σ 2 h The distance d ( k , s ) is an image space error in k when we consider s . The parameter σ h has an important impact on the profile of the energy (the smaller it is, the more sensitive the functional). The visual quality w ( k ) ◮ related to how similar p and p ′ are visually, based on a descriptor distance d ( p , p ′ ) ◮ a robust way to define w ( k ) in terms of the two closest distances between p and any p ′ : w v ( k ) = 1 − d 1 NN ( k ) d 2 NN ( k ) The proximity measure λ ( s ) ◮ defined as a Mahalanobis distance between s and the prior s 0 (avec δ s = s − s 0 ) : λ ( s ) = 1 � δ s T Σ − 1 s 0 δ s | s | E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (5/19)

Adapting the method for a specific context Learning the weights ◮ The w v ( k ) is widely used but it exhibits known limitations in urban environments ◮ (Yi et al., CVPR18) proposed a neural network which estimates the correspondence weights w g ( k ) based on a learnt global coherence ◮ The two algorithms have fundamentally different behaviors : 0.06 0.7 inliers inliers outliers 0.6 0.05 outliers 0.5 0.04 Frequence Frequence 0.4 0.03 0.3 0.02 0.2 0.01 0.1 0 0 0.5 1 0 0 0.2 0.4 0.6 0.8 1 w v w g ◮ Relying on a composite weight (stricter than the sum) improves significantly the performance of the M-Estimator E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (6/19)

Example : static camera image E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (7/19)

Example : dynamic camera image E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (8/19)

Pose estimation and epipole with pure vision E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (9/19)

Pose estimation and epipole with sensor-vision fusion E. Aldea (CS&MM- U Pavia) COMPUTER VISION Chap III : Sensors, Multi-view Geometry (10/19)

COMPUTER VISION Multi-view Geometry Emanuel Aldea < - PowerPoint PPT Presentation

COMPUTER VISION Multi-view Geometry Emanuel Aldea < emanuel.aldea@u-psud.fr > http://hebergement.u-psud.fr/emi/ Computer Science and Multimedia Master - University of Pavia Context of pose estimation Why do we need anything beside the

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Deep Learning for Geometry Processing 3D Representations View-Based and Volumetric CNNs 3D

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

Stochastic geometry and random generation 1 Stochastic geometry and random generation

The Epipolar Geometry COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision The

Cumbernauld Academy Existing aerial view from west Site Plan Aerial view from South Aerial view

48-175 Descriptive Geometry Basic Concepts of Descriptive Geometry Descriptive geometry is

Multi-view Active Learning Ion Muslea University of Southern California Outline Multi-view

3d Geometry for Computer Graphics Lesson 1: Basics & PCA 3d geometry 3d geometry 3d

MULTI-VIEW GEOMETRY (1 lecture) Epipolar Geometry The Essential and Fundamental Matrices

Multi-View Representation Learning: Algorithms and Applications Changqing Zhang ( )

Semantic Multi-View Model For Low-Power Carlos Gmez, Julien DeAntoni, Frdric Mallet

CS 4495 Computer Vision Calibration and Projective Geometry (1) Aaron Bobick School of

Hyperbolic Geometry Victor Gonzalez Mentor: Ryan Kirk May 4, 2016 Hyperbolic Geometry We are

Geometry Problems Geometry Problems Examples for Typical ACM Instances Elementary Geometry

Multi-plane multi-view approach to project the sphere viewing sphere Introduction Global Map

Thesis presentation Event Extraction from Text and Translation to Event Calculus Geert Heyman

sss trs t

SLAM CAD&CG SLAM:

Reconnaissance dobjets Reconnaissance d objets Reconnaissance d objets Reconnaissance

Takayuk Tanigawa ( ) Center for Planetary Science / ILTS, Hokkaido Univ. NCU-CPS

From words to phrases in Distributional Semantic Models R affaella B ernardi U niversit ` a di T

Digital representations of the collection objects in the Museum fr Naturkunde Berlin Falko

Geoapplications development http://rgeo.wikience.org Higher School of Economics, Moscow,

COMPUTER VISION Multi-view Geometry Emanuel Aldea < - PowerPoint PPT Presentation

COMPUTER VISION Multi-view Geometry Emanuel Aldea < emanuel.aldea@u-psud.fr > http://hebergement.u-psud.fr/emi/ Computer Science and Multimedia Master - University of Pavia Context of pose estimation Why do we need anything beside the

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Deep Learning for Geometry Processing 3D Representations View-Based and Volumetric CNNs 3D

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

Stochastic geometry and random generation 1 Stochastic geometry and random generation

The Epipolar Geometry COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision The

Cumbernauld Academy Existing aerial view from west Site Plan Aerial view from South Aerial view

48-175 Descriptive Geometry Basic Concepts of Descriptive Geometry Descriptive geometry is

Multi-view Active Learning Ion Muslea University of Southern California Outline Multi-view

3d Geometry for Computer Graphics Lesson 1: Basics &amp; PCA 3d geometry 3d geometry 3d

MULTI-VIEW GEOMETRY (1 lecture) Epipolar Geometry The Essential and Fundamental Matrices

Multi-View Representation Learning: Algorithms and Applications Changqing Zhang ( )

Semantic Multi-View Model For Low-Power Carlos Gmez, Julien DeAntoni, Frdric Mallet

CS 4495 Computer Vision Calibration and Projective Geometry (1) Aaron Bobick School of

Hyperbolic Geometry Victor Gonzalez Mentor: Ryan Kirk May 4, 2016 Hyperbolic Geometry We are

Geometry Problems Geometry Problems Examples for Typical ACM Instances Elementary Geometry

Multi-plane multi-view approach to project the sphere viewing sphere Introduction Global Map

Thesis presentation Event Extraction from Text and Translation to Event Calculus Geert Heyman

sss trs t

SLAM CAD&amp;CG SLAM:

Reconnaissance dobjets Reconnaissance d objets Reconnaissance d objets Reconnaissance

Takayuk Tanigawa ( ) Center for Planetary Science / ILTS, Hokkaido Univ. NCU-CPS

From words to phrases in Distributional Semantic Models R affaella B ernardi U niversit ` a di T

Digital representations of the collection objects in the Museum fr Naturkunde Berlin Falko

Geoapplications development http://rgeo.wikience.org Higher School of Economics, Moscow,

3d Geometry for Computer Graphics Lesson 1: Basics & PCA 3d geometry 3d geometry 3d

SLAM CAD&CG SLAM: