3D Deep Learning on Geometric Forms Hao Su Many 3D representations - PowerPoint PPT Presentation

3D Deep Learning on Geometric Forms Hao Su

Many 3D representations are available Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models

3D representation Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models Novel view image synthesis [Su et al., ICCV15] [Dosovitskiy et al., ECCV16]

3D representation Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models

3D representation Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models a chair assembled by cuboids

Two groups of representations Candidates: multi-view images Rasterized form depth map (regular grids) volumetric polygonal mesh Geometric form point cloud (irregular) primitive-based CAD models

Extant 3D DNNs work on grid-like representations Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models

Ideally, a 3D representation should be Friendly to learning • easily formulated as the input/output of a neural network • fast forward-/backward- propagation • etc.

Ideally, a 3D representation should be Friendly to learning • easily formulated as the input/output of a neural network • fast forward-/backward- propagation • etc. Flexible • can precisely model a great variety of shapes • etc.

Ideally, a 3D representation should be Friendly to learning • easily formulated as the output of a neural network • fast forward-/backward- propagation • etc. Flexible • can precisely model a great variety of shapes • etc. Geometrically manipulable for networks • geometrically deformable, interpolable and extrapolable for networks • convenient to impose structural constraints • etc. Others

The problem of grid representations Affability Geometric Flexibility to learning manipulability Multi-view images Volumetric occupancy Expensive to compute: O(N 3 ) Depth map Cannot model “back side”

Typical artifacts of volumetric reconstruction Missing or extra thin structures Volumes are hard for the network to rotate / deform / interpolate

Learn to analyze / generate Geometric Forms? Candidates: multi-view images Rasterized form depth map (regular grids) volumetric polygonal mesh Geometric form point cloud (irregular) primitive-based CAD models

Outline Motivation 3D point cloud / CAD model reconstruction 3D point cloud analysis, e.g., segmentation

3D perception from a single image

Monocular vision a typical prey a typical predator Cited from https://en.wikipedia.org/wiki/Binocular_vision

Visual cues are complicated contrast color texture motion symmetry part category-specific 3D knowledge ……

Data-driven 2D-3D lifting Cabinet of things

ShapeNet: a large-scale 3D datasets of objects … ~3 million models in total ~2,000 classes Rich annotations (in progress)

3D point clouds A dual formulation of occupancy Flexibility Geometric manipulability Affability to learning Lagrangian Eulerian Prob. distribution Particle filters Volumetric Point occupancy clouds

Result: 3D reconstruction from real Images Input Reconstructed 3D point cloud

An end-to-end synthesis-for-learning system Image rendering   ( x 0 1 , y 0 1 , z 0 1 )     ( x 0 2 , y 0 2 , z 0 2 )   ... sampling     ( x 0 n , y 0 n , z 0 n )   3D model Groundtruth point cloud

An end-to-end learning system Image Predicted set   ( x 1 , y 1 , z 1 )     Deep Neural ( x 2 , y 2 , z 2 )   Network ...     ( x n , y n , z n )     ( x 0 1 , y 0 1 , z 0 1 )     ( x 0 2 , y 0 2 , z 0 2 )   ...     ( x 0 n , y 0 n , z 0 n )   Groundtruth point cloud

An end-to-end learning system Image Predicted set   ( x 1 , y 1 , z 1 )     Deep Neural ( x 2 , y 2 , z 2 )   Network ...     ( x n , y n , z n )   Point Set Distance   ( x 0 1 , y 0 1 , z 0 1 )     ( x 0 2 , y 0 2 , z 0 2 )   ...     ( x 0 n , y 0 n , z 0 n )   Groundtruth point cloud

Network architecture: Vanilla version Fully connected layer as predictor in standard classification network fully connected conv Encoder input shape embedding 𝑆 " point set Predictor

Network architecture: Vanilla version Fully connected layer as predictor in standard classification network fully connected conv Encoder input shape embedding 𝑆 " point set Predictor 𝑒 𝑆 " Independently regress n*3 numbers from : 𝑜×3

Natural statistics of geometry • Many objects, especially man-made objects, contain large smooth surfaces • Deconvolution can generate locally smooth textures for images

Network architecture: Output from deconv branch Two branch version conv deconv fully connected set union input Encoder 𝑜 ' =24*32=768 points Predictor point set 𝑜 ( =256 points 3-channel map of XYZ coordinates

Network architecture: Output from deconv branch Two branch version conv deconv fully connected set union input Encoder 𝑜 ' =24*32=768 points Predictor point set 𝑜 ( =256 points 3-channel map of XYZ coordinates  C 1 C 1 ∈ R n 1 × 3 � C = C 2 ∈ R n 2 × 3 C 2

Network architecture: Output from deconv branch Two branch version conv deconv fully connected set union input Encoder 𝑜 ' =24*32=768 points Predictor point set 𝑜 ( =256 points 3-channel map of XYZ coordinates

Network architecture: The role of two branches blue : deconv branch – large, consistent, smooth structures red : fully-connected branch – flexibly reconstruct intricate structures

An end-to-end learning system Predicted set   ( x 1 , y 1 , z 1 )     Deep Neural ( x 2 , y 2 , z 2 )   Network ...     ( x n , y n , z n )   Point Set Loss   ( x 0 1 , y 0 1 , z 0 1 )     ( x 0 2 , y 0 2 , z 0 2 )   ...     ( x 0 n , y 0 n , z 0 n )   Groundtruth point cloud

Distance metrics between point sets Given two sets of points, measure their discrepancy

Common distance metrics Worst case: Hausdorff distance (HD) Average case: Chamfer distance (CD) Optimal case: Earth Mover’s distance (EMD)

Common distance metrics Worst case: Hausdorff distance (HD) d HD( S 1 , S 2 ) = max { max x i ∈ S 1 min y j ∈ S 2 k x i � y j k , max y j ∈ S 2 min x i ∈ S 1 k x i � y j k } A single farthest pair determines the distance. In other words, not robust to outliers!

Common distance metrics Worst case: Hausdorff distance (HD) Average case: Chamfer distance (CD) Average all the nearest neighbor distance by nearest neighbors

Common distance metrics Worst case: Hausdorff distance (HD) Average case: Chamfer distance (CD) Optimal case: Earth Mover’s distance (EMD) Solves the optimal transportation (bipartite matching) problem!

Required properties of distance metrics Geometric requirement • Induces a nice shape space • In other words, a good metric should reflect the natural shape differences Computational requirement • Defines a loss that is numerically easy to optimize

How distance metric affects the learned geometry? A fundamental issue: there is always uncertainty in prediction By loss minimization, the network tends to predict a “ mean shape ” that averages out uncertainty in geometry

How distance metric affects the learned geometry? A fundamental issue: there is always uncertainty in prediction, due to • limited network ability • Insufficient training data • inherent ambiguity of groundtruth for 2D-3D dimension lifting • etc. By loss minimization, the network tends to predict a “ mean shape ” that averages out uncertainty in geometry

3D Deep Learning on Geometric Forms Hao Su Many 3D representations - PowerPoint PPT Presentation

3D Deep Learning on Geometric Forms Hao Su Many 3D representations are available Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models 3D representation Candidates: multi-view images

Forms, CGI Objectives DD1335 (Lecture 5) Basic Internet Programming Spring 2010 1 / 19 Forms,

Forms of elliptic curves Wouter Castryck Forms of elliptic curves First definitions Well-known

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Forms, CGI Objectives The basics of HTML forms How form content is submitted GET, POST

CE419 Session 17: Forms Web Programming Forms <form> is the way that allows users to

Geometric Optimization Piotr Indyk April 26, 2005 Lecture 19: Geometric Optimization Geometric

Geometric Algebra A powerful tool for solving geometric problems in visual computing Leandro A.

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

3D Deep Learning on Geometric Forms Hao Su Many 3D representations are available Candidates:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

Squeezing down the computing Edit Master text styles Second level requirements of deep neural

Stereo CSE 576 Ali Farhadi Several slides from

Depth Perception in Grasshopper -Shashank Chepurwar -Ritvik Srivastava Grasshopper -Agile

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Recent Trends in 3D Computer Vision and Deep Learning Introductory meeting Winter Semester

Initial word... Robogames Initial word... Robogames Initial word... Robogames The "CS

Results from the Telescope Array Experiment Charlie Jui University of Utah TeVPA 2013 Irvine,

Ultra-High Energy Cosmic Ray Observations Karl-Heinz Kampert, University of Wuppertal e-mail:

3D Deep Learning on Geometric Forms Hao Su Many 3D representations - PowerPoint PPT Presentation

3D Deep Learning on Geometric Forms Hao Su Many 3D representations are available Candidates: multi-view images depth map volumetric polygonal mesh point cloud primitive-based CAD models 3D representation Candidates: multi-view images

Forms, CGI Objectives DD1335 (Lecture 5) Basic Internet Programming Spring 2010 1 / 19 Forms,

Forms of elliptic curves Wouter Castryck Forms of elliptic curves First definitions Well-known

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Forms, CGI Objectives The basics of HTML forms How form content is submitted GET, POST

CE419 Session 17: Forms Web Programming Forms &lt;form&gt; is the way that allows users to

Geometric Optimization Piotr Indyk April 26, 2005 Lecture 19: Geometric Optimization Geometric

Geometric Algebra A powerful tool for solving geometric problems in visual computing Leandro A.

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

3D Deep Learning on Geometric Forms Hao Su Many 3D representations are available Candidates:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

Squeezing down the computing Edit Master text styles Second level requirements of deep neural

Stereo CSE 576 Ali Farhadi Several slides from

Depth Perception in Grasshopper -Shashank Chepurwar -Ritvik Srivastava Grasshopper -Agile

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Recent Trends in 3D Computer Vision and Deep Learning Introductory meeting Winter Semester

Initial word... Robogames Initial word... Robogames Initial word... Robogames The &quot;CS

Results from the Telescope Array Experiment Charlie Jui University of Utah TeVPA 2013 Irvine,

Ultra-High Energy Cosmic Ray Observations Karl-Heinz Kampert, University of Wuppertal e-mail:

CE419 Session 17: Forms Web Programming Forms <form> is the way that allows users to

Initial word... Robogames Initial word... Robogames Initial word... Robogames The "CS