Volumetric and Multi-View CNNs for Object Classification on 3D Data

Jun 28, 2023

Volumetric and Multi-View CNNs for Object Classification on 3D Data
Charles R. Qi*, Hao Su*, Matthias Nießner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas
Rich Applications of 3D: Augmented Reality, Robot Perception
3D Representations for Generic Object Classification
Volumetric:
• 3DShapeNets by Z. Wu et al., CVPR 15
• VoxNet by D. Maturana et al., IEEE/RSJ 15
Multi-Views:
• MVCNN by H. Su et al., ICCV 15
• DeepPano by B. Shi et al., IEEE/SPL 15
Volumetric CNNs Revisited: 3DShapeNets by Z. Wu et al., CVPR 15
Multi-View CNNs Revisited: MVCNN by H. Su et al., ICCV 15
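The slide only names MVCNN, so here is a minimal sketch of the view-pooling idea behind it, assuming each rendered view has already been turned into a feature vector by a shared image CNN. MVCNN aggregates views with an element-wise max; the helper name `view_pool` and the toy feature values are hypothetical and chosen only for illustration.

```python
from typing import List


def view_pool(view_features: List[List[float]]) -> List[float]:
    """Element-wise max over per-view feature vectors (hypothetical helper)."""
    if not view_features:
        raise ValueError("need at least one view")
    dim = len(view_features[0])
    if any(len(f) != dim for f in view_features):
        raise ValueError("all views must have the same feature dimension")
    # For every feature index, keep the strongest response across all views.
    return [max(f[i] for f in view_features) for i in range(dim)]


# Three toy views of one shape, each described by a 3-dimensional feature.
views = [
    [0.1, 0.9, 0.0],
    [0.4, 0.2, 0.3],
    [0.0, 0.5, 0.7],
]
pooled = view_pool(views)
print(pooled)
```

The pooled vector does not depend on the order of the views, which is why a single descriptor can stand in for the whole set of renderings.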
Shape Classification Results Revisited
• 3DShapeNets (Wu et al.): 77.3%
• MVCNN (Su et al.): 90.1%
Big gap between volumetric and multi-view based methods. Why?
Cause 1: Architecture and Engineering
• LeNet, 1998
• 3DShapeNets, 2015
• AlexNet, 2012
Cause 2: Resolution
• Multi-View CNNs (MVCNN, Su et al.): 224x224 images
• Volumetric CNNs (3DShapeNets, Wu et al.): 30x30x30 volumes
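To make the resolution gap concrete, the arithmetic below counts input elements for one 30x30x30 volume and one 224x224 image. The variable names are illustrative only, and the count ignores color channels and the number of views per shape, so it is a rough comparison rather than a measure of information content.

```python
# Number of cells in one occupancy grid and one rendered image.
voxels_per_volume = 30 ** 3        # 30 x 30 x 30
pixels_per_image = 224 * 224       # 224 x 224

# Samples along a single axis: how finely each input resolves the shape.
samples_per_axis_ratio = 224 / 30

print(voxels_per_volume)
print(pixels_per_image)
print(round(samples_per_axis_ratio, 2))
```

A single image therefore has more cells than the entire volume, and it samples each axis roughly seven times more finely.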
Diagnosis of Causes: Variable Control
• Same resolution, study architectures
• Same architecture, look into resolutions
Sphere Rendering: Same “3D Resolution”
Polygon Mesh -> Occupancy Grid (30x30x30) -> Sphere Rendering -> Image (224x224)
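The first step of this pipeline, turning geometry into a 30x30x30 occupancy grid, can be sketched as follows. This is an assumption-laden illustration, not the authors' code: it voxelizes a list of 3D points (for example, points sampled from a mesh surface) instead of a full polygon mesh, and the function name `voxelize` is hypothetical. The rendering of occupied voxels as spheres is not shown.

```python
from typing import Iterable, Set, Tuple

Point = Tuple[float, float, float]
Voxel = Tuple[int, int, int]


def voxelize(points: Iterable[Point], resolution: int = 30) -> Set[Voxel]:
    """Return the set of occupied voxel indices in a resolution^3 grid."""
    pts = list(points)
    if not pts:
        return set()
    mins = [min(p[a] for p in pts) for a in range(3)]
    maxs = [max(p[a] for p in pts) for a in range(3)]
    # One scale for all axes so the shape is not stretched; avoid dividing by 0.
    extent = max(maxs[a] - mins[a] for a in range(3)) or 1.0
    occupied: Set[Voxel] = set()
    for p in pts:
        idx = tuple(
            # Clamp so points on the upper boundary fall in the last voxel.
            min(resolution - 1, int((p[a] - mins[a]) / extent * resolution))
            for a in range(3)
        )
        occupied.add(idx)
    return occupied


# Toy input: the eight corners of a unit cube.
corners = [(x, y, z) for x in (0.0, 1.0) for y in (0.0, 1.0) for z in (0.0, 1.0)]
grid = voxelize(corners, resolution=30)
print(len(grid))
```

Because both the volumetric CNN and the sphere-rendered images start from the same grid, any detail finer than one voxel is lost for both, which is what makes the comparison a controlled one.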
Investigation into Architecture: Different Architecture, Same 3D Resolution (30x30x30)
• Multi-View Image CNN: Sphere Rendering Images
• 3D CNN: Occupancy Grid Volumes
CNNs with Same 3D Resolution Inputs
[Bar chart: Shape Classification Accuracy, axis 72 to 88; bars: MVCNN with Sphere Rendering Images, 3DShapeNets (Wu et al.)]
Novel 3D CNN Architectures: 3D NIN with Subvolume Supervision
Push Harder for Learning Better!
Novel 3D CNN Architectures: Anisotropic Probing Network
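The slide gives only the name of this architecture. One way to read "anisotropic probing" is a kernel that is elongated along a single axis and collapses the volume into a 2D map, which an image-style CNN could then process. The sketch below illustrates that reading with a fixed weighted sum along the z axis; the function name `probe_along_z`, the fixed weights, and the toy volume are all assumptions, and a real network would learn the weights and stack several such layers.

```python
from typing import List, Sequence

Volume = List[List[List[float]]]  # indexed as volume[x][y][z]


def probe_along_z(volume: Volume, weights: Sequence[float]) -> List[List[float]]:
    """Collapse each z-column to one value per (x, y) with a weighted sum."""
    depth = len(weights)
    image: List[List[float]] = []
    for plane in volume:
        row: List[float] = []
        for column in plane:
            if len(column) != depth:
                raise ValueError("weights must match the volume depth")
            row.append(sum(w * v for w, v in zip(weights, column)))
        image.append(row)
    return image


# A 2x2x3 toy occupancy grid (1.0 = occupied, 0.0 = empty).
volume = [
    [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]],
    [[1.0, 1.0, 1.0], [0.0, 0.0, 1.0]],
]
# Uniform weights simply count the occupied voxels along each column.
weights = [1.0, 1.0, 1.0]
image = probe_along_z(volume, weights)
print(image)
```

With uniform weights the result resembles an X-ray of the shape; non-uniform weights would let the projection emphasize particular depths.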
Results of Our Novel 3D CNNs: Closed the Gap under Same 3D Resolution
[Bar chart: Shape Classification Accuracy, axis 72 to 88; bars: MVCNN with Sphere Rendering Images, 3DShapeNets (Wu et al.), Ours 3D CNN]
Investigation into Resolution: Same Architecture, Different 3D Resolution
• Multi-View Image CNN: Standard Rendering Images
• Multi-View Image CNN: Sphere Rendering Images
• 3D CNN: 30x30x30 Volume
Performance Trend wrt 3D Resolution
[Line plot: Accuracy (%), axis 82 to 94, versus 3D Resolution, axis 0 to 250; curves: MVCNN-Sphere, Our 3D CNN]
Generalization to Real Scans: Shape retrieval on scan data
Real Scan Dataset: 243 objects, 12 categories
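The slide does not say how retrieval is performed. A common setup, assumed here purely for illustration, is to describe every shape by a feature vector taken from a trained classifier and rank database shapes by cosine similarity to the query. The function names, shape names, and feature values below are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query: Sequence[float],
             database: Dict[str, Sequence[float]],
             top_k: int = 3) -> List[str]:
    """Return the names of the top_k database shapes most similar to the query."""
    ranked = sorted(database, key=lambda name: cosine(query, database[name]),
                    reverse=True)
    return ranked[:top_k]


# Toy database of 2-dimensional features (real features would be much larger).
database = {
    "chair_01": [1.0, 0.0],
    "table_01": [0.0, 1.0],
    "chair_02": [0.9, 0.1],
}
results = retrieve([1.0, 0.0], database, top_k=2)
print(results)
```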
Code and Data Available Online! http://graphics.stanford.edu/projects/3dcnn/
Welcome to Our Poster #38!
