Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi - PowerPoint PPT Presentation

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( 祁芮中台 ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019.

3D object detection Estimate oriented 3D bounding boxes and semantic classes from sensor data. 2

Prior work relies on 2D object detection Frustum-based detector Bird’s eye view detector [MV3D by Chen et al. CVPR 2017] [F-PointNet by Qi et al. CVPR 2018] 3

Prior work relies on 2D object detection 3D CNN detector [Deep Sliding Shapes by Song et al. CVPR 2016] 4

Observation: 2D v.s. 3D 5

Our idea: “ask” the points to vote for object centers Voting from surface points Detected 3D bounding boxes 6

Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance From U. Toronto CSC420

Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks From U. Toronto CSC420

Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks - Find patches that voted for the peaks by back-projection From U. Toronto CSC420

Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks - Find patches that voted for the peaks by back-projection - Find full objects based on back-projected patches From U. Toronto CSC420

Hough voting detector recap + Computation is only on “interest” points instead of on all pixels/voxels. + Support “templates” (used in 6DoF pose estimation) - Not end-to-end optimizable From U. Toronto CSC420

3D object proposal: A return of hough voting! Deep hough voting with PointNet++ Interest points → seed points sampled from the point clouds Votes → learned mapping from point features to votes Clustering → local pointnet layers to group and aggregate local votes Object recovery → learned bounding box predictor End-to-end optimizable!

Deep Hough voting: Detection pipeline PointNet++ 21

Deep Hough voting: Detection pipeline 22

Results: SUN RGB-D (single depth images) 23

Results: ScanNet (3D reconstructions) 24

Comparing with previous methods SUN RGB-D: +3.7mAP with just 3D geometry data as input. 25

Comparing with previous methods ScanNet: +18.3 mAP compared with prior art (3D CNN based method) with 3D & multi-view images. 26

Can images help the VoteNet detection? Images are in high resolution, have rich texture, and can even provide useful geometric cues for object localization & shape/pose estimation. 27

ImVoteNet : Boosting 3D Object Detection in Point Clouds with Image Votes On-going work with Xinlei Chen, Or Litany and Leonidas Guibas 28

ImVoteNet detection pipeline 29

Geometric cues from images: Lifted image votes 32

Results on SUN RGB-D 63.4 57.7 +5.7mAP with lifted image cues for voting 34

Results on SUN RGB-D 36

Summary VoteNet : a revival of Hough voting with 3D deep learning. ● End-to-end optimizable hough voting with point cloud deep nets. ● A new detection model with a simple design shows state-of-the-art results on SUN RGB-D and ScanNet with geometry data only. Code: https://github.com/facebookresearch/votenet ImVoteNet : boosting 3D detection with lifted image votes. Many open possibilities to extend the pipeline (e.g. 6D pose estimation, template based detection). 40

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi - PowerPoint PPT Presentation

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019. 3D object detection Estimate oriented 3D bounding boxes and

Generalized Hough Transform 16-385 Computer Vision Hough Circles Finding Circles by Hough

Deep Hough Voting for 3D Object Detection in Point Clouds Charles R. Qi, Or Litany, Kaiming He,

Clouds A B Clouds A Eastern 2/3 of the U.S. Clouds Clouds on Mars are made of _____ . A.

When you look up into the sky, you will often see clouds. No two clouds are the same, and there

2 Microstructures of Warm Clouds Clouds that lie completely below the 0 C isotherm, referred to

Detection, Segmentation Overview Object Detection deer cat Object Detection as Classification

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

The Multi-IFO Hough Search using LIGO S4 Data Badri Krishnan (For the LIGO Scientific

CS 4495 Computer Vision Finding 2D Shapes and the Hough Transform Aaron Bobick School of

Michigan Votes in 2020 Voter registration, absentee voting, and Election Day New Voting Laws New

Electronic Voting Electronic voting at a precinct Analysis of an Internet Voting Focus

Perception with Point Clouds Robert Platt Northeastern University Topics depth sensors

Hough Transform 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University motivation

6 Artificial Modification of Clouds The microstructures of clouds are influenced by the concen-

4. Droplet Growth in Warm Clouds In warm clouds, droplets can grow by condensation in a

Session 3: Hydrology & Clouds 3:00- 5:30 PM Session 3: Hydrology & Clouds 3:00- 5:30 PM

Opening up accessible design Karwai Pun Interaction designer Home Office Digital, Data and

3.1 Introduction Remanufacturing, Refurbishing and Re-use From TU Delft From TU Delft From

A Futurist approach to dynamic environments Jano van Hemert, Leiden University, The

Today In Class Rik Eberhardt Today In CMS.611 / 6.037 & SIT IN YOUR TEAMS Working in

All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An

Understanding the propagation of hard errors to software and implications for resilient system

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi - PowerPoint PPT Presentation

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019. 3D object detection Estimate oriented 3D bounding boxes and

Generalized Hough Transform 16-385 Computer Vision Hough Circles Finding Circles by Hough

Deep Hough Voting for 3D Object Detection in Point Clouds Charles R. Qi, Or Litany, Kaiming He,

Clouds A B Clouds A Eastern 2/3 of the U.S. Clouds Clouds on Mars are made of _____ . A.

When you look up into the sky, you will often see clouds. No two clouds are the same, and there

2 Microstructures of Warm Clouds Clouds that lie completely below the 0 C isotherm, referred to

Detection, Segmentation Overview Object Detection deer cat Object Detection as Classification

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

The Multi-IFO Hough Search using LIGO S4 Data Badri Krishnan (For the LIGO Scientific

CS 4495 Computer Vision Finding 2D Shapes and the Hough Transform Aaron Bobick School of

Michigan Votes in 2020 Voter registration, absentee voting, and Election Day New Voting Laws New

Electronic Voting Electronic voting at a precinct Analysis of an Internet Voting Focus

Perception with Point Clouds Robert Platt Northeastern University Topics depth sensors

Hough Transform 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University motivation

6 Artificial Modification of Clouds The microstructures of clouds are influenced by the concen-

4. Droplet Growth in Warm Clouds In warm clouds, droplets can grow by condensation in a

Session 3: Hydrology &amp; Clouds 3:00- 5:30 PM Session 3: Hydrology &amp; Clouds 3:00- 5:30 PM

Opening up accessible design Karwai Pun Interaction designer Home Office Digital, Data and

3.1 Introduction Remanufacturing, Refurbishing and Re-use From TU Delft From TU Delft From

A Futurist approach to dynamic environments Jano van Hemert, Leiden University, The

Today In Class Rik Eberhardt Today In CMS.611 / 6.037 &amp; SIT IN YOUR TEAMS Working in

All Saints Day Welcome Home Gathering Song For All the Saints For all the saints Who from their

Healthy New YOU! St. Pius X Church welcomes you, to renew all things in Christ Week 3

CSCI 5832 Natural Language Processing Lecture 1 Jim Martin 1/18/08 1 Today 1/15 An

Understanding the propagation of hard errors to software and implications for resilient system

Session 3: Hydrology & Clouds 3:00- 5:30 PM Session 3: Hydrology & Clouds 3:00- 5:30 PM

Today In Class Rik Eberhardt Today In CMS.611 / 6.037 & SIT IN YOUR TEAMS Working in