deep hough voting for 3d object detection in point clouds
play

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi - PowerPoint PPT Presentation

Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019. 3D object detection Estimate oriented 3D bounding boxes and


  1. Deep Hough Voting for 3D Object Detection in Point Clouds Charles Qi ( 祁芮中台 ) GAMES Webinar December 5th, 2019 Joint work with Or Litany, Kaiming He, Leonidas Guibas. ICCV 2019.

  2. 3D object detection Estimate oriented 3D bounding boxes and semantic classes from sensor data. 2

  3. Prior work relies on 2D object detection Frustum-based detector Bird’s eye view detector [MV3D by Chen et al. CVPR 2017] [F-PointNet by Qi et al. CVPR 2018] 3

  4. Prior work relies on 2D object detection 3D CNN detector [Deep Sliding Shapes by Song et al. CVPR 2016] 4

  5. Observation: 2D v.s. 3D 5

  6. Our idea: “ask” the points to vote for object centers Voting from surface points Detected 3D bounding boxes 6

  7. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance From U. Toronto CSC420

  8. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance From U. Toronto CSC420

  9. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance From U. Toronto CSC420

  10. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance From U. Toronto CSC420

  11. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks From U. Toronto CSC420

  12. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks - Find patches that voted for the peaks by back-projection From U. Toronto CSC420

  13. Hough voting detector recap Hough voting pipeline (on 2D images): - Select interest points - Match patch around each interest point to a training patch (codebook) - Vote for object center given that training instance - Votes clustering to find peaks - Find patches that voted for the peaks by back-projection - Find full objects based on back-projected patches From U. Toronto CSC420

  14. Hough voting detector recap + Computation is only on “interest” points instead of on all pixels/voxels. + Support “templates” (used in 6DoF pose estimation) - Not end-to-end optimizable From U. Toronto CSC420

  15. 3D object proposal: A return of hough voting! Deep hough voting with PointNet++ Interest points → seed points sampled from the point clouds Votes → learned mapping from point features to votes Clustering → local pointnet layers to group and aggregate local votes Object recovery → learned bounding box predictor End-to-end optimizable!

  16. Deep Hough voting: Detection pipeline PointNet++ 21

  17. Deep Hough voting: Detection pipeline 22

  18. Results: SUN RGB-D (single depth images) 23

  19. Results: ScanNet (3D reconstructions) 24

  20. Comparing with previous methods SUN RGB-D: +3.7mAP with just 3D geometry data as input. 25

  21. Comparing with previous methods ScanNet: +18.3 mAP compared with prior art (3D CNN based method) with 3D & multi-view images. 26

  22. Can images help the VoteNet detection? Images are in high resolution, have rich texture, and can even provide useful geometric cues for object localization & shape/pose estimation. 27

  23. ImVoteNet : Boosting 3D Object Detection in Point Clouds with Image Votes On-going work with Xinlei Chen, Or Litany and Leonidas Guibas 28

  24. ImVoteNet detection pipeline 29

  25. ImVoteNet detection pipeline 30

  26. ImVoteNet detection pipeline 31

  27. Geometric cues from images: Lifted image votes 32

  28. ImVoteNet detection pipeline 33

  29. Results on SUN RGB-D 63.4 57.7 +5.7mAP with lifted image cues for voting 34

  30. Results on SUN RGB-D 36

  31. Summary VoteNet : a revival of Hough voting with 3D deep learning. ● End-to-end optimizable hough voting with point cloud deep nets. ● A new detection model with a simple design shows state-of-the-art results on SUN RGB-D and ScanNet with geometry data only. Code: https://github.com/facebookresearch/votenet ImVoteNet : boosting 3D detection with lifted image votes. Many open possibilities to extend the pipeline (e.g. 6D pose estimation, template based detection). 40

Recommend


More recommend