Object Detection and Tracking in 3D World Xinshuo Weng
3D Object Detection
Goal
● Inputs:
  ○ LiDAR point cloud
  ○ Monocular images
  ○ Stereo images (left / right)
  ○ Or a fusion of these
● Outputs:
  ○ Eight corners
  ○ Four corners + height
  ○ Size (l, w, h) + center (x, y, z) + heading (θ)
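The three output parameterizations are interchangeable; for instance, the eight corners can be recovered from the size + center + heading form. A minimal sketch, assuming a z-up frame with the heading as a rotation about the vertical axis (conventions vary; e.g., KITTI uses a y-down camera frame):

```python
import numpy as np

def box3d_corners(center, size, heading):
    """Convert (center, size, heading) to the eight box corners.
    Assumes a z-up frame; heading rotates about the vertical axis."""
    x, y, z = center
    l, w, h = size
    c, s = np.cos(heading), np.sin(heading)
    R = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    dx, dy, dz = l / 2, w / 2, h / 2
    # local corner offsets: all +/- half-size combinations
    local = np.array([[sx * dx, sy * dy, sz * dz]
                      for sx in (-1, 1) for sy in (-1, 1) for sz in (-1, 1)])
    return local @ R.T + np.array([x, y, z])  # shape (8, 3)
```

Going the other way (corners to parameters) is also possible, which is why detectors are free to regress whichever form trains best.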
3D Object Detection from LiDAR Point Cloud Shi et al, “PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud”, CVPR, 2019.
3D Object Detection from Monocular Images
Goal: estimate the 7 DoF parameters
● Leverage the 2D-3D bounding box consistency constraint
  ○ The 2D box provides 4 constraints
  ○ Need at least another three (e.g., regressed size and orientation)
Mousavian et al, “3D Bounding Box Estimation Using Deep Learning and Geometry”, CVPR, 2017.
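The consistency constraint says the projection of the estimated 3D box should fit the detected 2D box tightly: each of the four 2D box sides must touch a projected 3D corner, giving four equations. An illustrative sketch (not the paper's exact formulation), assuming corners in camera coordinates and a pinhole intrinsics matrix K:

```python
import numpy as np

def project_corners(corners, K):
    """Project 3D corners (8, 3), given in camera coordinates,
    through pinhole intrinsics K (3, 3) into pixel coordinates."""
    uvw = corners @ K.T
    return uvw[:, :2] / uvw[:, 2:3]

def consistency_residual(corners, K, box2d):
    """Residual of the 2D-3D consistency constraint: the tight 2D box
    around the projected corners should match the detected 2D box
    (x_min, y_min, x_max, y_max) -- four equations in total."""
    uv = project_corners(corners, K)
    tight = np.array([uv[:, 0].min(), uv[:, 1].min(),
                      uv[:, 0].max(), uv[:, 1].max()])
    return tight - np.asarray(box2d)
```

A zero residual means the 3D hypothesis is consistent with the 2D detection; with only four equations, the remaining degrees of freedom must come from other cues.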
3D Object Detection from Stereo Images Li et al, “Stereo R-CNN based 3D Object Detection for Autonomous Driving”, CVPR, 2019.
3D Object Detection from Stereo Images
● 2D bounding box
● Center and heading (x, y, z, θ)
● Size (l, w, h)
Li et al, “Stereo R-CNN based 3D Object Detection for Autonomous Driving”, CVPR, 2019.
3D Object Detection from Stereo Images Matching loss Li et al, “Stereo R-CNN based 3D Object Detection for Autonomous Driving”, CVPR, 2019.
3D Object Detection from Images and LiDAR Qi et al, “Frustum PointNets for 3D Object Detection from RGB-D Data”, CVPR, 2018.
Our Recent Work on Monocular 3D Object Detection
● Accepted to the autonomous driving workshop at ICCV 2019
● Motivation: bridge the performance gap between LiDAR-based and camera-based 3D object detection
● KITTI dataset leaderboard: LiDAR-based 3D detection vs. monocular 3D detection
X. Weng and K. Kitani, “Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud”, ICCVW, 2019.
Our Recent Work on Monocular 3D Object Detection
Contributions:
● Pseudo-LiDAR framework
● Two observations:
  ○ Long tail – addressed by an instance mask proposal
  ○ Local misalignment – addressed by a bounding box consistency loss (BBCL) and optimization (BBCO)
X. Weng and K. Kitani, “Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud”, ICCVW, 2019.
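The pseudo-LiDAR idea: predict a depth map from the monocular image, then back-project every pixel into a 3D point through the pinhole camera model, so a LiDAR-style 3D detector can consume the result. A minimal back-projection sketch (the function name and interface are hypothetical):

```python
import numpy as np

def depth_to_pseudo_lidar(depth, fx, fy, cx, cy):
    """Back-project a predicted depth map (H, W) into an (H*W, 3)
    pseudo-LiDAR point cloud using pinhole intrinsics
    (focal lengths fx, fy and principal point cx, cy)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel grids
    z = depth
    x = (u - cx) * z / fx  # invert the pinhole projection per pixel
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)
```

Because every pixel becomes a point, depth errors translate directly into misplaced points, which is exactly where the long-tail and local-misalignment observations above come in.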
Our Recent Work on Monocular 3D Object Detection
● Inputs are monocular images only
● Currently 1st among monocular methods on both the KITTI 3D detection and bird’s eye view detection leaderboards
X. Weng and K. Kitani, “Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud”, ICCVW, 2019.
Our Recent Work on Monocular 3D Object Detection
[6] R. Urtasun et al. (University of Toronto). Monocular 3D Object Detection for Autonomous Driving. CVPR 2016.
[30] J. Kosecka (George Mason University). 3D Bounding Box Estimation Using Deep Learning and Geometry. CVPR 2017.
[58] Z. Chen et al. (Wuhan University). Multi-Level Fusion based 3D Object Detection from Monocular Images. CVPR 2018.
X. Weng and K. Kitani, “Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud”, ICCVW, 2019.
3D Multi-Object Tracking
Goal
● Inputs:
  ○ LiDAR point cloud
  ○ Monocular images
  ○ Stereo images, plus video
  ○ Or a fusion of these
● Outputs:
  ○ Eight corners
  ○ Four corners + height
  ○ Size + center + orientation
  ○ Identity – an association problem
Typical Multi-Object Tracking (MOT) Solver
● Tracking-by-detection pipeline
● Detector + appearance model + motion model + data association (e.g., Hungarian algorithm)
● Each component can be learned: a deep appearance network, a deep motion network, a deep association network
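The data-association step is a minimum-cost bipartite matching between existing tracks and new detections. The toy version below brute-forces the optimal assignment (the Hungarian algorithm, e.g. `scipy.optimize.linear_sum_assignment`, computes the same result in O(n³)); the cost could be 1 − IoU or an appearance distance, and the gating threshold is an assumption:

```python
import itertools

def min_cost_assignment(cost):
    """Brute-force optimal one-to-one assignment -- the same result the
    Hungarian algorithm computes efficiently; fine for toy-sized matrices."""
    n_trk, n_det = len(cost), len(cost[0])
    if n_trk <= n_det:
        cols = min(itertools.permutations(range(n_det), n_trk),
                   key=lambda cs: sum(cost[r][c] for r, c in enumerate(cs)))
        return list(enumerate(cols))
    # transpose and swap the pair order when tracks outnumber detections
    flipped = min_cost_assignment([list(col) for col in zip(*cost)])
    return sorted((r, c) for c, r in flipped)

def associate(cost, max_cost=0.7):
    """Match tracks (rows) to detections (cols); pairs costlier than
    max_cost (an assumed gating threshold) stay unmatched."""
    matches = [(r, c) for r, c in min_cost_assignment(cost)
               if cost[r][c] <= max_cost]
    matched_trk = {r for r, _ in matches}
    matched_det = {c for _, c in matches}
    unmatched_tracks = [r for r in range(len(cost)) if r not in matched_trk]
    unmatched_dets = [c for c in range(len(cost[0])) if c not in matched_det]
    return matches, unmatched_tracks, unmatched_dets
```

Unmatched detections typically spawn new tracks and unmatched tracks are aged out, which is the glue between the detector and the motion model in the pipeline above.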
3D MOT from LiDAR Point Cloud Luo et al, “Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net”, CVPR, 2018.
3D MOT from LiDAR Point Cloud SimNet AssocNet Baser et al, “FANTrack: 3D Multi-Object Tracking with Feature Association Network”, arXiv, 2019.
3D MOT from LiDAR Point Cloud Frossard et al, “End-to-end Learning of Multi-sensor 3D Tracking by Detection”, ICRA, 2018.
Our Recent Work on 3D Multi-Object Tracking
Tracking by detection:
● Detection: a state-of-the-art 3D object detector – PointRCNN
● Tracking: Kalman filter with a 3D constant-velocity model + Hungarian algorithm; no appearance model
X. Weng and K. Kitani, “Simple Baseline and New Evaluation Tool for 3D Multi-Object Tracking”, arXiv, 2019.
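A constant-velocity Kalman filter over a 3D box might look as follows. This is a hedged sketch: the 10-dimensional state layout [x, y, z, l, w, h, θ, vx, vy, vz] and the noise covariances are assumptions for illustration, not necessarily the paper's exact design.

```python
import numpy as np

class KalmanBox3D:
    """Constant-velocity Kalman filter over a 3D box state
    [x, y, z, l, w, h, theta, vx, vy, vz] (layout assumed).
    Only the center moves; size and heading follow a random walk."""
    def __init__(self, box):
        self.x = np.zeros(10)
        self.x[:7] = box
        self.P = np.eye(10)
        self.F = np.eye(10)
        self.F[0, 7] = self.F[1, 8] = self.F[2, 9] = 1.0  # x += vx, etc.
        self.H = np.eye(7, 10)       # we observe the 7 box parameters
        self.Q = np.eye(10) * 1e-2   # process noise (assumed)
        self.R = np.eye(7) * 1e-1    # measurement noise (assumed)

    def predict(self):
        self.x = self.F @ self.x
        self.P = self.F @ self.P @ self.F.T + self.Q
        return self.x[:7]

    def update(self, z):
        y = np.asarray(z) - self.H @ self.x          # innovation
        S = self.H @ self.P @ self.H.T + self.R      # innovation covariance
        K = self.P @ self.H.T @ np.linalg.inv(S)     # Kalman gain
        self.x = self.x + K @ y
        self.P = (np.eye(10) - K @ self.H) @ self.P
```

Per frame, every track is predicted forward, predictions are associated to detections with the Hungarian algorithm, and matched tracks are updated with their detection; no appearance features are needed.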
Our Recent Work on 3D Multi-Object Tracking
● Inputs are LiDAR point clouds only
● Currently 1st on the KITTI 3D tracking leaderboard and 2nd on the KITTI 2D tracking leaderboard among published works
X. Weng and K. Kitani, “Simple Baseline and New Evaluation Tool for 3D Multi-Object Tracking”, arXiv, 2019.
Our Recent Work on 3D Multi-Object Tracking
2D tracking results on the KITTI test set; 3D tracking results on the KITTI validation set
[1] R. Urtasun et al. (University of Toronto). End-to-End Learning of Multi-Sensor 3D Tracking by Detection. ICRA 2018.
[2] K. Czarnecki et al. (University of Waterloo). FANTrack: 3D Multi-Object Tracking with Feature Association Network. arXiv 2019.
[3] K. Granstrom et al. (Chalmers University of Technology). Mono-Camera 3D Multi-Object Tracking Using Deep Learning Detections and PMBM Filtering. ITSC 2018.
[5] K. Madhava Krishna et al. (IIIT Hyderabad, India). Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking. ICRA 2018.
X. Weng and K. Kitani, “Simple Baseline and New Evaluation Tool for 3D Multi-Object Tracking”, arXiv, 2019.
Takeaway Message
● With proper use, a conceptually simple idea can achieve an unprecedented performance improvement in practice