Deep Watershed Transform for Instance Segmentation Min Bai & - PowerPoint PPT Presentation

Deep Watershed Transform for Instance Segmentation Min Bai & Raquel Urtasun To appear at IEEE CVPR 2017 in Hawaii Presented at NVIDIA GTC 2017

Semantic Segmentation ● Input: RGB Image ● Output at each pixel: ○ Semantic label

Instance Segmentation ● Input: RGB Image ● Output at each pixel: ○ Semantic label ○ Instance label ■ Same for each px in object ■ Different among objects ○ Difficulty: How to phrase the problem?

Applications ● Object tracking Image credit: Davi Frossard

Applications ● Interacting with the environment Image credit: http://www.rethinkrobotics.com/build-a-bot/

Applications ● Useful information for other algorithms such as optical flow, etc Image credit: Shenlong Wang

Semantic Segmentation ● Semantic segmentation is a well studied problem ○ Our instance segmentation method leverages an existing technique ○ H. Zhao et al, Pyramid Scene Parsing Network , https://arxiv.org/abs/1612.01105 Image credit: H. Zhao et al.

Watershed Transform ● Classical image segmentation technique Image (left) credit: Adrian Fisher

Scalar Field and Gradient ● Scalar field: single number at each pixel ● Gradient: vector at each pixel, pointing toward direction of greatest ascent Image source: Wikipedia: byVivekj78 - Own work, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=15346899

Overview of Approach Input Image Gradient of Energy Landscape Energy Landscape Predicted Instances Semantic Segmentation

Why Predict Direction First? Input Image Direction of Gradient Energy Landscape Much sharper difference in the direction label at the boundary!

Overall Network

Direction Prediction Network Input Image Ground Truth Directions Predicted Directions Semantic Segmentation

Energy Prediction Network Ground Truth Energy Ground Truth Instances Predicted Energy Predicted Instances

Training and Inference ● Pre-train both networks ● End-to-end fine-tuning ● Network trained on NVIDIA DGX-1 ○ Approximately 25 hours total for training on one GP100 core ○ ~0.1s per image for forward pass ○ Thank you NVIDIA for the generous gift! Image source: www.nvidia.com

Cityscapes Dataset ● 2975 training / 500 validation / 1525 testing images ● Instances: car, truck, bus, train, person, rider, motorcycle, bicycle

Cityscapes Instance Segmentation Leaderboard AP* AP* @ 50% AP* @ 50m AP* @ 100m van den Brand et al. 2.3% 3.7% 3.9% 4.9% Cordts et al. 4.6% 12.9% 7.7% 10.3% Uhrig et al. 8.9% 21.1% 15.3% 16.7% Ours 19.4% 35.3% 31.4% 36.8% * Average Precision (AP): higher is better Recently, new approaches have achieved even higher performance.

Sample Output Input RGB Direction Prediction Energy Prediction Semantic Segmentation Predicted Instances Ground Truth Instances

Preliminary TorontoCity Aerial Instance Segmentation Semantic Segmentation (ResNet) Input RGB Predicted Building Instances

Preliminary TorontoCity Aerial Instance Segmentation Weighted AP* Recall* @ Precision* @ Coverage* 50% 50% FCN-8 41.92% 11.37% 21.50% 36.00% ResNet-56 40.65% 12.13% 18.90% 45.36% Ours 56.22% 21.22% 67.16% 63.67% * higher is better

In Summary... ● Simple technique for instance segmentation ● Encodes object instances as energy map ● Predicts gradient direction as intermediate task for better supervision

Deep Watershed Transform for Instance Segmentation Min Bai & - PowerPoint PPT Presentation

Deep Watershed Transform for Instance Segmentation Min Bai & Raquel Urtasun To appear at IEEE CVPR 2017 in Hawaii Presented at NVIDIA GTC 2017 Semantic Segmentation Input: RGB Image Output at each pixel: Semantic label

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador,

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Video Segmentation for Video Segmentation for Surveillance Surveillance -- A Transform Domain

Newfound Lake Region Association Protecting the Watershed Lakes Management Advisory Council July

Whats a Watershed Protection Plan Process? Linda Shead, Watershed Coordinator What is a

Whats a Watershed Protection Plan Process? Linda Shead, Watershed Coordinator A Watershed

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds B. Yang, J. Wang,

Ultrasonic weather station and their application for atmospheric processes monitoring A.Ya.

C ENTER FOR A DVANCED S TUDIES W HEELER H IGH S CHOOL STEM Certified & STEAM Inspired The

"The Mis-education of Softw are Testers: Rethinking and Relearning Softw are Quality"

GAUTENG PROVINCIAL COMMAND COUNCIL Weekly Media Update on COVID-19 7 August 2020 Gauteng

Detecting Botnets with NetFlow V. Krmek, T. Plesnk {vojtec|plesnik}@ics.muni.cz FloCon

Linux IoT Botnet Wars and the Lack of Security Hardening Drew Moseley Solutions Architect

Reinsurance Reserving: Top-Down versus Bottom-Up Casualty Loss Reserve Seminar September 15,

Jobenomics deals with the process of creating and mass- Jobenomics deals with the process of

Deep Watershed Transform for Instance Segmentation Min Bai & - PowerPoint PPT Presentation

Deep Watershed Transform for Instance Segmentation Min Bai & Raquel Urtasun To appear at IEEE CVPR 2017 in Hawaii Presented at NVIDIA GTC 2017 Semantic Segmentation Input: RGB Image Output at each pixel: Semantic label

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador,

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Video Segmentation for Video Segmentation for Surveillance Surveillance -- A Transform Domain

Newfound Lake Region Association Protecting the Watershed Lakes Management Advisory Council July

Whats a Watershed Protection Plan Process? Linda Shead, Watershed Coordinator What is a

Whats a Watershed Protection Plan Process? Linda Shead, Watershed Coordinator A Watershed

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds B. Yang, J. Wang,

Ultrasonic weather station and their application for atmospheric processes monitoring A.Ya.

C ENTER FOR A DVANCED S TUDIES W HEELER H IGH S CHOOL STEM Certified &amp; STEAM Inspired The

&quot;The Mis-education of Softw are Testers: Rethinking and Relearning Softw are Quality&quot;

GAUTENG PROVINCIAL COMMAND COUNCIL Weekly Media Update on COVID-19 7 August 2020 Gauteng

Detecting Botnets with NetFlow V. Krmek, T. Plesnk {vojtec|plesnik}@ics.muni.cz FloCon

Linux IoT Botnet Wars and the Lack of Security Hardening Drew Moseley Solutions Architect

Reinsurance Reserving: Top-Down versus Bottom-Up Casualty Loss Reserve Seminar September 15,

Jobenomics deals with the process of creating and mass- Jobenomics deals with the process of

C ENTER FOR A DVANCED S TUDIES W HEELER H IGH S CHOOL STEM Certified & STEAM Inspired The

"The Mis-education of Softw are Testers: Rethinking and Relearning Softw are Quality"