Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao - PowerPoint PPT Presentation

Weakly Supervised Object Detection, Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao Zhou, Qixian ang Ye Ye www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye

Problem Supervised object ect detection ection and inst stance ance segme egmentat ation ion pipeline Human Machine Object box Classes, annotation learning and/or mask Boxes, Masks,

Problem Supervised object ect detection ection and inst stance ance segme egmentat ation ion pipeline Human Machine Object box Classes, annotation learning and/or mask Boxes, Masks, Bounding box annotation Mask annotation

Problem 5min/I 100/c 19 years 20k+ 2M /perso rson class instances Copy from UIUC Yunchao’s Slides

Problem Imagery Training databases sets

Solutions Imagery Training databases sets Data annotation is exp xpensive ensive

Solutions Imagery Training databases sets Weakly Weakly supervi ervised ed supervi ervised ed data a annotat otation on learni ning ng Data annotation is efficie cient nt and d low-cos cost

Solutions Weakly Supervised Annotations Scribes bes Copy from UIUC Yunchao’s Slides

Solutions Weakly Supervised Annotations Scribes bes Point nt Copy from UIUC Yunchao’s Slides

Solutions Weakly Supervised Annotations Person on, Sheep, Dog Scribes bes Point nt Image-lev evel el label els The most t efficient ent one Copy from UIUC Yunchao’s Slides

Solutions Weakly labeled imagery is widely available on the Web

Solutions Imagery Training databases sets Weakly Weakly supervi ervised ed supervi ervised ed data a annotat otation on learni ning ng Data annotation is efficie cient nt and d low-cos cost

Our works MELM+ LM+Recu ecurren ent Learni arning ng CMIL : : Continuati tion Multiple Insta tance Learning MELM LM CVPR19 (WSOD) PAMI20 2019 19: Recurr urren ent Learni ning ng (WSO WSOD) CVPR18: 8: Min-en entrop opy y Laten ent Mode del (WSOD) SPN SPN PRM PRM IAM ICCV17 17: Soft Propo posal sal Network ork (WSOL) CVPR18 CVPR 18: Peak Respo ponse nse Mapp pping ng (WSIS) CVPR CVPR19 19: Inst stan ance ce Activat vation on Map (WSIS)

Our works- Challenge analysis Latent variable h Latent variable learning Multiple Instance learning Latent variable Latent 𝑀 Ɵ = 1 variable 2 ∥ Ɵ ∥ 2 +𝜇 max(0, 1 − 𝑧 𝑗 𝑔(𝑦 𝑗 , ℎ 𝑗 )) 𝑗 𝑔(𝑦, ℎ) = max Ɵ ∙ Φ(𝑦, ℎ) ℎ

Our works- Challenge analysis Local global 𝑀 Ɵ = 1 minimum 2 ∥ Ɵ ∥ 2 +𝜇 minimum max(0, 1 − 𝑧 𝑗 𝑔(𝑦 𝑗 , ℎ 𝑗 )) 𝑗 𝑔(𝑦, ℎ) = max Ɵ ∙ Φ(𝑦, ℎ)  (0) ℎ  (1)  (*)

Our works- Methodology Objecti tive functi tion stronger Local minimum minimum Epoch t  (0)  (1)  (*) Epoch 0  (2) Conv nvex Regular gulariz izatio ion Cont ntin inuat uation ion Optim imiz izat ation ion

Our works- Min-entropy latent model Object ect Object ect discov overy ery localizat zation on F . Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min -entropy Latent Model for Weakly Supervised object Detection,” IEEE CVPR2018

Our works- Min-entropy latent model (1) Instance (object and object part) are collected with a clique e partitio tition n module le; (2) Object clique discovery with a global min-entropy model; (3) Object localization with a local min-entropy model

Our works- Min-entropy latent model Clique e partit tition: ion:

Our works- Methodology Objecti tive stronger Local functi tion minimum minimum Epoch t  (0)  (1)  (*) Epoch 0  (2) Conv nvex Regular gulariz izatio ion Cont ntin inuat uation ion Optim imiz izat ation ion

Our works- Continuation Multiple Instance Learning F. Wan, C. Liu, J. Jiao, Q. Ye, “ CMIL: L: Continuation Multiple Instance Learning for Weakly Supervised object Detection (CVPR2019)

Our works- Min-entropy latent model

Our works- Recurrent Learning Recur urren rent Learnin earning Accumula ulated d Recur urren rent Learnin earning F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min -entropy Latent Model for Weakly Supervised object Detection ,” IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), DOI:10.1109/TPAMI.2019.2898858.

Our works- Results

Our works- Results SSER: Semant antic ic Stable able Extre remal al Regio ion

Our works- Soft Proposal Network Y. Zhu, Y. Zhou, Q. Ye, Q. Qiu, and J. Jiao, "Soft Proposal Network for Weakly Supervised Object Localization," in Proc. of IEEE Int. Conf. on Computer Vision (ICCV), 201

Our works- Soft Proposal Network

Our works- Peak Response Mapping Y. Zhou, Y. Zhu, Q. Ye, Q. Qiu , J. Jiao, “Weakly Supervised Instance Segmentation using Class Peak Response, IEEE CVPR 2018 (Spotlig tlight).

Our works- Peak Response Mapping

Our works- learning Instance Activation Maps Y. Zhu, Y. Zhou, H. Xu, Q. Ye., D. Doermann, J. Jiao, “Learning Instance Activation Maps for Weakly Supervised Instance Segmentation,” IEEE CVPR 2019.

Our works- learning Instance Activation Maps

The future Beyond regularization and continuation optimization Objecti tive functi tion strong nger Local minimum minimum Epoch t  (0)  (1)  (*) Epoch 0  (2) Contin ntinuat uation ion Optim imiz izat ation ion Conv nvex Regular gulariz izatio ion

The future Beyond weakly supervised detection and segmentation

The future Fill the gap of supervised and weakly supervised methods mAP on on PascalVOC 20 2007 with Fast-RCNN framework 0.65 0.6 15% 0.55 0.50 .505 0.47 .473 0.5 0.453 .453 0.44 .443 0.42 .428 0.45 0.412 .412 0.4 0.3 .348 0.35 0.3 0.25 WSDDN (2016) OICR (2017) WCCN (2017) TSC (2018) WeakRPN 2018 MELM (2018) CMIL (2019)

The future Weakly supervised detection meets X X= Few-shot Active Learning | Online Feedback | Temporal

The future X= Few-shot Active Learning | Online Feedback | Temporal Q. Ye, Z. Zhang, Q. Qiu, B. Zhang, J. Chen, and G. Sapiro, "Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model," IEEE CVPR, 2017

Ref. [1] F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min - entropy Latent Model for Weakly Supervised object Detection,” IEEE Trans. PAMI, DOI:10.1109/TPAMI.2019.2898858. (MELM+Recu Recurr rrent Learn rnin ing) [2] F. Wan, C. Liu, J. Jiao, Q. Ye, “CMIL: Continuation Multiple Instance Learning for Weakly Supervised object Detection (CVPR2019) (C (C-MIL IL) [3] Y. Zhu, Y. Zhou, H. Xu, Q. Ye., D. Doermann , J. Jiao, “Learning Instance Activation Maps for Weakly Supervised Instance Segmentation,” IEEE CVPR 2019 . (IAM) [4] P. Tang, X. Wang, S. Bai, W. Shen, X. Bai, W. Liu, and A. L. Yuille , “ Pcl: Proposal cluster learning for weakly supervised object detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018 . (PCL) CL) [5] Y. Zhou, Y. Zhu, Q. Ye, Q. Qiu , J. Jiao, “Weakly Supervised Instance Segmentation using Class Peak Response,” in Proc. of IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2018 (Spotlight). (PRM) [6] F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min - entropy Latent Model for Weakly Supervised object Detection,” in Proc. of IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2018: 1297-1306. (MELM) [7] Y. Zhu, Y. Zhou, Q. Ye, Q. Qiu, and J. Jiao, "Soft Proposal Network for Weakly Supervised Object Localization," in Proc. of IEEE Int. Conf. on Computer Vision (ICCV), 2017. (SPN) N) [8] Q. Ye, Z. Zhang, Q. Qiu, B. Zhang, J. Chen, and G. Sapiro, "Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model," IEEE CVPR, 2017 (Self lf-Learn rnin ing) [9] B. Hakan and V. Andrea, “Weakly supervised deep detection networks,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 2846 – 2854. (WSDD DDN) N) [10] B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba , “Learning deep features for discriminative localization,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp.2921 – 2929. (CAM)

Thank! www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye

Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao - PowerPoint PPT Presentation

Weakly Supervised Object Detection, Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao Zhou, Qixian ang Ye Ye www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye Problem Supervised object ect detection ection and inst

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Category-level localization Cordelia Schmid Category-level localization Localization of

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador,

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds B. Yang, J. Wang,

Localization Nischal K N System Overview Mapping Hector Mapping Localization Path Planning

Category-level localization Cordelia Schmid Category-level localization Localization up to a

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Divide And Conquer Small And Large Instance Small instance. Sort a list that has n <=

for Open-set Land Cover Classification of Satellite Imagery Razieh Kaviani Baghbaderani 1 , Ying

Energy Infrastructure Map of the World Varun Nair Tamasha Pathirathna Xiaolan You Qiwei

Whe here To To Find nd The Themes F For Google Sl Slides Spice up any presentation with

Learning Approaches to Estimate Depth from RGB Lecture 5 What will we learn - Latest Approaches

What is WordNet? Establishes Organizes over ontological and 150,000 words lexical Original

neat-EO.pink : Computer Vision framework for GeoSpatial Imagery @o_courtin @FOSDEM 2020 Error

Functional Imagery David Kavanagh Training Jackie Andrade Functional Imagery What would you

SAR Phenomenology Dr. Armin Doerry Detailed contact information at www.doerry.us This

Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao - PowerPoint PPT Presentation

Weakly Supervised Object Detection, Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao Zhou, Qixian ang Ye Ye www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye Problem Supervised object ect detection ection and inst

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Category-level localization Cordelia Schmid Category-level localization Localization of

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador,

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds B. Yang, J. Wang,

Localization Nischal K N System Overview Mapping Hector Mapping Localization Path Planning

Category-level localization Cordelia Schmid Category-level localization Localization up to a

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Divide And Conquer Small And Large Instance Small instance. Sort a list that has n &lt;=

for Open-set Land Cover Classification of Satellite Imagery Razieh Kaviani Baghbaderani 1 , Ying

Energy Infrastructure Map of the World Varun Nair Tamasha Pathirathna Xiaolan You Qiwei

Whe here To To Find nd The Themes F For Google Sl Slides Spice up any presentation with

Learning Approaches to Estimate Depth from RGB Lecture 5 What will we learn - Latest Approaches

What is WordNet? Establishes Organizes over ontological and 150,000 words lexical Original

neat-EO.pink : Computer Vision framework for GeoSpatial Imagery @o_courtin @FOSDEM 2020 Error

Functional Imagery David Kavanagh Training Jackie Andrade Functional Imagery What would you

SAR Phenomenology Dr. Armin Doerry Detailed contact information at www.doerry.us This

Divide And Conquer Small And Large Instance Small instance. Sort a list that has n <=