PatchCut: Data-Driven Obje ject Segmentation via Local Shape - PowerPoint PPT Presentation

PatchCut: Data-Driven Obje ject Segmentation via Local Shape Transfer Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, and Ming-Hsuan Yang Tayfun Ateş Burak Ercan

Contents • Introduction • Problem Statement and Motivation • Method Overview • Main contributions • Related Work • Proposed Method • Image Retrieval • Local Shape Transfer • PatchCut • High order MRF with Local Shape Transfer • Algorithm for Single Scale Segmentation • Cascade Object Segmentation Algorithm with Coarse to Fine Approach • Experiments • Conclusions 2

Problem Statement • Object segmentation is the task of separating a foreground object from its background 3

Motivation • Provides mid-level representations for high-level recognition tasks • Object recognition • Image classification • Semantic segmentation • Image captioning • Has immediate applications to image and video editing • Adobe Photoshop and After Effects 4

Method Overview • Object segmentation using examples • Multiscale image matching in patches by PatchMatch • Patch-wise segmentation candidates • An algorithm based on higher order MRF energy function to produce the segmentation • Coarse-to-fine approach 5

Main Contributions (1/2) • A novel nonparametric high-order MRF model via patch-level label transfer for object segmentation • An efficient iterative algorithm (PatchCut) that solves the proposed MRF energy function in patch-level without using graph cuts • State-of-the-art performance on various object segmentation benchmark datasets 6

Main Contributions (2/2) • Incorporating object shape information for segmentation • No offline training • No user interaction • No prior knowledge on category specific object models • Patch level local shape transfer scheme 7

Related Work (MRF) • Binary labeling on Markov Random Fields (MRFs) with foreground/background appearance models: • Y. Y. Boykov and M.-P. Jolly. Interactive graph cuts for optimal boundary & region segmentation of objects in n-d images. In ICCV , 2001. 8

Related Work (Interactive Methods) • Requires user input • Incorporating object shape information for • Color or texture cues to improve segmentation performance segmentation • Y. Y. Boykov and M.-P. Jolly. Interactive graph cuts for optimal boundary & region segmentation of objects in n-d images. In ICCV , 2001. • No offline training • V. Lempitsky, P. Kohli, C. Rother, and T. Sharp. Image segmentation with a bounding box prior. In ICCV , • No user interaction 2009. • C. Rother, V. Kolmogorov, and A. Blake. Grabcut - • No prior knowledge on category specific object interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics (SIGGRAPH) , 2004. models • J. Wu, Y. Zhao, J.-Y. Zhu, S. Luo, and Z. Tu. Milcut: A sweeping line multiple instance learning paradigm for • Patch level local shape transfer scheme interactive image segmentation. In CVPR , 2014. 9

Related Work (Salient Object Segmentation) • Segmenting object(s) that grab(s) our attention most • Incorporating object shape information for • Requires high contrast segmentation • F. Perazzi, P. Krahenb ¨ uhl, Y. Pritch, and A. Hornung. ¨ Saliency filters: Contrast based filtering for salient region detection. In CVPR , 2012. • No offline training • R. Margolin, A. Tal, and L. Zelnik-Manor. What makes a patch distinct? In CVPR , 2013. • No user interaction • M.-M. Cheng, N. J. Mitra, X. Huang, P. H. S. Torr, and S.- M. Hu. Global contrast based salient region • No prior knowledge on category specific object detection. PAMI , 2014. models • Patch level local shape transfer scheme 10

Related Work (Model Based Algorithms) • Offline learning based methods • Incorporating object shape information for • E. Borenstein and S. Ullman. Class-specific, top-down segmentation. In ECCV , 2002 segmentation • D. Larlus and F. Jurie. Combining appearance models and markov random fields for category level object segmentation. • No offline training In CVPR , 2008. • M. P. Kumar, P. Torr, and A. Zisserman. Obj cut. In • No user interaction CVPR , 2005 • L. Bertelli, T. Yu, D. Vu, and B. Gokturk. Kernelized • No prior knowledge on category specific object structural svm learning for supervised object segmentation. In CVPR , 2011. • models J. Yang, S. Safar, and M.-H. Yang. Max-margin Boltzmann machines for object segmentation. In CVPR , 2014. • Patch level local shape transfer scheme 11

Related Work (Data Driven Methods) • Global shape transfer without online learning • Incorporating object shape information for • Image match by either window based or local feature based segmentation • Less time efficient • D. Kuettel and V. Ferrari. Figure-ground segmentation • No offline training by transferring window masks. In CVPR , 2012. • E. Ahmed, S. Cohen, and B. Price. Semantic object selection. In CVPR , 2014. • No user interaction • J. Kim and K. Grauman. Shape sharing for object segmentation. In ECCV , 2012. • No prior knowledge on category specific object • J. Tighe and S. Lazebnik. Finding things: Image parsing with regions and per-exemplar detectors. In CVPR , models 2013. • Patch level local shape transfer scheme 12

Related Work (Structured Label Space) • Forest based image labeling algorithms • Each leaf node stores one example label patch • These trained forests are used for • Edge Detection • Semantic Labeling • P. Kontschieder, S. R. Bulo, H. Bischof, and M. Pelillo. Structured class-labels in random forests for semantic image labelling. In ICCV , 2011. • P. Dollar and C. Zitnick. Structured forests for fast edge detection. In ICCV , 2013. 13

Revisiting Main Contributions • Incorporating object shape information for segmentation • No offline training • No user interaction • No prior knowledge on category specific object models • Patch level local shape transfer scheme 14

Proposed Method • A data driven approach 15

Proposed Method • A data driven approach • What is meant by being data driven? How the proposed method uses data? 16

Proposed Method • A data driven approach • What is meant by being data driven? How the proposed method uses data? • For a single query image, it finds most similar M images (M is fixed as 16) with their segmentation masks and uses this information to create better segmentation results by proposing a multiscale patch based method. From: svcl.ucsd.edu 17

Proposed Method • A data driven approach • What is meant by being data driven? How the proposed method uses data? • For a single query image, it finds most similar M images (M is fixed as 16) with their segmentation masks and uses this information to create better segmentation results by proposing a multiscale patch based method. • Image retrieval is done representing the query and dataset images either by using features from Bag-Of-Words, or 7th layer of convolutional networks (ConvNet)* trained with ImageNet. From: svcl.ucsd.edu *Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014. 18

Proposed Method Test image Segmentation of the test image (we want to estimate this) Example images (retrieved from the database) Segmentation ground truths of example images 19

Local Shape Transfer Downsampled versions of the test image, with scale s Downsampled versions of examples and their segmentations Size of the original image Sizes of the downsampled images K number of 16x16 patches for scale s 20

How to Find Matches for a Patch? SIFT descriptor of 32x32 patches Solve the matching problem: PatchMatch efficiently solves this! Match of kth patch patch in mth example Cost of this match 21

Patch Match From: vis.berkeley.edu/courses/cs294-69-fa11 22

Solution Space for the Test Image Local segmentation masks from the matched patches in m th example Authors assume that: • These masks constitute a patch-wise segmentation solution space for the test image • The segmentation mask of test image can be well approximated by these masks How can we validate this assumption? 23

Validation of the Assumption Let’s calculate the mean of local masks over M example images Mean shape prior mask can then be calculated by adding up Also find the oracle shape prior mask from the best possible (by using the ground truth as reference) • Object is well located in the coarsest scale, but blurry • In the finest scale, masks can become noisy • Background near the legs is mostly uniform • Background near the upper body is cluttered • A coarse-to-fine strategy can be employed 24

PatchCut (Some Preliminaries) The energy function: Segmentation problem is solved by minimizing this function The unary term: Negative log probability of the label given the pixel color and Gaussian Mixture Models (GMMs) and for foreground and background color The pairwise term: Measures the cost of assigning different labels to two adjacent pixels (based on their color difference) The shape term: Measures the inconsistency with shape prior Q This energy function can be minimized with alternating two steps similar to GrabCut: 1) 2) 25

PatchCut: Data-Driven Obje ject Segmentation via Local Shape - PowerPoint PPT Presentation

PatchCut: Data-Driven Obje ject Segmentation via Local Shape Transfer Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, and Ming-Hsuan Yang Tayfun Ate Burak Ercan Contents Introduction Problem Statement and Motivation Method

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Pr Proposal Tr Tracking an and Se Segmentation (PTS) S): A A cascaded network for video obje

P art I Understanding the Ob ject-Orien ted W orld View 11 Chapter 1 Ob ject-Orien

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Literary Elements OB OBJE JECT CTIVES IVES Identify elements of a short story Define

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Priority-Driven Scheduling of Periodic Tasks Priority-driven vs. clock-driven scheduling:

Customer Segmentation in Python Karolis Urbonas Head of Data Science, Amazon DataCamp Customer

Vi Video Ob eo Object ject Segm Segmen enta tati tion on CV3DST | Prof. Leal-Taix 1

Lithology/fluid prediction subsurface Torstein Fjeldstad Department of Mathematical sciences

Deep Learning Russ Salakhutdinov Associate Professor Machine Learning

Markov random fields Help from David Bolin, Johan Lindstrm and Finn Lindgren Julian Besag

Creative Teaching in Actuarial Science Diana Skrzydlo, ASA Outline Background Clickers

Online Slip Prediction for Mobile Robots 16831 Project Proposal Neal Seegmiller, Chris

29/06/2020 1 29/06/2020 The University of Genoa is a generalist university that develops

Energy Harvesting: Strategies for ultra low power consumption and reliability. Antonio Rubio and

Neural Probabilistic Models for Melody Prediction, Sequence Labelling and Classification