GMNet: Graph Matching Network for Large Scale Part Semantic - PowerPoint PPT Presentation

GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild Umberto Michieli, Edoardo Borsato, Luca Rossi, Pietro Zanuttigh umberto.michieli@dei.unipd.it

Sema Se mantic Se Segme mentation - Defini niti tion background Assign to each pixel a label representing the class to road signs which the pixel belongs. cars people Dense task • Deep learning revolutionized the field • road sidewalk (autoencoder models) [1] [1] Long et al., "Fully convolutional networks for semantic segmentation", CVPR 2015.

Mu Multi-Cl Class ss Part rt Parsi sing à Learn multiple parts of multiple objects 58 parts 108 parts 1 3 4 2 1 3 4 2 Multi-class part Input image Object-level Single-class part parsing parsing (e.g. person) parsing

Coarse Co se-to to-Fine Fine Lear earning ning Transfer knowledge form a coarse problem to a finer one 1 Spatial level coarse-to-fine: object-level classes split into 3 4 their parts à learn multiple parts of multiple objects 2 1 3 4 2 Annotations object-level Annotations part-level

Co Coarse se-to to-Fine Fine at t Spa patial tial Level el First idea (b (baseline): ): just train a network on all the different parts 1 Low results, 2 main reasons: q Object-level ambiguity: corresponding parts in different semantic classes often share similar appearance Sheep legs ? Cow legs

Co Coarse se-to to-Fine Fine at t Spa patial tial Level el First idea (b (baseline): ): just train a network on all the different parts 1 Low results, 2 main reasons: q Object-level ambiguity: corresponding parts in different semantic classes often share similar appearance q Part-level ambiguity: limited local context is captured Dog head ? Dog tail

Co Coarse se-to to-Fine Fine at t Spa patial tial Level el First idea (b (baseline): ): just train a network on all the different parts 1 Low results, 2 main reasons: q Object-level ambiguity: corresponding parts in different semantic classes often share similar appearance Ø object-level guidance via semantic embedding network ! Ø auxiliary reconstruction module from parts to objects q Part-level ambiguity: limited local context is captured Ø graph-matching module to preserve relative spatial relationships between ground truth and predicted parts.

GM GMNe Net Ar Archi hitectur ure object-level network " #,% � � � " #,% � part-level network � Channel-wise concatenation Trainable Pre-trained on object parsing

Gr Grap aph Match chin ing Module le cat head ∑ Part-wise body 𝐻𝑈 𝐻𝑈 ) 2 𝑛 𝑗,𝑘 ∑ 𝑘 (𝑛 𝑗,𝑘 2D dilation 𝑗 legs tail φ head Part-wise body 𝑞𝑠𝑓𝑒 𝑛 𝑗,𝑘 ∑ 2D dilation 𝑗 legs tail 𝑞𝑠𝑓𝑒 ) 2 ∑ 𝑘 (𝑛 𝑗,𝑘 φ cat Normalized matrices à proximity ratios L GM = || M GT � M pred || F Graph-Matching loss:

Dataset t – VO VOC2012 Pascal Parts RGB Object-level GT Pascal-Part-58 Pascal-Part-108 PASCAL-VOC 2012 : § 10103 images: 4998 train and 5105 validation § 21 object-level classes § Pascal-Part-58 [1] and Pascal-Part-108 [2,3] [1] Zhao et al., “Multi-class Part Parsing with Joint Boundary-Semantic Awareness”, iCCV 2019 [2] A. Gonzalez-Garcia et al., ”Do Semantic Parts Emerge in Convolutional Neural Networks?”, IJCV, 2017 [3] Michieli et al., “GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild”, ECCV, 2020

Expe Experiments s – Pa Pascal 58 Method mIoU Avg. RGB Annotation Baseline BSANet* GMNet (ours) SegNet 24.4 26.5 FCN 42.3 44.9 DeepLab v1 49.9 51.9 DRN D 38 50.0 50.9 DRN D 105 53.0 53.0 BSANet* 58.2 58.9 Baseline 54.4 55.7 (DeepLab v3) GMNet (ours) 59.0 61.8 * It is the only other method for multi-class part parsing and uses the same architecture (DeepLab v3+, ResNet-101) Multi-class Zhao et al., “Part Parsing with Joint Boundary-Semantic Awareness”, iCCV 2019

Expe Experiments s – Pa Pascal 108 Method mIoU Avg. RGB Annotation Baseline BSANet* GMNet (ours) SegNet 18.6 20.8 FCN 31.6 33.8 DeepLab v1 35.7 40.8 DRN D 38 39.1 41.9 DRN D 105 39.5 41.0 BSANet* 42.9 46.3 Baseline 41.3 43.7 (DeepLab v3) GMNet (ours) 45.8 50.5 * It is the only other method for multi-class part parsing and uses the same architecture (DeepLab v3+, ResNet-101) Multi-class Zhao et al., “Part Parsing with Joint Boundary-Semantic Awareness”, iCCV 2019

Co Conclusi sion Semantic segmentation of multiple p parts from multiple o objects Co Contributions: : • Ob Object-le level l se semanti tic embedding n network guides part-level decoding stage • Gr Graph-ma matching g mo module for accurate relative localization of semantic parts art performance on Pascal-Part-58 and 108 • GMNet achieves new st state-of of-th the-ar

Paper website: https://lttm.dei.unipd.it/paper_data/GMNet Code: https://github.com/LTTM/GMNet ArXiv: https://arxiv.org/abs/2007.09073 Contact: umberto.michieli@dei.unipd.it Michieli U., Borsato E., Rossi L. and Zanuttigh P., “GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild,” ECCV 2020.

GMNet: Graph Matching Network for Large Scale Part Semantic - PowerPoint PPT Presentation

GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild Umberto Michieli, Edoardo Borsato, Luca Rossi, Pietro Zanuttigh umberto.michieli@dei.unipd.it Sema Se mantic Se Segme mentation - Defini niti tion

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

Matching Bipartite Matching Input Given a (undirected) graph G = ( V , E ) Input Given a bipartite

Graph Matchings Matching A matching M in a graph G is a set of non-loop edges with no shared

Schema Matching in a Large Scale Schema Matching in a Large Scale Personal Schema Based Querying

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Network/Graph Network/Graph Informally a graph is a set of nodes Theory Theory joined by a

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

1 Shape- -Context: Matching Context: Matching Scale Invariance in Clutter ? Shape Scale

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

(Probably) Concave Graph Matching Haggai Maron and Yaron Lipman Weizmann Institute of Science

Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems A. Gharaibeh, E.

Pregel Large-Scale Graph Processing William Jones Analysing large graphs is hard. We are

Graph Neural Network Fang Yuanqiang, 2019/05/18 Graph Neural Network Why GNN? Preliminary

Using GPUs to Accelerated Computational Performance Dr Eric McCreath Research School of Computer

How to Make Lisp More Special P a s c a l C o s t a n z a V U B r u s s e l Dynamic Scoping

Understanding GPU performance How to get peak FLOPS (GPU version) Kenjiro Taura 1 / 7 Contents

Convex Formulation of Continuous Multi-label Problems by Pock, Schoenemann, Graber,

Veronese Varieties over Fields with non-zero Characteristic: A Survey Hans Havlicek Vienna

Subprograms function (expression) Subprogram (subroutine) statement abstraction,

The story of the film so far... Partition rule: P ( A ) = P ( A | B ) P ( B ) + P ( A | B c ) P ( B

IATA's New Distribution Capability April 13, 2017 12 pm EDT www.thecompanydime.com