Part I: Unsupervised Feature Learning with Convolutional Neural Networks
Thomas Brox, Computer Vision Group, University of Freiburg, Germany
Research funded by ERC Starting Grant VideoLearn and Deutsche Telekom Stiftung
Status quo: CNNs generate great features
• ILSVRC 2012 classification (Krizhevsky et al. 2012)
• PASCAL VOC object detection (Girshick et al. 2014)
Do we need these massive amounts of class labels to learn generic features?
Unsupervised feature learning
• Dominant concept: reconstruction error + regularization
• Existing frameworks:
  – Autoencoders (dimensionality reduction) (Hinton 1989, Vincent et al. 2008, …)
  – Sparse coding (sparsity prior) (Olshausen-Field 1996, Mairal et al. 2009, Bo et al. 2012, …)
  – Slowness prior (Wiskott-Sejnowski 2002, Zou et al. 2012, …)
  – Deep belief networks (prior in contrastive divergence) (Ranzato et al. 2007, Lee et al. 2009, …)
• Reconstruction error models the input distribution: a dubious objective for feature learning
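The dominant objective criticized above can be sketched in a few lines: minimize reconstruction error plus a regularizer, here an L1 sparsity penalty on the code as in sparse coding or sparse autoencoders. The linear encoder/decoder and the penalty weight are illustrative assumptions, not any specific cited model.

```python
import numpy as np

# Sketch of the "reconstruction error + regularization" objective:
# a linear autoencoder loss with an L1 sparsity penalty on the code.
# W_enc/W_dec and lam are placeholders, not a particular published model.

def reconstruction_loss(x, W_enc, W_dec, lam=0.1):
    """||x - W_dec @ (W_enc @ x)||^2 + lam * ||W_enc @ x||_1"""
    code = W_enc @ x
    recon = W_dec @ code
    return np.sum((x - recon) ** 2) + lam * np.sum(np.abs(code))

x = np.array([1.0, 2.0])
W = np.eye(2)
# Perfect reconstruction leaves only the sparsity penalty on the code:
loss = reconstruction_loss(x, W, W, lam=0.1)  # 0 + 0.1 * (1 + 2) = 0.3
```

Note that the loss measures how well the input distribution is modeled, which is exactly the objective the slide calls dubious for learning discriminative features.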
Exemplar CNN: discriminative objective
with Alexey Dosovitskiy, Jost Tobias Springenberg
• Train a CNN to discriminate surrogate classes
• Take data augmentation to the extreme (translation, rotation, scaling, color, contrast, brightness)
• The transformations define the invariance properties of the features to be learned
Acknowledgements to caffe.berkeleyvision.org
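The surrogate-class construction can be sketched as follows: each seed patch defines one class, and its training samples are randomly transformed versions of that patch. The patches and transformation parameters here are placeholders; only the structure of the resulting training set reflects the slide.

```python
import random

# Hedged sketch of Exemplar-CNN-style surrogate classes. Each "patch" is
# an opaque object and each transformation a (name, parameter) record;
# real implementations apply actual geometric/photometric ops to images.

TRANSFORMS = ["translate", "rotate", "scale", "color", "contrast", "brightness"]

def make_surrogate_dataset(seed_patches, samples_per_class, rng=None):
    """Each seed patch defines one surrogate class; its samples are
    randomly transformed versions of that patch."""
    rng = rng or random.Random(0)
    dataset = []
    for label, patch in enumerate(seed_patches):
        for _ in range(samples_per_class):
            # Compose a random chain of augmentations (parameters are
            # placeholders for the actual transformation magnitudes).
            chain = [(t, rng.uniform(-1.0, 1.0)) for t in TRANSFORMS]
            dataset.append((patch, chain, label))
    return dataset

data = make_surrogate_dataset(["patch_a", "patch_b"], samples_per_class=3)
# 2 surrogate classes x 3 samples each = 6 training examples
```

A CNN trained to separate these classes must become invariant to exactly the transformations applied, which is the point of the slide's third bullet.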
Application to classification
• Pooled responses from each layer used as features
• Training of a linear SVM

Method                               STL-10  CIFAR-10  Caltech-101
Convolutional K-means network         60.1    70.7        -
View-invariant K-means                63.7    72.6        -
Multi-way local pooling                -       -         77.3
Slowness on video                     61.0     -         74.6
Hierarchical Matching Pursuit (HMP)   64.5     -          -
Multipath HMP                          -       -         82.5
Exemplar CNN                          72.8    75.3       85.5

Outperforms all previous unsupervised feature learning approaches
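The feature-extraction step can be sketched as coarse spatial pooling of each layer's response maps, concatenated into one vector for the linear SVM. The 2x2 grid and max-pooling here are assumptions; the slide only says "pooled responses".

```python
import numpy as np

# Hedged sketch: pool a convolutional feature map over a coarse 2x2
# spatial grid (quadrant max-pooling) and concatenate across quadrants.
# The resulting fixed-length vector would feed a linear SVM.

def quadrant_max_pool(fmap):
    """fmap: (channels, H, W) -> (4 * channels,) pooled feature vector."""
    c, h, w = fmap.shape
    hh, ww = h // 2, w // 2
    quads = [fmap[:, :hh, :ww], fmap[:, :hh, ww:],
             fmap[:, hh:, :ww], fmap[:, hh:, ww:]]
    # Max over each quadrant, per channel, then concatenate.
    return np.concatenate([q.reshape(c, -1).max(axis=1) for q in quads])

fmap = np.arange(2 * 4 * 4, dtype=float).reshape(2, 4, 4)
feat = quadrant_max_pool(fmap)  # length 4 quadrants * 2 channels = 8
```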
Which transformations are most relevant?
How many surrogate classes?
How many samples per class?
Application to descriptor matching
Descriptor matching between two images
CNNs won't work for descriptor matching, right?
with Philipp Fischer, Alexey Dosovitskiy
Evaluated on the Mikolajczyk dataset and a new, larger dataset
Descriptors from a CNN outperform SIFT
Supervised versus unsupervised CNN
with Philipp Fischer, Alexey Dosovitskiy
Evaluated on the Mikolajczyk dataset and a new, larger dataset
Unsupervised feature learning is advantageous for descriptor matching
Relevance of improvement
with Philipp Fischer, Alexey Dosovitskiy
The improvement of Exemplar CNN over SIFT is as big as that of SIFT over raw color patches
Summary of Part I
• Exemplar CNN: unsupervised feature learning by discriminating surrogate classes
• Outperforms previous unsupervised methods on classification
• CNNs outperform SIFT even on descriptor matching
• Unsupervised training is advantageous for descriptor matching
Part II: Benchmarking Video Segmentation
Thomas Brox, Computer Vision Group, University of Freiburg, Germany
Contains joint work with Fabio Galasso, Bernt Schiele (MPI Saarbrücken)
Research funded by DFG and ERC
Motion segmentation
Brox-Malik ECCV 2010, Ochs et al. PAMI 2014
Benchmarking motion segmentation
Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59)
59 sequences, split into a training and a test set
Pixel-accurate ground truth, annotated for roughly every 20th frame
Precision-recall metric
• Regions are assigned to ground truth with the Hungarian method
• Over-segmentation lowers recall; under-segmentation lowers precision
[Figure: example machine segmentations vs. ground truth, scored P=0.94, R=0.67, F=0.78 / P=0.98, R=0.80, F=0.88 / P=1.00, R=0.56, F=0.72]
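The two ingredients of the metric can be sketched directly: a one-to-one region assignment that maximizes total overlap (the Hungarian method; brute force suffices for a tiny example), and the harmonic-mean F-measure that combines the resulting precision and recall.

```python
import itertools

# Sketch of the benchmark's matching step. overlap[i][j] holds the pixel
# count shared by machine region i and ground-truth region j; we pick the
# one-to-one assignment with maximal total overlap (brute-force Hungarian).

def best_assignment(overlap):
    """Returns (assignment pairs, total overlap) maximizing overlap."""
    n_m, n_g = len(overlap), len(overlap[0])
    best, best_score = None, -1
    for perm in itertools.permutations(range(n_g), min(n_m, n_g)):
        score = sum(overlap[i][j] for i, j in enumerate(perm))
        if score > best_score:
            best, best_score = list(enumerate(perm)), score
    return best, best_score

def f_measure(p, r):
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if p + r > 0 else 0.0

# The F-scores on the slide follow from the stated P/R pairs:
assert round(f_measure(0.94, 0.67), 2) == 0.78
assert round(f_measure(0.98, 0.80), 2) == 0.88
assert round(f_measure(1.00, 0.56), 2) == 0.72
```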
Results on the test set
Methods compared: Rao et al. CVPR 2008, SSC (Elhamifar-Vidal CVPR 2009), Brox-Malik ECCV 2010, Ochs-Brox ICCV 2011, Ochs-Brox CVPR 2012, Ochs et al. PAMI 2014
Benchmarking general video segmentation
VSB-100: benchmark based on the Berkeley Video Segmentation Dataset
100 HD videos (40 training, 60 test)
Four human annotations per video
with Fabio Galasso, Naveen S. Nagaraja, Bernt Schiele (Galasso et al. ICCV 13)
Metric for supervoxels
Precision: for each machine region, find the ground truth region with maximum overlap; normalize by the evaluated pixels in the video minus the largest ground truth region (a single all-covering region yields P=0); average over all human annotations.
Recall: for each ground truth region, find the machine region with maximum overlap; normalize by the size of all ground truth regions minus the size of the largest ground truth region (the trivial segmentation yields R=0); average over all human annotations.
• Many-to-one matching (important for supervoxels)
• Normalization penalizes extreme segmentations
Results
Methods compared: Corso et al. TMI 08, Grundmann et al. CVPR 10, Arbelaez et al. TPAMI 11 (image segmentation), Arbelaez et al. + oracle, Ochs-Brox ICCV 11, Xu et al. ECCV 12, Galasso et al. ACCV 12, a simple baseline, and human performance
Motion segmentation subtask
Methods compared: Grundmann et al. CVPR 10, Arbelaez et al. + oracle, Ochs-Brox ICCV 11, Galasso et al. ACCV 12, the simple baseline, and human performance
About the "simple baseline"
1. Take the superpixel hierarchy from Arbelaez et al.
2. Propagate labels to the next frame using optical flow
3. In the next frame, determine each label by voting
Image segmentation + optical flow > video segmentation? It should be the other way around. There is work to do.
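The propagation step above can be sketched as follows: warp the previous frame's region labels along the optical flow, then let each superpixel in the next frame take the majority vote of the labels that land inside it. The array layout and the rounding of flow vectors are assumptions made for illustration.

```python
import numpy as np

# Hedged sketch of the "simple baseline" propagation step. Flow and
# superpixels are given as dense arrays; only the label transfer and the
# per-superpixel majority vote are shown.

def propagate_labels(labels, flow, superpixels, n_labels):
    """labels: (H, W) int region labels at frame t.
    flow: (H, W, 2) forward optical flow (dy, dx) from t to t+1.
    superpixels: (H, W) int superpixel ids at frame t+1.
    Returns the majority-vote label for each superpixel at t+1."""
    h, w = labels.shape
    votes = np.zeros((superpixels.max() + 1, n_labels), dtype=int)
    for y in range(h):
        for x in range(w):
            # Follow the flow vector to the pixel's position at t+1.
            ty = int(round(y + flow[y, x, 0]))
            tx = int(round(x + flow[y, x, 1]))
            if 0 <= ty < h and 0 <= tx < w:
                votes[superpixels[ty, tx], labels[y, x]] += 1
    return votes.argmax(axis=1)

# Tiny example: two regions moving one pixel to the right.
labels = np.array([[0, 0, 1, 1, 1]])
flow = np.zeros((1, 5, 2)); flow[..., 1] = 1.0
superpixels = np.array([[0, 0, 1, 1, 1]])
next_labels = propagate_labels(labels, flow, superpixels, n_labels=2)
```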
Balanced graph reduction
with Fabio Galasso, Margret Keuper, Bernt Schiele (Galasso et al. CVPR 14)
Reduce the original pixel graph (frames t=1, t=2) to a superpixel graph
Edge reweighting is necessary for weight balancing in spectral clustering
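The reduction can be sketched as collapsing pixel nodes into superpixel nodes by summing the affinities between groups; without rebalancing, large superpixels dominate the spectral clustering objective. The size-based normalization below is one simple balancing choice for illustration, not necessarily the exact reweighting of Galasso et al. CVPR 14.

```python
import numpy as np

# Hedged sketch of graph reduction for spectral clustering. W is a pixel
# affinity matrix; "assignment" maps each pixel to a superpixel. Summed
# inter-superpixel weights grow with superpixel size, so we optionally
# divide by the superpixel sizes as a simple rebalancing (an assumption,
# not the paper's exact formula).

def reduce_graph(W, assignment, balance=True):
    """W: (n, n) symmetric pixel affinity matrix.
    assignment: length-n array mapping each pixel to a superpixel id.
    Returns the reduced superpixel affinity matrix."""
    k = assignment.max() + 1
    # Indicator matrix: M[i, s] = 1 iff pixel i belongs to superpixel s.
    M = np.zeros((len(assignment), k))
    M[np.arange(len(assignment)), assignment] = 1.0
    Wr = M.T @ W @ M                          # summed inter-superpixel weights
    if balance:
        sizes = M.sum(axis=0)
        Wr = Wr / np.outer(sizes, sizes)      # size-normalized weights
    return Wr

# Fully connected 4-pixel graph collapsed into two 2-pixel superpixels.
W = np.ones((4, 4)) - np.eye(4)
a = np.array([0, 0, 1, 1])
Wr_raw = reduce_graph(W, a, balance=False)
Wr_bal = reduce_graph(W, a)
```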
Balancing clearly improves results
Compared: simple baseline, Galasso et al. ACCV 12, and the reweighted graph reduction
Summary of Part II
• FBMS-59: motion segmentation benchmark
• VSB-100: general video segmentation benchmark
• Spectral clustering with superpixels: don't forget to rebalance