Recap from Monday Visualizing Networks Caffe overview Slides are - PowerPoint PPT Presentation

Recap from Monday • Visualizing Networks • Caffe overview • Slides are now online

Today • Edges and Regions, GPB • Fast Edge Detection Using Structured Forests – Zhihao Li • Holistically-Nested Edge Detection – Yuxin Wu • Selective Search for Object Recognition – Chun-Liang Li

Logistics • Please read: – Region-based Convolutional Networks for Accurate Object Detection and Semantic Segmentation • If you’re up next, please meet us • Project Proposals Due in < 1 week – If you have questions, ask to meet

Edges and Regions David Fouhey

Task "I stand at the window and see a house, trees, sky. Theoretically I might say there were 327 brightnesses and nuances of colour. Do I have "327"? No. I have sky, house, and trees.” -Max Wertheimer Quote from Jitendra Malik’s page

Approaching the Task – Regions Decomposing image into K connected regions (Clustering task) …

Approaching the Task – Edges HxWx {0,1} classification problem

Are the Tasks Equivalent? Segmentation Boundaries ?

Are the Tasks Equivalent? Boundaries Segmentation ?

Are the Tasks Equivalent? Boundaries Segmentation ? Contours have to be closed!

Does This Matter in the CNN Era? HED – State of the Art

Are These Well-Defined Tasks? Should blue and yellow go in the same segment? Image credit: NYU depth dataset

Successes – Superpixels Problem: >10^5 pixels intractable for reasoning Solution: use bigger/super pixels that don’t ruin any boundaries First from Ren et al. 2003, Fish image from Achanta et al. 2012

Successes – Multiple Segmentations • Problem : No one segmentation is good • Solution : Use many, figure it out later Hoiem et al. 2005

Contributions of Paper • Merges the (edges + regions) approaches • Introduces machinery used throughout vision • Landmark paper in segmentation/boundary detection • Note: the questions are often as important as the answers

Questions from Piazza • Where’s the learning?! – Great idea! Two papers next • What’s this useful for? – Great question! Last paper today, paper for Monday.

Dataset – BSDS 500 Images – 500 Total – 300 Training, 200 Testing Annotation – 5 annotators (CV students) per image – Annotators annotate segment

Dataset – Instructions Divide each image into pieces, where each piece represents a distinguished thing in the image. It is important that all of the pieces have approximately equal importance. The number of things in each image is up to you. Something between 2 and 20 should be reasonable for any of our images Martin et al. “A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics.” ICCV 2001

Dataset – Image and Annotations

Evaluation Criteria – Boundaries Precision = TP / (TP+FP) (fraction of predicted + results that are +) Recall = TP / (TP+FN) (fraction of + results that are predicted +)

Evaluation Criteria – Segments • In words: Average the intersection/union of the best predicted region for all GT regions, weighted by GT region size • Previous evaluation criteria don’t clearly distinguish dumb baselines from algorithm outputs

GPB-OWT-UCM Boundary Segmentation Boundary Segmentation Detection Machinery Detection Machinery Local Spectral Spectral OWT+UCM Discontinuity Embedding Discontinuity

Local Terms • Core Idea: can compute histogram distances

Local Terms Luminance Max over Image Orientation 1 Orientation 2 Orientations

Local Terms Luminance Image Max over Orientations

Local Terms – Multiple Cues Accumulate evidence per-orientation Weighted Sum of Predictions

Learning • Simple linear combinations = few parameters • Gradient ascent in the reading • Logistic regression in past Contour strength in feature + scale weights

GPB-OWT-UCM Boundary Segmentation Boundary Segmentation Detection Machinery Detection Machinery Local Spectral Spectral OWT+UCM Discontinuity Embedding Discontinuity Probability of contour at location x,y, orientation t

Globalization – Motivation Local Globalized

Globalization 𝑋 ∈ 𝑆 𝐼𝑋 𝑦 𝐼𝑋 Normal Spectral Clustering 1. Use W to produce embedding/space defined by eigenvectors of a system of equations. See links on Piazza for why 2. Cluster in induced space This Paper 1. Use W to produce embedding/space defined by eigenvectors of a system of equations 2. Treat eigenvectors as images, compute gradient

Globalization Weighted Input Eigenvectors of Spectral System Sum of Gradients

Combining Global + Local • Linear weighting; weights learned with gradient ascent Orientations processed separately throughout Why is this important?

GPB-OWT-UCM Could cluster in this space Boundary Segmentation Boundary Segmentation Detection Machinery Detection Machinery Local Spectral Spectral OWT+UCM Discontinuity Embedding Discontinuity Probability of contour at location x,y, orientation t taking into consideration soft segmentations

Watershed Transform – 1D Version • Black region: probability of boundary • Black lines: watershed boundaries

Orientation Problem: probability of boundary is orientation- dependent Solution: get probability of boundary in direction

Output of Watershed Transform “ Oversegmentation ” of image with boundary strengths

UCM • Hierarchical merging; guarantees closed contours

GPB-OWT-UCM Boundary Segmentation Boundary Segmentation Detection Machinery Detection Machinery Local Spectral Spectral OWT+UCM Discontinuity Embedding Discontinuity Contour that can be cut at any point to yield closed regions

Results – State of the Art This : 72.6 Current SOA: 78.2

Results – Ablative Analysis • Combining Local + Global helps • Why does local help in high-recall regime?

Results – Ablative Analysis OWT/UCM: • Ensures closed boundaries • Helps a little

Next Up • Fast Edge Detection Using Structured Forests – Zhihao Li • Holistically-Nested Edge Detection – Yuxin Wu • Selective Search for Object Recognition – Chun-Liang Li

Recap from Monday Visualizing Networks Caffe overview Slides are - PowerPoint PPT Presentation

Recap from Monday Visualizing Networks Caffe overview Slides are now online Today Edges and Regions, GPB Fast Edge Detection Using Structured Forests Zhihao Li Holistically-Nested Edge Detection Yuxin Wu

Monday, 8 August 11 Monday, 8 August 11 Monday, 8 August 11 Monday, 8 August 11 Monday, 8

1 2 Monday, October 25, 2010 3 4 Monday, October 25, 2010 5 6 Monday, October 25, 2010 7

Monday, 16 October 2017 Monday, 16 October 2017 Monday, 16 October 2017 Monday, 16 October 2017

GitHub Infrastructure Tom Preston-Werner @mojombo Monday, October 4, 2010 Git? Monday, October

SALT Vagrant and Virtualbox Ben Hosmer @bhosmer Monday, April 22, 13 Local Dev Prod

The Operation Impact of Continuous Deployment Monday, December 12, 11 DevOps (a quick

Semiotics: Recap Examples References Jrg Cassens Data and Process Visualization SoSe 2017

Probabilistic Computation Lecture 13 BPP vs. PH 1 Recap 2 Recap Probabilistic computation 2

Access Methods 1 / 44 Recap Recap 2 / 44 Recap A More Detailed Architecture granularity:

Trees (Part 2) 1 / 59 Trees (Part 2) Recap Recap 2 / 59 Trees (Part 2) Recap B + Tree A B

Trees (Part 1) 1 / 57 Trees (Part 1) Recap Recap 2 / 57 Trees (Part 1) Recap Hash Tables

Proof of Stake Recap Bitcoin Incentives Block subsidy Transaction fees Recap

Probabilistic Computation Lecture 13 Understanding BPP 1 Recap 2 Recap Probabilistic

Ruby Monstas Session 14 Agenda Recap Standard Library: RSS Exercises Recap Recap: TodoList

117 Things to Know... Monday, May 24, 2010 117 Things to Know... Monday, May 24, 2010 1 The

Monday - Art 1 Week 3 Afternoons.notebook June 12, 2020 Monday - Art 2 Week 3

Edge states at spin quantum Hall transitions Roberto Bondesan (LPTENS/IPhT Saclay) With: I.

Boundaries of reduced C*-algebras of discrete groups Matthew Kennedy (joint work with Mehrdad

Research at the Boundary of Robotics and AI: Challenge Problems Prof: Peter Stone Department of

Pahoa Transit Hub County of Hawaii Mass Transit Agency Mass Transit and Multimodal Master Plan

DOE HEP Budget and Planning or Message from The Funding Frontier Intensity Frontier Workshop

Ambiguous Fullerene Patches Dr. Christy Graves University of Texas at Tyler CSD 5 Conference

Structure and analysis of www Rik Sarkar Hyperlinks Give a network structure to a set of

Computer Vision Exercise Session 10 Image Categorization Object Categorization Task