CS 103: Representation Learning, Information Theory and Control - PowerPoint PPT Presentation

CS 103: Representation Learning, Information Theory and Control Lecture 2, Jan 18, 2019

Today’s program What is a nuisance for a task? How do we design nuisance invariant representations? Invariance, equivariance, canonization A linear transformation is group equivariant if and only if it is a group convolution Image canonization with equivariant reference frame detector Applications to multi-object detection 2

Nuisance invariance

Why we need nuisance invariance 4 Images of office from Steps Toward a Theory of Visual Information , S. Soatto, 2011

Why we need nuisance invariance O ffi ce Mount Everest Team Disneyland Administration 5

What is a nuisance? It depends on the task Having different clothes is a nuisance for the task of recognizing the person. But what if our task is to tag the clothing style in the image? 6 Pictures of Ian McKellen from https://en.wikipedia.org/wiki/Ian_McKellen

Definition of tasks and nuisances Let x be the input data ( e.g., an image ) , and assume we want to infer the value of a hidden random variable y that depends on x, that is, we want to reconstruct the posterior distribution p( y | x ) . Then, we call y our task variable . Examples: Image classification: y is the label of the image Object detection: y is the label and bounding-box of all images in the image 3-D reconstruction: y is the 3-D geometry of the scene Control: y is the action to take to bring the system in a certain state 7

Definition of tasks and nuisances The observed image x may depend on a number of factors. Let’s write: e.g., shape of object e.g., illumination x = I ( ξ , ν ) Rendering function We will prove later that any image distribution can always be parametrized in this way, for an appropriate rendering function I . For now, think of I as a powerful and generic photorealistic rendering engine. 8

Effect of changing the rendering parameters Effect of changing the parameters of the rendering function. I ( ξ , ν ) I ( ξ , ν ′ � ) I ( ξ ′ � , ν ) 9

Effect of changing the rendering parameters Change of illumination, point of view ˜ ν = visibility ˜ I = h ( ξ , ˜ ν ) , ˜ ν = illumination I = h ( ξ , ν ) I = h (˜ ˜ ν ) , ˜ ν = viewpoint ˜ ξ , ˜ ξ 6 = ξ Change of identity 10 Images from Steps Toward a Theory of Visual Information , S. Soatto, 2011

Definition of nuisance Suppose that changing 𝛏 does not affect the task variable y . That is: p ( y | I ( ξ , ν )) = p ( y | I ( ξ , ν ′ � )) for all ν ′ � ∈ N Then we say that 𝛏 is a nuisance for the task y. Note: This is equivalent to saying that y is independent of 𝛏 , or alternatively that 𝛏 contains no information about the task y , i.e., I(y; 𝛏 ) = 0 Common examples: Illumination, change of contrast, rotations, translations, change of scale, … 11

Nuisance invariance We say that a representation z = f(x) is nuisance invariant if: f ( I ( ξ , ν )) = f ( I ( ξ , ν ′ � )) For all nuisances 𝛏 and 𝛏 ’. A representation is maximal invariant if all other invariant representations are a function of it. Idea: a nuisance invariant representation z throws away unneeded information. x f ( x ) 12

How do we design (maximal) invariant representations? Far from trivial in the general case. For simple (but important!) group nuisances we can develop a theory. I ( ξ , ν ′ � ) = g ν → ν ′ � ∘ I ( ξ , ν ) Translations, rotations 10 7 5 10 Permutation of vertexes 22 22 5 7 13

Group nuisances

This part was done on whiteboard, see LaTeX notes on class website.

Canonization

Invariance by canonization Idea: Instead of finding an invariant representation, apply a transformation to put the input in a standard form. I ( ξ , ν ) ⟼ g ν → ν 0 ∘ I ( ξ , ν ) = I ( ξ , ν 0 ) g ν → ν 0 17

Canonization for translations Suppose we want to canonize the image with respect to translations. 1. Decide a reference point that is uniquely defined, no matter how we translate the image   Examples: The barycenter of the image, the maximum (assuming it’s unique) 2. Write an algorithm to find the position of the reference point 3. Compute the translation that moves the reference point to the origin g ν ′ � → ν 0 Reference point (minimum) 18

Equivariant reference frame detector A reference frame detector R for a group G is any function R(x): X → G such that R ( g ⋅ x ) = g ⋅ R ( x ) That is, a reference frame detector is any equivariant function from X to G. Example: Let G = R 2 be the group of translations. Then R(x) = “position of the maximum of x” is a reference frame. 19

From equivariant frame detector to invariant representations Proposition. Let R be a reference frame detector for the group G . Define a representation f(x) as: f ( x ) = R ( x ) − 1 ⋅ x Then f(x) is a G -invariant representation. f ( g ⋅ x ) = R ( g ⋅ x ) − 1 ⋅ ( g ⋅ x ) Proof: = ( g ⋅ R ( x )) − 1 ⋅ g ⋅ x = R ( x ) − 1 ⋅ g − 1 ⋅ g ⋅ x = R ( x ) − 1 ⋅ x = f ( x ) 20

The canonization pipeline Canonization consists of the following steps 1. Build an equivariant reference frame detector 2. Choose a “ canonical ” reference frame 3. Find the reference frame of the input image 4. Invert the transformation to make the reference frame canonical Canonical frame Reference frame of input R ( x ) − 1 21

Some examples of canonization in vision Document analysis: Find border of the document and un-warp the image prior to analysis. Also: Normalize contrast and illumination 22 Image from https://blogs.dropbox.com/tech/2016/08/fast-document-rectification-and-enhancement/

Saccades Eyes move rapidly while looking at a fixed object. Image Trace of saccades Can we consider this a form of translation invariance by canonization? 23 Video and Images from https://en.wikipedia.org/wiki/Saccade

The R-CNN model for multi-object detection Region proposal: find regions of the image that may contain an interesting object (i.e., reference frame proposal) CNN classifier: warp the region to put it in canonical form (invariance) and feed it to a classifier Region proposal + CNN classifier = R-CNN 24

Region proposal mechanism Originally: hand-crafted proposal mechanisms based on saliency, uniformity of texture, scale, and so on. Nowadays: The same network does both the region proposal and the classification inside each region Fast R-CNN 25

Spatial Transformer Network Localisation network selects a local reference frame in the image Transformer resamples using that reference frame 26

CS 103: Representation Learning, Information Theory and Control - PowerPoint PPT Presentation

CS 103: Representation Learning, Information Theory and Control Lecture 2, Jan 18, 2019 Todays program What is a nuisance for a task? How do we design nuisance invariant representations? Invariance, equivariance, canonization A linear

Slide 1 / 103 Slide 2 / 103 1 What is metabolism? 2 What role do enzymes play in metabolic

Slide 1 / 103 1 What is metabolism? Slide 2 / 103 2 What role do enzymes play in metabolic

CS 103: Representation Learning, Information Theory and Control Lecture 5, Feb 8, 2019

Eukaryotic Cellular Reproduction: Mitosis & Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

Eukaryotic Cellular Reproduction: Mitosis & Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

CS 103: Representation Learning, Information Theory and Control Lecture 8, Mar 1, 2019 Recap

CS 103: Representation Learning, Information Theory and Control Lecture 6, Feb 15, 2019 VAEs and

CS 103: Representation Learning, Information Theory and Control Lecture 4, Feb 1, 2019 Seen last

CS 103: Representation Learning, Information Theory and Control Lecture 3, Jan 25, 2019 Seen

CS 103: Representation Learning, Information Theory and Control Lecture 1, Jan 11, 2019 What is

BUILDING INFORMATION: 103-105 GREENE STREET ADDRESS: 103-105 GREENE STREET AKA 101 GREENE

Safety Reviews Highways 101, 103 and 104 Purpose of Road Safety Reviews (Highways 101, 103 and

Title: Healing Class 103 Week 1 Healing 103 Week 1 Jesus Healing Individuals Part 5

Membership Survey Completion Rates of 660 Responses! 120% 103% 103% 100% 91% 91% 84% 80%

MKA-65-B MKA-65-PM MKA-66-P MKA-87-P MKA-103-N MKA-103-BNBF MKA-34NL MKA-34-NLP MKA-44N

EKT 103 KT 103 CHAPTER CHAPTER 5 5 DC Machine Contents Contents Overview of Direct

Energy Management Issue in Ad Hoc Networks Outline In ad hoc networks the devices are battery

Exploring Fatigue Online webinar, October 2020 Dr Anne Johnson Macmillan Consultant Occupational

Optimal and Heuristic Approaches for Constrained Flight Planning under Weather Uncertainty Florian

Sources of Authority Sources of Authority Sources of Authority Lesson No. 3 ENV H 471

Protocols and Roles Protocol: shared view; roles: each local view The Buyer Role Trade Protocol

KIRTLING STREET & HEATHWALL PUMPING STATION COMMUNITY LIAISON WORKING GROUP 12 October 2020

How mentoring can help you start contributing to open source Luciano Resende IBM | Spark

Valley Clean Energy CAC Meeting November 19, 2020 Via Teleconference Item 9 Update CAC

CS 103: Representation Learning, Information Theory and Control - PowerPoint PPT Presentation

CS 103: Representation Learning, Information Theory and Control Lecture 2, Jan 18, 2019 Todays program What is a nuisance for a task? How do we design nuisance invariant representations? Invariance, equivariance, canonization A linear

Slide 1 / 103 Slide 2 / 103 1 What is metabolism? 2 What role do enzymes play in metabolic

Slide 1 / 103 1 What is metabolism? Slide 2 / 103 2 What role do enzymes play in metabolic

CS 103: Representation Learning, Information Theory and Control Lecture 5, Feb 8, 2019

Eukaryotic Cellular Reproduction: Mitosis &amp; Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

Eukaryotic Cellular Reproduction: Mitosis &amp; Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

CS 103: Representation Learning, Information Theory and Control Lecture 8, Mar 1, 2019 Recap

CS 103: Representation Learning, Information Theory and Control Lecture 6, Feb 15, 2019 VAEs and

CS 103: Representation Learning, Information Theory and Control Lecture 4, Feb 1, 2019 Seen last

CS 103: Representation Learning, Information Theory and Control Lecture 3, Jan 25, 2019 Seen

CS 103: Representation Learning, Information Theory and Control Lecture 1, Jan 11, 2019 What is

BUILDING INFORMATION: 103-105 GREENE STREET ADDRESS: 103-105 GREENE STREET AKA 101 GREENE

Safety Reviews Highways 101, 103 and 104 Purpose of Road Safety Reviews (Highways 101, 103 and

Title: Healing Class 103 Week 1 Healing 103 Week 1 Jesus Healing Individuals Part 5

Membership Survey Completion Rates of 660 Responses! 120% 103% 103% 100% 91% 91% 84% 80%

MKA-65-B MKA-65-PM MKA-66-P MKA-87-P MKA-103-N MKA-103-BNBF MKA-34NL MKA-34-NLP MKA-44N

EKT 103 KT 103 CHAPTER CHAPTER 5 5 DC Machine Contents Contents Overview of Direct

Energy Management Issue in Ad Hoc Networks Outline In ad hoc networks the devices are battery

Exploring Fatigue Online webinar, October 2020 Dr Anne Johnson Macmillan Consultant Occupational

Optimal and Heuristic Approaches for Constrained Flight Planning under Weather Uncertainty Florian

Sources of Authority Sources of Authority Sources of Authority Lesson No. 3 ENV H 471

Protocols and Roles Protocol: shared view; roles: each local view The Buyer Role Trade Protocol

KIRTLING STREET &amp; HEATHWALL PUMPING STATION COMMUNITY LIAISON WORKING GROUP 12 October 2020

How mentoring can help you start contributing to open source Luciano Resende IBM | Spark

Valley Clean Energy CAC Meeting November 19, 2020 Via Teleconference Item 9 Update CAC

Eukaryotic Cellular Reproduction: Mitosis & Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

Eukaryotic Cellular Reproduction: Mitosis & Meiosis www.njctl.org Slide 3 / 103 Slide 4 /

KIRTLING STREET & HEATHWALL PUMPING STATION COMMUNITY LIAISON WORKING GROUP 12 October 2020