Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual - PowerPoint PPT Presentation

COMP 546 Lecture 15 Cue combinations, Bayesian models Thurs. March 1, 2018 1

Visual Cues: image properties that can tell us about scene properties Image Scene texture depth gradient - size, shape, density - slant, tilt shading surface curvature binocular disparities depth motion (from moving observer) defocus blur 2

Last lecture: Likelihood 𝑞 𝐽 = 𝑗 𝑇 = 𝑡 ) • Probability of measuring image 𝐽 = 𝑗, when the scene is 𝑇 = 𝑡. (called “likelihood” of scene 𝑇 = 𝑡 , given the image 𝐽 = 𝑗 ). • Maximum likelihood method: Choose 𝑇 = 𝑡 that maximizes 𝑞 𝐽 = 𝑗 𝑇 = 𝑡 ) 3

This lecture: How to combine cues ? 𝑞 𝐽 1 , 𝐽 2 𝑇 ) 4

Example: texture only (monocular) stereo only texture and stereo [Hillis 2004] 5

Assume likelihood function is “conditionally independent”: 𝑞 𝐽 1 , 𝐽 2 𝑇 ) = 𝑞 𝐽 1 𝑇 ) 𝑞 𝐽 2 𝑇 ) e.g. 𝐽 1 is texture. 𝐽 2 is binocular disparity. 6

𝑞 𝐽 2 𝑇 ) 𝑞 𝐽 1 𝑇 ) 𝑇 = s Assume 𝑞 𝐽 1 = 𝑗 1 𝑇 = 𝑡 ) and 𝑞 𝐽 2 = 𝑗 2 𝑇 = 𝑡 ) are Gaussian shaped. 7

𝑞 𝐽 2 𝑇 ) 𝑞 𝐽 1 𝑇 ) 𝑇 = s 𝑡 1 𝑡 2 Assume 𝑞 𝐽 1 = 𝑗 1 𝑇 = 𝑡 ) and 𝑞 𝐽 2 = 𝑗 2 𝑇 = 𝑡 ) are Gaussian shaped. Their maxima might occur at different values of 𝑡 . Why ? 8

We want to find the 𝑡 that maximizes: − 𝑡 − 𝑡 1 2 − 𝑡 − 𝑡 2 2 2 𝜏 12 2 𝜏 22 𝑞 𝐽 1 | 𝑇 = 𝑡 𝑞 𝐽 2 | 𝑇 = 𝑡 = 𝑓 𝑓

We want to find the 𝑡 that maximizes: − 𝑡 − 𝑡 1 2 − 𝑡 − 𝑡 2 2 2 𝜏 12 2 𝜏 22 𝑞 𝐽 1 | 𝑇 = 𝑡 𝑞 𝐽 2 | 𝑇 = 𝑡 = 𝑓 𝑓 So, we want to find the 𝑡 that minimizes:

The lecture notes show that the solution 𝑇 = 𝑡 is 𝑡 = 𝑥 1 𝑡 1 + 𝑥 2 𝑡 2 where 𝑥 1 + 𝑥 2 = 1 0 < 𝑥 𝑗 < 1 “Linear Cue Combination”

The lecture notes show that the solution 𝑇 = 𝑡 is 𝑡 = 𝑥 1 𝑡 1 + 𝑥 2 𝑡 2 where 𝑥 1 + 𝑥 2 = 1 0 < 𝑥 𝑗 < 1 𝜏 2 2 𝜏 1 2 𝑥 1 = 𝑥 2 = 𝜏 12 + 𝜏 22 𝜏 12 + 𝜏 22 Thus, less reliable cue (larger 𝜏 ) get less weight.

Example: [Hillis 2004] texture only (monocular) stereo only Measure slant discrimination thresholds for cues in isolation . Estimate likelihood function parameters ( 𝑡 1 , 𝜏 1 , 𝑡 2 , 𝜏 2 ). 13

… then • present cues together texture and stereo • measure thresholds for 𝑇 • convert thresholds to likelihood parameters ( 𝑡 , σ ) 14

… then • present cues together texture and stereo • measure thresholds for 𝑇 • convert thresholds to likelihood parameters ( 𝑡 , σ ) • examine if these values are consistent with the model* 𝑡 = 𝑥 1 𝑡 1 + 𝑥 2 𝑡 2 *Model also makes prediction about σ in combined case. 15

𝑞 𝐽 2 𝑇 ) 𝑞 𝐽 1 𝑇 ) texture and stereo 𝑇 = s 𝑡 1 𝑡 2 Experimenter can manipulate 𝑡 1 , 𝑡 2 , 𝜏 1 , 𝜏 2 and predict effect on perception of slant. 16

COMP 546 Lecture 15 Cue combinations, Bayesian models Thurs. March 1, 2018 17

𝑞 𝐽 = 𝑗 𝑇 = 𝑡) ≠ 𝑞 𝑇 = 𝑡 𝐽 = 𝑗) Likelihood of scene 𝑡 , Probability of scene 𝑡 , given image 𝑗 given image 𝑗 What is the crucial difference ? 18

wire frame with independently chosen depths regular solid cube flat drawing All scenes above have the same likelihood 𝑞( 𝐽 = 𝑗 | 𝑇 = 𝑡 ). Why do we prefer the regular solid cube? [Kersten & Yuille 2003]

Some scenes may have a larger probability 𝑞(𝑇 = 𝑡 ). The marginal probably 𝑞(𝑇 = 𝑡 ) is called the "prior".

𝑞(𝐽, 𝑇 ) 𝑞 𝐽 𝑇 ) ≡ 𝑞(𝑇) 𝑞 (𝐽, 𝑇 ) 𝑞 𝑇 𝐽 ) ≡ 𝑞(𝐽) Thus, 𝑞 𝐽 𝑇 ) 𝑞 𝑇 = 𝑞 𝑇 𝐽 ) 𝑞 𝐽

Bayes Theorem 𝑞(𝐽, 𝑇 ) 𝑞 𝐽 𝑇 ) ≡ 𝑞(𝑇) 𝑞 (𝐽, 𝑇 ) 𝑞 𝑇 𝐽 ) ≡ 𝑞(𝐽) Thus, likelihood scene prior 𝑞 𝐽 𝑇 ) 𝑞 𝑇 𝑞 𝑇 𝐽 ) = 𝑞 𝐽 image prior posterior

Maximum ‘ a Posteriori’ (MAP) Given an image, 𝐽 = 𝑗, find the scene 𝑇 = 𝑡 that maximizes 𝑞( 𝑇 = 𝑡 | 𝐽 = 𝑗 ). likelihood scene prior 𝑞 𝐽 𝑇 ) 𝑞 𝑇 𝑞 𝑇 𝐽 ) = 𝑞 𝐽 image prior posterior

Maximum ‘ a Posteriori’ (MAP) Given an image, 𝐽 = 𝑗, find the scene 𝑇 = 𝑡 that maximizes 𝑞( 𝑇 = 𝑡 | 𝐽 = 𝑗 ). We don't care about 𝑞( 𝐽 = 𝑗 ). Why not ? likelihood scene prior 𝑞 𝐽 𝑇 ) 𝑞 𝑇 𝑞 𝑇 𝐽 ) = 𝑞 𝐽 image prior posterior

If the prior p(S) is uniform then maximum likelihood gives the same solution as maximum posterior (MAP). likelihood scene prior constant 𝑞 𝐽 𝑇 ) 𝑞 𝑇 𝑞 𝑇 𝐽 ) = 𝑞 𝐽 image prior posterior Interesting cases arise when the prior is non-uniform.

likelihood prior

Ames Room http://www.youtube.com/watch?v=Ttd0YjXF0no https://www.youtube.com/watch?v=gJhyu6nlGt8

Priors (“Natural Scenes Statistics”) • intensity • orientation of image lines, edges • disparity • motion • surface slant, tilt

orientation 𝜄 of lines, edges 𝑞(𝑇 = 𝜄) [Girshick 2011] People are indeed better at discriminating vertical and horizontal orientations than oblique orientations. Why? Because they use a prior ?

surface slant 𝜏 and tilt 𝜐 ceiling floor Here we represent (slant, tilt) using a concave hemisphere. See next slide.

𝑞(𝑇 = (𝜏, 𝜐)) Each disk shows 𝑞(𝜏, 𝜐) for surfaces represent slants and tilts using a concave visible over a range of viewing direction elevations, relative to line of sight. [Adams & Elder 2016]

𝑞(𝑇 = (𝜏, 𝜐))

Maximum a Posteriori (MAP) Choose the S = (slant,tilt) that maximizes the posterior. ∗ 𝑞( 𝑇 ) = 𝑞(𝐽 = 𝑗 | 𝑇 ) 𝑞 𝑇 𝐽 = 𝑗 ) posterior likelihood prior

Likelihood functions can have more than one maximum. overall (slant, tilt) 𝑞(𝐽 = 𝑗 | 𝑇 ) i.e. convex or concave ?

Depth Reversal Ambiguity and Shading (see Exercise) Likelihood (slant, tilt) 𝑞(𝐽 = 𝑗 | 𝑇 ) A valley illuminated from the right produces the same shading as a hill illuminated from the left.

What “priors” does the visual system use to resolve such twofold ambiguities ? Let’s look at a few related examples.

You can perceive the center point as a hill or a valley. When you see it as a hill, you perceive the tilt as 180 deg (leftward). But when you see it as a valley, the slant is 0 (rightward).

We tend to see the center as a hill. Why ?

We tend to see the center as a valley. Why ?

The visual system uses three priors to resolve the depth reversal ambiguity: - surface orientation: p(floor) > p(ceiling) - light source direction: p( above) > p( below) - ‘global’ surface curvature: p(convex) > p(concave)

Example in which all three priors assumptions are met light from above viewpoint from above (floor) shape is convex

Example in which all three prior assumptions fail shape is concave viewpoint from below (ceiling) light from below

Convex shape, illuminated from above the line of sight floor ceiling

Concave shape, illuminated from below the line of sight ceiling floor

We showed how people combined the three different "priors": Percent correct in judging local "hill" or "valley": = 50 +/- 10 floor vs. ceiling +/- 10 light from above vs. below +/- 10 globally convex/concave [Langer and Buelthoff, 2001]

Best Worst (80%) (20%)

These look weird, but in different ways. How ?

Reminder • A2 is due tonight • Midterm (optional) is first class after Study Break

Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual - PowerPoint PPT Presentation

COMP 546 Lecture 15 Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual Cues: image properties that can tell us about scene properties Image Scene texture depth gradient - size, shape, density - slant, tilt shading surface

Cue validity Cue validity - predictiveness of a cue for a given category Central

MATH 105: Finite Mathematics 6-5: Combinations Prof. Jonathan Duncan Walla Walla College Winter

Cue Based Feeding in the NICU ANNA ELSENBROCK, MS, OTR/L, CPST, CNT LAURA LUCAS, MS, RD, CSP, LD

Depth Perception Deep Blue See April 5, 2020 PSYCH 4041 / 6014 Overview Cue Theory

CUE Library Information Everything you need to know about CUE Library. We're here to help!

Breaking the TV Habit Valerie Lanard @valer @gigabody Duhiggs Habit Cycle Routine Cue

Optical flow Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys Slides from S.

Presentation to Accounting Firm Presentation to Accounting Firm FAS 141 Revised (Business

You need to get into a vault Try all combinations. Try a subset of combinations.

4/22/2009 You need to get into a vault Try all combinations. Try a subset of combinations.

2021 SECONDARY 3 SUBJECT COMBINATIONS Students Briefing 14 October 2020 Sec 3 Subject

Decision Procedures for Verification Combinations of Decision Procedures (3) 4.02.2019 Viorica

JUST THE MATHS SLIDES NUMBER 19.2 PROBABILITY 2 (Permutations and combinations) by

Sharp bounds on the expectations of linear combinations of k th records expressed in the Gini mean

Backtracking Search Mark Redekopp David Kempe Sandra Batista 2 GENERATING ALL COMBINATIONS

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Deep-Learning: general principles + Convolutional Neural Networks Pr. Fabien MOUTARDE Center

robots navigation LUKAS HFLIGER SUPERVISED BY MARIAN GEORGE 2 LUKAS HFLIGER 3 4 LUKAS

Deep Structured Learning Chunhua Shen School of Computer Science, The University of Adelaide

A PODS-based Extended Kalman Filter: Quantifying Sensing Uncertainties in Automatic Bird Species

Cosmic Rays Energy Spectrum from PeV to EeV energies measured by the TALE Detector Tareq

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Depth Perception in Grasshopper -Shashank Chepurwar -Ritvik Srivastava Grasshopper -Agile

Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual - PowerPoint PPT Presentation

COMP 546 Lecture 15 Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual Cues: image properties that can tell us about scene properties Image Scene texture depth gradient - size, shape, density - slant, tilt shading surface

Cue validity Cue validity - predictiveness of a cue for a given category Central

MATH 105: Finite Mathematics 6-5: Combinations Prof. Jonathan Duncan Walla Walla College Winter

Cue Based Feeding in the NICU ANNA ELSENBROCK, MS, OTR/L, CPST, CNT LAURA LUCAS, MS, RD, CSP, LD

Depth Perception Deep Blue See April 5, 2020 PSYCH 4041 / 6014 Overview Cue Theory

CUE Library Information Everything you need to know about CUE Library. We're here to help!

Breaking the TV Habit Valerie Lanard @valer @gigabody Duhiggs Habit Cycle Routine Cue

Optical flow Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys Slides from S.

Presentation to Accounting Firm Presentation to Accounting Firm FAS 141 Revised (Business

You need to get into a vault Try all combinations. Try a subset of combinations.

4/22/2009 You need to get into a vault Try all combinations. Try a subset of combinations.

2021 SECONDARY 3 SUBJECT COMBINATIONS Students Briefing 14 October 2020 Sec 3 Subject

Decision Procedures for Verification Combinations of Decision Procedures (3) 4.02.2019 Viorica

JUST THE MATHS SLIDES NUMBER 19.2 PROBABILITY 2 (Permutations and combinations) by

Sharp bounds on the expectations of linear combinations of k th records expressed in the Gini mean

Backtracking Search Mark Redekopp David Kempe Sandra Batista 2 GENERATING ALL COMBINATIONS

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

Deep-Learning: general principles + Convolutional Neural Networks Pr. Fabien MOUTARDE Center

robots navigation LUKAS HFLIGER SUPERVISED BY MARIAN GEORGE 2 LUKAS HFLIGER 3 4 LUKAS

Deep Structured Learning Chunhua Shen School of Computer Science, The University of Adelaide

A PODS-based Extended Kalman Filter: Quantifying Sensing Uncertainties in Automatic Bird Species

Cosmic Rays Energy Spectrum from PeV to EeV energies measured by the TALE Detector Tareq

Hawkular Metrics Metric Storage &amp; Alerting Stefan Negrea About Me Co-Creator of Hawkular

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Depth Perception in Grasshopper -Shashank Chepurwar -Ritvik Srivastava Grasshopper -Agile

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular