
343H: Honors AI Lecture 18: Decision Networks and VOI 3/27/2014 - PowerPoint PPT Presentation



  1. 343H: Honors AI. Lecture 18: Decision Networks and VOI. 3/27/2014. Kristen Grauman, UT Austin. Slides courtesy of Dan Klein, UC Berkeley, unless otherwise noted.

  2. Recall: Inference in Ghostbusters
   A ghost is in the grid somewhere
   Sensor readings tell how close a square is to the ghost:
     On the ghost: red
     1 or 2 away: orange
     3 or 4 away: yellow
     5+ away: green
   Sensors are noisy, but we know P(Color | Distance), e.g. at distance 3:
     P(red | 3) = 0.05, P(orange | 3) = 0.15, P(yellow | 3) = 0.5, P(green | 3) = 0.3

  3. Inference in Ghostbusters

  4. Inference in Ghostbusters
   Need to decide when and what to sense!

  5. Decision Networks
   MEU: choose the action which maximizes the expected utility given the evidence
   Can directly operationalize this with decision networks
   New node types:
     Chance nodes (just like BNs), e.g. Weather and Forecast
     Action nodes (cannot have parents, act as observed evidence), e.g. Umbrella
     Utility node U (depends on action and chance nodes)

  6. Decision Networks
   Action selection:
     Instantiate all evidence
     Set action node(s) each possible way
     Calculate the posterior for all parents of the utility node, given the evidence
     Calculate the expected utility for each action
     Choose the maximizing action

  7. Example: Decision Networks
     A      W     U(A,W)        W     P(W)
     leave  sun   100           sun   0.7
     leave  rain  0             rain  0.3
     take   sun   20
     take   rain  70
   Optimal decision = leave
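The action-selection procedure above can be sketched in a few lines of Python using the slide's numbers (the variable and function names are my own):

```python
# Action selection in the umbrella decision network (values from the slide).
P_W = {"sun": 0.7, "rain": 0.3}                       # prior P(W)
U = {("leave", "sun"): 100, ("leave", "rain"): 0,
     ("take", "sun"): 20, ("take", "rain"): 70}       # utility U(A, W)

def expected_utility(action, p_w):
    """EU(a) = sum over weather outcomes of P(w) * U(a, w)."""
    return sum(p * U[(action, w)] for w, p in p_w.items())

eu = {a: expected_utility(a, P_W) for a in ("leave", "take")}
best = max(eu, key=eu.get)
# EU(leave) = 0.7*100 + 0.3*0 = 70; EU(take) = 0.7*20 + 0.3*70 = 35
# -> optimal decision: leave, MEU = 70
```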

  8. Decisions as Outcome Trees
   Almost exactly like expectimax / MDPs
   What's changed?
   (Tree: the root {} branches on the Umbrella action; Weather chance nodes lead to leaves U(t,s), U(t,r), U(l,s), U(l,r).)

  9. Example: Decision Networks
   Evidence: Forecast = bad
     W     P(W | F=bad)        A      W     U(A,W)
     sun   0.34                leave  sun   100
     rain  0.66                leave  rain  0
                               take   sun   20
                               take   rain  70
   Optimal decision = take
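The same computation as before, with the posterior P(W | F=bad) in place of the prior (a sketch; names are mine):

```python
# Umbrella network with Forecast = bad observed: use P(W | F=bad).
P_W_given_bad = {"sun": 0.34, "rain": 0.66}
U = {("leave", "sun"): 100, ("leave", "rain"): 0,
     ("take", "sun"): 20, ("take", "rain"): 70}

def expected_utility(action, p_w):
    return sum(p * U[(action, w)] for w, p in p_w.items())

eu = {a: expected_utility(a, P_W_given_bad) for a in ("leave", "take")}
best = max(eu, key=eu.get)
# EU(leave | bad) = 0.34*100 = 34; EU(take | bad) = 0.34*20 + 0.66*70 = 53
# -> optimal decision: take, MEU given a bad forecast = 53
```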

  10. Decisions as Outcome Trees
   (Tree: with evidence {b}, the chance nodes become W | {b}, leading to leaves U(t,s), U(t,r), U(l,s), U(l,r).)

  11. Ghostbusters decision network

  12. Value of Information
   Idea: compute the value of acquiring evidence; can be done directly from the decision network
   Example: buying oil drilling rights
     Two blocks A and B, exactly one has oil, worth k
     You can drill in one location
     Prior probabilities 0.5 each, mutually exclusive
     Drilling in either A or B has EU = k/2, so MEU = k/2
       D  O  U(D,O)        O  P(O)
       a  a  k             a  1/2
       a  b  0             b  1/2
       b  a  0
       b  b  k
   Question: what's the value of information of OilLoc?
     Value of knowing which of A or B has oil
     Value is the expected gain in MEU from the new info
     Survey may say "oil in a" or "oil in b", prob 0.5 each
     If we know OilLoc, MEU is k (either way)
     Gain in MEU from knowing OilLoc: VPI(OilLoc) = k - k/2 = k/2
     Fair price of the information: k/2
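The slide's arithmetic can be checked directly (a minimal sketch, with k = 1 as an arbitrary scale):

```python
# VPI of OilLoc in the drilling example; k = 1 is an arbitrary choice of scale.
k = 1.0
P_O = {"a": 0.5, "b": 0.5}                                       # prior P(OilLoc)
U = {("a", "a"): k, ("a", "b"): 0.0,
     ("b", "a"): 0.0, ("b", "b"): k}                             # U(DrillLoc, OilLoc)

def meu(p_o):
    """Best expected utility over drill locations, given a belief about OilLoc."""
    return max(sum(p * U[(d, o)] for o, p in p_o.items()) for d in ("a", "b"))

meu_now = meu(P_O)                                               # k/2: drill blindly
# If OilLoc were revealed, MEU is k either way; average over what it might say.
meu_informed = sum(p * meu({o: 1.0}) for o, p in P_O.items())    # k
vpi = meu_informed - meu_now                                     # k/2
```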

  13. VPI Example: Weather
   MEU with no evidence?
   MEU if forecast is bad?
   MEU if forecast is good?
     A      W     U(A,W)
     leave  sun   100
     leave  rain  0
     take   sun   20
     take   rain  70

  14. VPI Example: Weather
   MEU with no evidence?
   MEU if forecast is bad?
   MEU if forecast is good?
     A      W     U(A,W)        F     P(F)
     leave  sun   100           good  0.59
     leave  rain  0             bad   0.41
     take   sun   20
     take   rain  70
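The slide leaves the three MEU blanks to be filled in. P(sun | F=good) is not printed here, but assuming the numbers are consistent it can be recovered from P(sun) = 0.7 and P(sun | F=bad) = 0.34 via the law of total probability. A sketch (variable names are mine):

```python
# Filling in the blanks on the slide.  P(sun | F=good) is derived (an assumed
# consistency step) from P(sun) = P(good) P(sun|good) + P(bad) P(sun|bad).
P_F = {"good": 0.59, "bad": 0.41}
p_sun, p_sun_bad = 0.7, 0.34
p_sun_good = (p_sun - P_F["bad"] * p_sun_bad) / P_F["good"]   # ~0.95

U = {("leave", "sun"): 100, ("leave", "rain"): 0,
     ("take", "sun"): 20, ("take", "rain"): 70}

def meu(p_s):
    """MEU when Weather is sunny with probability p_s."""
    p = {"sun": p_s, "rain": 1 - p_s}
    return max(sum(pw * U[(a, w)] for w, pw in p.items())
               for a in ("leave", "take"))

meu_now = meu(p_sun)        # 70: no evidence (leave)
meu_bad = meu(p_sun_bad)    # 53: after a bad forecast (take)
meu_good = meu(p_sun_good)  # ~95: after a good forecast (leave)
vpi_forecast = (P_F["good"] * meu_good + P_F["bad"] * meu_bad) - meu_now  # ~7.8
```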

  15. Value of Information
   Assume we have evidence E = e. Value if we act now:
     MEU(e) = max_a Σ_s P(s | e) U(s, a)
   Assume we see that E' = e'. Value if we act then:
     MEU(e, e') = max_a Σ_s P(s | e, e') U(s, a)
   BUT E' is a random variable whose value is unknown, so we don't know what e' will be
   Expected value if E' is revealed and then we act:
     Σ_{e'} P(e' | e) MEU(e, e')
   Value of information: how much MEU goes up by revealing E' first, then acting, over acting now:
     VPI(E' | e) = ( Σ_{e'} P(e' | e) MEU(e, e') ) - MEU(e)

  16. VPI Properties
   Nonnegative: VPI(E' | e) ≥ 0
   Nonadditive: consider, e.g., observing E_j twice (the second observation adds nothing)
   Order-independent
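The first two properties can be checked concretely on the oil-drilling example from slide 12: observing OilLoc once is worth k/2, but observing the same node a second time adds nothing, so two observations are not worth 2 · VPI. A sketch with k = 1:

```python
# Nonnegativity and nonadditivity of VPI, on the oil-drilling example (k = 1).
k = 1.0
P_O = {"a": 0.5, "b": 0.5}
U = {("a", "a"): k, ("a", "b"): 0.0,
     ("b", "a"): 0.0, ("b", "b"): k}

def meu(p_o):
    return max(sum(p * U[(d, o)] for o, p in p_o.items()) for d in ("a", "b"))

def vpi_oilloc(p_o):
    """Expected MEU after observing OilLoc, minus MEU under the current belief."""
    return sum(p * meu({o: 1.0}) for o, p in p_o.items()) - meu(p_o)

first = vpi_oilloc(P_O)          # k/2: nonnegative (here strictly positive)
# After observing OilLoc = a the belief is deterministic, so a second
# observation of the same node reveals nothing new:
second = vpi_oilloc({"a": 1.0})  # 0.0
assert first + second != 2 * first   # non-additive
```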

  17. Quick VPI Questions
   The soup of the day is either clam chowder or split pea, but you wouldn't order either one. What's the value of knowing which it is?
   There are two kinds of plastic forks at a picnic. One kind is slightly sturdier. What's the value of knowing which?
   You're playing the lottery. The prize will be $0 or $100. You can play any number between 1 and 100 (chance of winning is 1%). What is the value of knowing the winning number?

  18. Value of imperfect information?
   No such thing
   Information corresponds to the observation of a node in the decision network
   If data is "noisy", that just means we don't observe the original variable, but another variable which is a noisy version of the original one

  19. VPI Question (network with nodes DrillLoc, OilLoc, Scout, ScoutingReport, and utility U)
   VPI(OilLoc)?
   VPI(ScoutingReport)?
   VPI(Scout)?
   VPI(Scout | ScoutingReport)?

  20. Another VPI example

  21. Training an object recognition system: the standard pipeline
   (Diagram: Annotators produce Labeled data, which trains Category models that are applied to Novel images.) Kristen Grauman

  22. The active visual learning pipeline
   (Diagram: Selection picks from Unlabeled/partially labeled data, Annotators produce Labeled data, which trains the Category models.)

  23. Active selection
   • Traditional active learning reduces supervision by obtaining labels for the most informative or uncertain examples first.
   (Figure: labeled positive and negative examples, plus an unlabeled "?" example.)
   [Mackay 1992, Freund et al. 1997, Tong & Koller 2001, Lindenbaum et al. 2004, Kapoor et al. 2007, ...]

  24. Problem: Active selection and recognition
   • Multiple levels of annotation are possible (ranging from less expensive to more expensive to obtain)
   • Variable cost depending on level and example
   • Many annotators working simultaneously

  25. Idea: Cost-sensitive multi-level active learning
   • Compute a decision-theoretic active selection criterion that weighs both:
     – which example to annotate, and
     – what kind of annotation to request for it
   as compared to the predicted effort the request would require.
   [Vijayanarasimhan & Grauman, NIPS 2008, CVPR 2009]

  26. Idea: Cost-sensitive multi-level active learning
   (Figure: candidate annotations rated on informativeness vs. effort.)
   – Most regions are understood, but this region is unclear.
   – This looks expensive to annotate, and it does not seem informative.
   – This looks expensive to annotate, but it seems very informative.
   – This looks easy to annotate, but its content is already understood.

  27. Multi-level active queries
   • Predict which query will be most informative, given the cost of obtaining the annotation.
   • Three levels (types) to choose from:
     1. What object is this region?
     2. Does the image contain object X?
     3. Segment the image, name all objects.

  28. Decision-theoretic multi-level criterion
   Value of asking a given question about a given data object =
     current misclassification risk
     - estimated risk if the candidate request were answered
     - cost of getting the answer
   Estimate the risk of incorporating the candidate before obtaining the true answer by computing an expected value over the set of all possible answers.

  29. Decision-theoretic multi-level criterion
   Estimate the risk of incorporating the candidate before obtaining the true answer by computing an expected value over the set of all possible answers.
   How many terms are in the expected value?

  30. Decision-theoretic multi-level criterion
   Compute the expectation via Gibbs sampling:
   • Start with a random setting of the labels.
   • For S iterations:
     o Temporarily fix labels on M-1 regions; train.
     o Sample the remaining region's label.
     o Cycle that label into the fixed set.
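The sampling loop on the slide can be sketched as follows. This is a toy illustration, not the authors' implementation: `train_and_predict` is a hypothetical stand-in for retraining the recognition model on the fixed labels, and the binary-label setup is my simplification.

```python
import random

def train_and_predict(fixed_labels):
    """Hypothetical stand-in for retraining: estimate P(label = 1) from the
    frequency among the currently fixed region labels."""
    p1 = sum(fixed_labels) / len(fixed_labels)
    return min(max(p1, 0.05), 0.95)      # keep probabilities away from 0/1

def gibbs_label_samples(M, S, seed=0):
    """Draw S joint samples over M region labels, one per Gibbs sweep."""
    rng = random.Random(seed)
    labels = [rng.randint(0, 1) for _ in range(M)]  # random initial setting
    samples = []
    for _ in range(S):
        for i in range(M):                           # cycle through regions
            fixed = labels[:i] + labels[i + 1:]      # fix labels on M-1 regions
            p1 = train_and_predict(fixed)            # "train" on the fixed set
            labels[i] = 1 if rng.random() < p1 else 0  # sample region i's label
        samples.append(list(labels))                 # cycle back into fixed set
    return samples

samples = gibbs_label_samples(M=5, S=20)
```

The samples would then be averaged to estimate the expected risk term in the selection criterion.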

  31. Decision-theoretic multi-level criterion
   Estimate the risk of incorporating the candidate before obtaining the true answer by computing an expected value over the set of all possible answers, for M regions.

  32. Decision-theoretic multi-level criterion
   Criterion terms: current misclassification risk; estimated risk if the candidate request were answered; cost of getting the answer.
   Cost of the answer: from domain knowledge, or predicted directly.

  33. Recap: Actively seeking annotations
   Compute value of information scores on the unlabeled/partially labeled data
   Issue a request to the annotator, e.g. "Get a full segmentation on image #32."
   The new annotation joins the labeled data used to train the category models

  34. Multi-level active learning curves
   (Plot: learning curves vs. annotation cost in seconds. Region features: texture and color.)

  35. Recap
   • Decision networks:
     – What action will maximize expected utility?
     – Connection to expectimax
   • Value of information:
     – How much are we willing to pay for a sensing action to gather information?
