CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna - PowerPoint PPT Presentation

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna Hajishirzi Many slides over the course adapted from either Luke Zettlemoyer, Pieter Abbeel, Dan Klein, Stuart Russell or Andrew Moore 1

Outline § Bayesian Networks Inference § Exact Inference: Variable Elimination § Approximate Inference: Sampling

Bayes Net Representation 3

Reachability (D-Separation) ! Question: Are X and Y Active Triples Inactive Triples conditionally independent (dependent) (Independent) given evidence vars {Z}? ! Yes, if X and Y “separated” by Z ! Look for active paths from X to Y ! No active paths = independence! ! A path is active if each triple is active: ! Causal chain A → B → C where B is unobserved (either direction) ! Common cause A ← B → C where B is unobserved ! Common effect (aka v-structure) A → B ← C where B or one of its descendents is observed ! All it takes to block a path is a single inactive segment ! 5

Bayes Net Joint Distribution B# P(B)# E# P(E)# B # E # +b# 0.001# +e# 0.002# Qb# 0.999# Qe# 0.998# A # A# J# P(J|A)# A# M# P(M|A)# B# E# A# P(A|B,E)# +a# +j# 0.9# +a# +m# 0.7# +b# +e# +a# 0.95# +a# Qj# 0.1# +a# Qm# 0.3# J # M # +b# +e# Qa# 0.05# Qa# +j# 0.05# Qa# +m# 0.01# +b# Qe# +a# 0.94# Qa# Qj# 0.95# Qa# Qm# 0.99# +b# Qe# Qa# 0.06# Qb# +e# +a# 0.29# Qb# +e# Qa# 0.71# Qb# Qe# +a# 0.001# Qb# Qe# Qa# 0.999# 6

Bayes Net Joint Distribution B# P(B)# E# P(E)# B # E # +b# 0.001# +e# 0.002# Qb# 0.999# Qe# 0.998# A # A# J# P(J|A)# A# M# P(M|A)# B# E# A# P(A|B,E)# +a# +j# 0.9# +a# +m# 0.7# +b# +e# +a# 0.95# +a# Qj# 0.1# +a# Qm# 0.3# J # M # +b# +e# Qa# 0.05# Qa# +j# 0.05# Qa# +m# 0.01# +b# Qe# +a# 0.94# Qa# Qj# 0.95# Qa# Qm# 0.99# +b# Qe# Qa# 0.06# Qb# +e# +a# 0.29# Qb# +e# Qa# 0.71# Qb# Qe# +a# 0.001# Qb# Qe# Qa# 0.999# 7

Probabilistic Inference § Probabilistic inference: compute a desired probability from other known probabilities (e.g. conditional from joint) § We generally compute conditional probabilities § P(on time | no reported accidents) = 0.90 § These represent the agent ’ s beliefs given the evidence § Probabilities change with new evidence: § P(on time | no accidents, 5 a.m.) = 0.95 § P(on time | no accidents, 5 a.m., raining) = 0.80 § Observing new evidence causes beliefs to be updated

Inference ! Inference:#calcula)ng#some# ! Examples:# useful#quan)ty#from#a#joint# ! Posterior#probability# probability#distribu)on# ! Most#likely#explana)on:# 9

Inference by Enumeration § General case: § Evidence variables: § Query* variable: All variables § Hidden variables: § We want: § First, select the entries consistent with the evidence § Second, sum out H to get joint of Query and evidence: § Finally, normalize the remaining entries to conditionalize § Obvious problems: § Worst-case time complexity O(d n ) § Space complexity O(d n ) to store the joint distribution

Inference by Enumerataion P ( Antilock | observed variables ) = ? 12

Variable Elimination § Why is inference by enumeration so slow? § You join up the whole joint distribution before you sum out the hidden variables § You end up repeating a lot of work! § Idea: interleave joining and marginalizing! § Called “ Variable Elimination ” § Still NP-hard, but usually much faster than inference by enumeration § We ’ ll need some new notation to define VE

Review T W P § Joint distribution: P(X,Y) hot sun 0.4 § Entries P(x,y) for all x, y § Sums to 1 hot rain 0.1 cold sun 0.2 cold rain 0.3 § Selected joint: P(x,Y) T W P § A slice of the joint distribution § Entries P(x,y) for fixed x, all y cold sun 0.2 § Sums to P(x) cold rain 0.3

Review § Specified family: P(y | X) T W P § Entries P(y | x) for fixed y, hot rain 0.2 but for all x § Sums to … who knows! cold rain 0.6 § In general, when we write P(Y 1 … Y N | X 1 … X M ) § It is a “ factor, ” a multi-dimensional array § Its values are all P(y 1 … y N | x 1 … x M ) § Any assigned X or Y is a dimension missing (selected) from the array

Inference § Inference is expensive with enumeration § Variable elimination: § Interleave joining and marginalization: Store initial results and then join with the rest

Example: Traffic Domain § Random +r ¡ 0.1 ¡ R -‑r ¡ 0.9 ¡ Variables § R: Raining T +r ¡ +t ¡ 0.8 ¡ § T: Traffic +r ¡ -‑t ¡ 0.2 ¡ § L: Late for class! -‑r ¡ +t ¡ 0.1 ¡ L -‑r ¡ -‑t ¡ 0.9 ¡ § First query: P(L) +t ¡ +l ¡ 0.3 ¡ +t ¡ -‑l ¡ 0.7 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑t ¡ -‑l ¡ 0.9 ¡

Variable Elimination Outline § Maintain a set of tables called factors § Initial factors are local CPTs (one per node) +r ¡ +t ¡ 0.8 ¡ +t ¡ +l ¡ 0.3 ¡ +r ¡ 0.1 ¡ +r ¡ -‑t ¡ 0.2 ¡ +t ¡ -‑l ¡ 0.7 ¡ -‑r ¡ 0.9 ¡ -‑r ¡ +t ¡ 0.1 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑r ¡ -‑t ¡ 0.9 ¡ -‑t ¡ -‑l ¡ 0.9 ¡ § Any known values are selected § E.g. if we know , the initial factors are +r ¡ +t ¡ 0.8 ¡ +r ¡ 0.1 ¡ +t ¡ +l ¡ 0.3 ¡ +r ¡ -‑t ¡ 0.2 ¡ -‑r ¡ 0.9 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑r ¡ +t ¡ 0.1 ¡ -‑r ¡ -‑t ¡ 0.9 ¡ § VE: Alternately join factors and eliminate variables

Operation 1: Join Factors § First basic operation: joining factors § Combining factors: § Just like a database join § Get all factors over the joining variable § Build a new factor over the union of the variables involved § Example: Join on R R,T R +r ¡ +t ¡ 0.8 ¡ +r ¡ +t ¡ 0.08 ¡ +r ¡ 0.1 ¡ +r ¡ -‑t ¡ 0.2 ¡ +r ¡ -‑t ¡ 0.02 ¡ -‑r ¡ 0.9 ¡ T -‑r ¡ +t ¡ 0.1 ¡ -‑r ¡ +t ¡ 0.09 ¡ -‑r ¡ -‑t ¡ 0.9 ¡ -‑r ¡ -‑t ¡ 0.81 ¡ § Computation for each entry: pointwise products

Example: Multiple Joins +r ¡ 0.1 ¡ Join R R -‑r ¡ 0.9 ¡ R, T +r ¡ +t ¡ 0.08 ¡ +r ¡ -‑t ¡ 0.02 ¡ -‑r ¡ +t ¡ 0.09 ¡ T +r ¡ +t ¡ 0.8 ¡ -‑r ¡ -‑t ¡ 0.81 ¡ +r ¡ -‑t ¡ 0.2 ¡ L -‑r ¡ +t ¡ 0.1 ¡ -‑r ¡ -‑t ¡ 0.9 ¡ L +t ¡ +l ¡ 0.3 ¡ +t ¡ +l ¡ 0.3 ¡ +t ¡ -‑l ¡ 0.7 ¡ +t ¡ -‑l ¡ 0.7 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑t ¡ -‑l ¡ 0.9 ¡ -‑t ¡ -‑l ¡ 0.9 ¡

Example: Multiple Joins R, T, L +r ¡ +t ¡ 0.08 ¡ +r ¡ -‑t ¡ 0.02 ¡ +r ¡ +t ¡ +l ¡ Join T -‑r ¡ +t ¡ 0.09 ¡ R, T 0.024 ¡ -‑r ¡ -‑t ¡ 0.81 ¡ +r ¡ +t ¡ -‑l ¡ 0.056 ¡ +r ¡ -‑t ¡ +l ¡ 0.002 ¡ L +r ¡ -‑t ¡ -‑l ¡ 0.018 ¡ +t ¡ +l ¡ 0.3 ¡ -‑r ¡ +t ¡ +l ¡ 0.027 ¡ +t ¡ -‑l ¡ 0.7 ¡ -‑r ¡ +t ¡ -‑l ¡ 0.063 ¡ -‑t ¡ +l ¡ 0.1 ¡ -‑r ¡ -‑t ¡ +l ¡ 0.081 ¡ -‑t ¡ -‑l ¡ 0.9 ¡ -‑r ¡ -‑t ¡ -‑l ¡ 0.729 ¡

Operation 2: Eliminate § Second basic operation: marginalization § Take a factor and sum out a variable § Shrinks a factor to a smaller one § A projection operation § Example: +r ¡ +t ¡ 0.08 ¡ +t ¡ 0.17 ¡ +r ¡ -‑t ¡ 0.02 ¡ -‑t ¡ 0.83 ¡ -‑r ¡ +t ¡ 0.09 ¡ -‑r ¡ -‑t ¡ 0.81 ¡

Multiple Elimination R, T, L T, L L Sum Sum +r ¡ +t ¡ +l ¡ 0.024 ¡ out T out R +r ¡ +t ¡ -‑l ¡ 0.056 ¡ +r ¡ -‑t ¡ +l ¡ +l ¡ 0.134 ¡ 0.051 ¡ +t ¡ +l ¡ 0.002 ¡ -‑l ¡ 0.886 ¡ +r ¡ -‑t ¡ -‑l ¡ 0.119 ¡ +t ¡ -‑l ¡ 0.018 ¡ -‑r ¡ +t ¡ +l ¡ 0.083 ¡ -‑t ¡ +l ¡ 0.027 ¡ -‑r ¡ +t ¡ -‑l ¡ 0.747 ¡ -‑t ¡ -‑l ¡ 0.063 ¡ -‑r ¡ -‑t ¡ +l ¡ 0.081 ¡ -‑r ¡ -‑t ¡ -‑l ¡ 0.729 ¡

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna - PowerPoint PPT Presentation

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna Hajishirzi Many slides over the course adapted from either Luke Zettlemoyer, Pieter Abbeel, Dan Klein, Stuart Russell or Andrew Moore 1 Outline Bayesian Networks

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Yi-Shu Wei (TA) Hunter Whalen (TA)

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Jennifer Hanson (TA) Evan Herbst

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

1/29/10 CSE 3402: Intro to Artificial Intelligence CSE 3402: Intro to Artificial Intelligence

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

8th November 2019 Artificial Intelligence Finance Institute NYU Courant Artificial Intelligence

Keep it Active: Engaging Students in Virtual Classrooms Electronic Seminar on Mathematics

Constrained optimization Newton-like Linear Integer methods programming programming Descent

1 Cookies Cookie crumbs HTTP is stateless: doesn't remember from one request to next

Web Security, Part 2 CS 161 - Computer Security Profs. Vern Paxson & David Wagner TAs: John

Automated Information Collection in Windows NT Networks Dirk Reimers reimers@secunet.de secunet

A unifying computational framework for teaching and active learning Scott Cheng-Hsin Yang, Wai

G. Thomas Welsh Director HR Benefits and Services Economic Aspects of Health Management

I SPA-Tubes wit h YAP:Ce Act ive Windows f or X and Gamma Ray I maging. C. D' Ambr osio 1 , F. de

Sambuz

Useful Links

Newsletter

Mail Us

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna - PowerPoint PPT Presentation

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna Hajishirzi Many slides over the course adapted from either Luke Zettlemoyer, Pieter Abbeel, Dan Klein, Stuart Russell or Andrew Moore 1 Outline Bayesian Networks

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Yi-Shu Wei (TA) Hunter Whalen (TA)

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Jennifer Hanson (TA) Evan Herbst

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

1/29/10 CSE 3402: Intro to Artificial Intelligence CSE 3402: Intro to Artificial Intelligence

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

8th November 2019 Artificial Intelligence Finance Institute NYU Courant Artificial Intelligence

Keep it Active: Engaging Students in Virtual Classrooms Electronic Seminar on Mathematics

Constrained optimization Newton-like Linear Integer methods programming programming Descent

1 Cookies Cookie crumbs HTTP is stateless: doesn't remember from one request to next

Web Security, Part 2 CS 161 - Computer Security Profs. Vern Paxson &amp; David Wagner TAs: John

Automated Information Collection in Windows NT Networks Dirk Reimers reimers@secunet.de secunet

A unifying computational framework for teaching and active learning Scott Cheng-Hsin Yang, Wai

G. Thomas Welsh Director HR Benefits and Services Economic Aspects of Health Management

I SPA-Tubes wit h YAP:Ce Act ive Windows f or X and Gamma Ray I maging. C. D' Ambr osio 1 , F. de

Sambuz

Useful Links

Newsletter

Mail Us

Web Security, Part 2 CS 161 - Computer Security Profs. Vern Paxson & David Wagner TAs: John