An Introduction to Bayesian Network Inference using Variable Elimination Jhonatan Oliveira Department of Computer Science University of Regina
Outline • Introduction • Background • Bayesian networks • Variable Elimination • Repeated Computation • Conclusions
Introduction Bayesian networks are probabilistic graphical models used for reasoning under uncertainty.
Uncertainty • Conflicting information • Missing information [Figure: two competing explanations for "dog out" — "family out" and "bowel problem"]
Real World Applications • TrueSkill™ • Turbo Codes • Mars Exploration Rover
Background Probability theory: introducing joint probability distribution, chain rule, and conditional independence
Joint Probability Distribution • A multivariate function over a finite set of variables • Assigns a real number between 0 and 1 to each configuration (combination of the variables' values) • Summing all assigned real numbers yields 1
Joint Probability Distribution
  Family Out  Bowel Problem  Lights On  Dog Out  Hear Bark  P(L,F,D,B,H)
  0           0              0          0        0          0.01
  0           0              0          0        1          0.25
  0           0              0          1        0          0.08
  0           0              1          0        0          0.19
  ...         (28 more rows)
A query is answered directly from this table: a query over all five variables (1st query) reads off a single row, while a query over fewer variables (2nd query) sums the probabilities of all rows consistent with it.
Joint Probability Distribution The size issue: for five binary variables, 2^5 = 32 probabilities must be specified.
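To make the row-summing idea concrete, here is a minimal Python sketch of a joint distribution stored as a lookup table. The probability values are invented for illustration (the slides show only a fragment of the real table), and a full P(L,F,D,B,H) would need all 32 entries.

```python
# Toy joint probability distribution over two binary variables (L, F).
# The numbers are made up for illustration; a joint over five binary
# variables would need 2**5 = 32 such entries.
jpd = {
    (0, 0): 0.45,   # (L, F) -> P(L, F)
    (0, 1): 0.05,
    (1, 0): 0.15,
    (1, 1): 0.35,
}

assert abs(sum(jpd.values()) - 1.0) < 1e-9   # all entries sum to 1

# A query is answered by summing the rows consistent with it,
# e.g. P(L = 1) = sum over F of P(L = 1, F).
p_l1 = sum(p for (l, f), p in jpd.items() if l == 1)
print(p_l1)   # 0.5
```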
Chain Rule P(L,F,D,B,H) = P(L) P(F|L) P(D|L,F) P(B|L,F,D) P(H|L,F,D,B) — each factor is a Conditional Probability Table (CPT)
Chain Rule The size issue: 2 + 4 + 8 + 16 + 32 = 62 probabilities across the five CPTs — even more than the joint table itself.
Conditional Independence Given the chain family out → dog out → hear bark: once dog out is observed, hear bark is independent of family out, written I(family out, dog out, hear bark).
Conditional Independence • Given I(X,Y,Z): P(X|Y,Z) = P(X|Y) • Example: given I(D,F,L), P(D|L,F) = P(D|F)
Chain Rule & Conditional Independence
P(L,F,D,B,H)
  = P(L) P(F|L) P(D|L,F) P(B|L,F,D) P(H|L,F,D,B)   [Chain Rule]
  = P(L) P(F|L) P(D|F) P(B|L,F,D) P(H|L,F,D,B)     [I(D,F,L)]
  = P(L) P(F|L) P(D|F) P(B|L,D) P(H|L,F,D,B)       [I(B,{L,D},F)]
  = ?
Bayesian network A graphical interpretation of probability theory
Directed Acyclic Graph [Figure: nodes Family out (F), Bowel problem (B), Lights on (L), Dog out (D), Hear bark (H), with edges F → L, F → D, B → D, D → H]
Testing Independences A set of variables X is d-separated from a set of variables Y given a set of variables Z in the DAG if all paths from X to Y are blocked by Z.
Testing Independences Is F d-separated from H given D? Yes: the only path, F → D → H, is blocked once D is observed, so I(F,D,H) holds in P(L,F,D,B,H).
Testing Independences With the CPTs P(F), P(B), P(L|F), P(D|B,F), P(H|D) attached to the DAG, the size issue: 2 + 2 + 4 + 8 + 4 = 20 probabilities.
Bayesian Network A directed acyclic graph together with one conditional probability table per variable: P(U) = ∏ P(v | Pa(v)) over all variables v in the DAG, where Pa(v) are the parents of v. [Figure: the DAG with P(F), P(B), P(L|F), P(D|B,F), P(H|D) attached to its nodes]
Bayesian Network For this DAG: P(L,F,D,B,H) = P(L|F) P(F) P(B) P(D|B,F) P(H|D)
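As a sketch of what this factorization buys, the snippet below stores one small CPT per variable and evaluates the joint for a single configuration. All CPT numbers are hypothetical, since the slides do not list them.

```python
# Hypothetical CPTs (values NOT from the slides), one per variable:
# each entry gives P(variable = 1 | parent values).
cpt = {
    'F': 0.15,                              # P(F = 1)
    'B': 0.01,                              # P(B = 1)
    'L': {0: 0.05, 1: 0.60},                # P(L = 1 | F)
    'D': {(0, 0): 0.30, (0, 1): 0.90,       # P(D = 1 | B, F)
          (1, 0): 0.97, (1, 1): 0.99},
    'H': {0: 0.01, 1: 0.70},                # P(H = 1 | D)
}

def bern(p_true, value):
    """P(X = value) for a binary variable with P(X = 1) = p_true."""
    return p_true if value == 1 else 1.0 - p_true

def joint(l, f, d, b, h):
    """P(L,F,D,B,H) = P(L|F) P(F) P(B) P(D|B,F) P(H|D)."""
    return (bern(cpt['L'][f], l) * bern(cpt['F'], f) * bern(cpt['B'], b)
            * bern(cpt['D'][(b, f)], d) * bern(cpt['H'][d], h))

print(joint(l=1, f=1, d=1, b=0, h=1))   # one of the 32 joint probabilities
```

In this binary example, 1 + 1 + 2 + 4 + 2 = 10 independent numbers determine all 32 joint probabilities.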
Inference To answer a query such as P(L), only part of the factorization P(L,F,D,B,H) = P(L|F) P(F) P(B) P(D|B,F) P(H|D) is needed: multiply P(L|F) by P(F) to obtain P(L,F), then sum F out to obtain P(L).
Inference: Multiplication — P(L|F) × P(F) = P(L,F)
  L F | P(L|F)        F | P(F)         L F | P(L,F)
  0 0 | 0.8           0 | 0.8          0 0 | 0.64
  0 1 | 0.3     ×     1 | 0.3    =     0 1 | 0.09
  1 0 | 0.2                            1 0 | 0.16
  1 1 | 0.7                            1 1 | 0.21
Inference: Marginalization — summing F out of P(L,F) gives P(L)
  L F | P(L,F)            L | P(L)
  0 0 | 0.2               0 | 0.5
  0 1 | 0.3       →       1 | 0.5
  1 0 | 0.4
  1 1 | 0.1
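These two operations are easy to sketch directly. Below is a minimal Python version, assuming binary variables and a factor stored as a pair (variable names, table mapping 0/1 assignments to numbers); the example calls reuse the numbers from the two slides above.

```python
from itertools import product

def multiply(f, g):
    """Pointwise product of two factors over the union of their variables."""
    f_vars, f_tab = f
    g_vars, g_tab = g
    out_vars = f_vars + tuple(v for v in g_vars if v not in f_vars)
    out_tab = {}
    for vals in product((0, 1), repeat=len(out_vars)):
        row = dict(zip(out_vars, vals))
        out_tab[vals] = (f_tab[tuple(row[v] for v in f_vars)]
                         * g_tab[tuple(row[v] for v in g_vars)])
    return out_vars, out_tab

def marginalize(f, var):
    """Sum `var` out of factor `f`."""
    f_vars, f_tab = f
    keep = tuple(v for v in f_vars if v != var)
    out_tab = {}
    for vals, p in f_tab.items():
        key = tuple(x for x, v in zip(vals, f_vars) if v != var)
        out_tab[key] = out_tab.get(key, 0.0) + p
    return keep, out_tab

# Multiplication example from the slide: P(L|F) x P(F).
P_L_given_F = (('L', 'F'), {(0, 0): 0.8, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.7})
P_F = (('F',), {(0,): 0.8, (1,): 0.3})
print(multiply(P_L_given_F, P_F)[1])   # 0.64, 0.09, 0.16, 0.21 (up to float rounding)

# Marginalization example from the slide: sum F out of P(L,F).
P_LF_example = (('L', 'F'), {(0, 0): 0.2, (0, 1): 0.3, (1, 0): 0.4, (1, 1): 0.1})
print(marginalize(P_LF_example, 'F')[1])   # {(0,): 0.5, (1,): 0.5}
```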
Inference Algorithms • Shafer-Shenoy • Lauritzen-Spiegelhalter • Hugin • Lazy Propagation • Variable Elimination
Variable Elimination Eliminates, one by one, all variables that are not in the query or evidence
Variable Elimination Algorithm
Input: factorization F, elimination ordering L, query X, evidence Y
Output: P(X|Y)
For each variable v in L:
    multiply all CPTs in F involving v, yielding CPT P1
    marginalize v out of P1
    remove all CPTs involving v from F
    append P1 to F
Multiply all remaining CPTs in F, yielding P(X,Y)
return P(X|Y) = P(X,Y) / P(Y)
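A minimal Python sketch of this elimination loop, assuming binary variables and the multiply and marginalize helpers from the previous sketch; the final normalization by P(Y) is left to the caller.

```python
def eliminate(factors, ordering):
    """Sum the variables in `ordering` out of the factorization, one at a time,
    returning a single factor over the remaining (query and evidence) variables."""
    factors = list(factors)
    for var in ordering:
        related = [f for f in factors if var in f[0]]    # CPTs involving var
        factors = [f for f in factors if var not in f[0]]
        if not related:
            continue
        prod = related[0]
        for f in related[1:]:
            prod = multiply(prod, f)                     # "P1" in the pseudocode
        factors.append(marginalize(prod, var))           # marginalize var out of P1
    result = factors[0]
    for f in factors[1:]:                                # multiply the remaining CPTs
        result = multiply(result, f)
    return result                                        # joint over query + evidence
```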
Variable Elimination Algorithm Query: P(H | L)? For the network with P(L,F,D,B,H) = P(L|F) P(F) P(B) P(D|B,F) P(H|D)
Variable Elimination Algorithm
Input:
  Factorization: P(L|F) P(F) P(B) P(D|B,F) P(H|D)
  Query variable: H
  Evidence variable: L=1
  Elimination ordering: B, F, D
Variable Elimination Algorithm
Eliminating B:
  P(B,D|F) = P(B) P(D|B,F)
  P(D|F) = marginalize B from P(B,D|F)
  Factorization: P(L|F) P(F) P(H|D) P(D|F)
Eliminating F:
  P(D,F,L) = P(L|F) P(F) P(D|F)
  P(D,L) = marginalize F from P(D,F,L)
  Factorization: P(H|D) P(D,L)
Variable Elimination Algorithm
Eliminating D:
  P(D,H,L) = P(H|D) P(D,L)
  P(H,L) = marginalize D from P(D,H,L)
  Factorization: P(H,L)
Output:
  P(L) = marginalize H from P(H,L)
  P(H|L) = P(H,L) / P(L)
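Putting the walk-through together: the run below uses the `eliminate` sketch above with hypothetical CPT numbers (the slides give no full tables), eliminates B, F, D, and then divides P(H, L=1) by P(L=1).

```python
# Hypothetical CPTs in (variables, table) form; values are invented.
P_F   = (('F',), {(0,): 0.85, (1,): 0.15})
P_B   = (('B',), {(0,): 0.99, (1,): 0.01})
P_LF  = (('L', 'F'), {(0, 0): 0.95, (1, 0): 0.05, (0, 1): 0.40, (1, 1): 0.60})
P_DBF = (('D', 'B', 'F'), {(0, 0, 0): 0.70, (1, 0, 0): 0.30,
                           (0, 0, 1): 0.10, (1, 0, 1): 0.90,
                           (0, 1, 0): 0.03, (1, 1, 0): 0.97,
                           (0, 1, 1): 0.01, (1, 1, 1): 0.99})
P_HD  = (('H', 'D'), {(0, 0): 0.99, (1, 0): 0.01, (0, 1): 0.30, (1, 1): 0.70})

factors = [P_LF, P_F, P_B, P_DBF, P_HD]
p_hl = eliminate(factors, ['B', 'F', 'D'])      # factor over H and L, i.e. P(H,L)

# Condition on the evidence L = 1 and normalize: P(H|L=1) = P(H,L=1) / P(L=1).
names, table = p_hl
h_i, l_i = names.index('H'), names.index('L')
rows = {vals[h_i]: p for vals, p in table.items() if vals[l_i] == 1}
p_l1 = sum(rows.values())                       # P(L = 1)
print({h: p / p_l1 for h, p in rows.items()})   # P(H = 0|L = 1), P(H = 1|L = 1)
```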
Repeated Computation Variable Elimination can repeat the same computation when answering different queries
Variable Elimination Algorithm Query: P(H | F)? Same network: P(L,F,D,B,H) = P(L|F) P(F) P(B) P(D|B,F) P(H|D)
Variable Elimination Algorithm
Input:
  Factorization: P(L|F) P(F) P(B) P(D|B,F) P(H|D)
  Query variable: H
  Evidence variable: F=1
  Elimination ordering: L, B, D
Variable Elimination Algorithm
Eliminating L:
  1(F) = marginalize L from P(L|F)   (a unit factor: every entry is 1)
  Factorization: P(F) P(B) P(D|B,F) P(H|D)
Eliminating B:
  P(B,D|F) = P(B) P(D|B,F)
  P(D|F) = marginalize B from P(B,D|F)
  Factorization: P(F) P(H|D) P(D|F)
Variable Elimination Algorithm
Eliminating D:
  P(D,H|F) = P(H|D) P(D|F)
  P(H|F) = marginalize D from P(D,H|F)
  Factorization: P(F) P(H|F)
Multiply all remaining CPTs:
  P(F,H) = P(F) P(H|F)
Output:
  P(F) = marginalize H from P(F,H)
  P(H|F) = P(F,H) / P(F)
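The second query follows the same pattern. Continuing the sketch above (same hypothetical CPTs, same `eliminate` helper), only the ordering and the final conditioning change.

```python
# Same factorization, different query: eliminate L, B, D, keep H and F.
p_hf = eliminate([P_LF, P_F, P_B, P_DBF, P_HD], ['L', 'B', 'D'])

# Condition on the evidence F = 1 and normalize: P(H|F=1) = P(F=1,H) / P(F=1).
names, table = p_hf
h_i, f_i = names.index('H'), names.index('F')
rows = {vals[h_i]: p for vals, p in table.items() if vals[f_i] == 1}
p_f1 = sum(rows.values())                       # P(F = 1)
print({h: p / p_f1 for h, p in rows.items()})   # P(H = 0|F = 1), P(H = 1|F = 1)
```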
Repeated Computation
Both queries perform exactly the same work when eliminating B:
  For P(H|L):  P(B,D|F) = P(B) P(D|B,F);  P(D|F) = marginalize B from P(B,D|F);  Factorization: P(L|F) P(F) P(H|D) P(D|F)
  For P(H|F):  P(B,D|F) = P(B) P(D|B,F);  P(D|F) = marginalize B from P(B,D|F);  Factorization: P(F) P(H|D) P(D|F)
Repeated Computation • Store past computation • Find relevant computation for new query • Retrieve computation that can be reused
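One simple way to realize these three steps, sketched below under the same factor representation and reusing the multiply and marginalize helpers from earlier: memoize each elimination step on the variable being removed and the exact CPTs involved, so the B step of the second query retrieves the P(D|F) already computed for the first. This is only an illustrative cache, not the join tree machinery of the next slides.

```python
_cache = {}   # (eliminated variable, involved factors) -> resulting factor

def eliminate_var(factors, var):
    """Eliminate `var` from the factorization, reusing a stored result when
    exactly the same CPTs are involved (e.g. summing B out of P(B) P(D|B,F)
    appears in both the P(H|L) and the P(H|F) computations)."""
    related = [f for f in factors if var in f[0]]
    rest = [f for f in factors if var not in f[0]]
    key = (var, frozenset((f[0], frozenset(f[1].items())) for f in related))
    if key not in _cache:                       # compute once ...
        prod = related[0]
        for f in related[1:]:
            prod = multiply(prod, f)
        _cache[key] = marginalize(prod, var)
    return rest + [_cache[key]]                 # ... retrieve afterwards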
Variable Elimination as a Join Tree [Figure: answering P(H|L) — a join tree with nodes {D,B,F}, {D,F,L}, {D,H,L} and {H,L}; the CPTs P(B), P(D|B,F), P(L|F), P(F), P(H|D) are assigned to the nodes, and the intermediate factors P(D|F), P(D,L), P(H,L) are passed between them]
Variable Elimination as a Join Tree [Figure: the same join tree, now used for answering P(H|F)]
Conclusions • Bayesian networks are useful probabilistic graphical models • Inference can be performed by Variable Elimination • Future work will investigate how to avoid repeated computation during Variable Elimination
References • Bonaparte Project: http://www.bonaparte-dvi.com/ • McEliece, R. J., MacKay, D. J. C., & Cheng, J.-F. (1998). Turbo decoding as an instance of Pearl's "belief propagation" algorithm. IEEE Journal on Selected Areas in Communications, 16(2), 140–152. doi:10.1109/49.661103. ISSN 0733-8716. • Microsoft TrueSkill: http://research.microsoft.com/en-us/projects/trueskill/ • Serrano, N. (2006). A Bayesian framework for landing site selection during autonomous spacecraft descent. In IEEE/RSJ International Conference on Intelligent Robots and Systems, Beijing, pp. 5112–5117. • Koller, D., & Friedman, N. (2009). Probabilistic Graphical Models: Principles and Techniques. MIT Press. • Darwiche, A. (2009). Modeling and Reasoning with Bayesian Networks (1st ed.). Cambridge University Press. • Shafer, G., & Shenoy, P. P. (1989). Probability propagation. • Charniak, E. (1991). Bayesian networks without tears. AI Magazine, 12(4), 50–63.