CS 331: Artificial Intelligence in the last column tails black 3 - PDF document

Full Joint Probability Distributions Coin Card Candy P(Coin, Card, Candy) tails black 1 0.15 tails black 2 0.06 The probabilities CS 331: Artificial Intelligence in the last column tails black 3 0.09 sum to 1 tails red 1 0.02 Fundamentals of Probability II tails red 2 0.06 tails red 3 0.12 heads black 1 0.075 heads black 2 0.03 heads black 3 0.045 heads red 1 0.035 heads red 2 0.105 heads red 3 0.21 Thanks to Andrew Moore for some course material 1 2 This cell means P(Coin=heads, Card=red, Candy=3) = 0.21 Joint Probability Distribution Joint Probability Distribution From the full joint probability distribution, we can calculate any P( Coin = heads OR Card = red ) = probability involving these three random variables. P( Coin=heads, Card=black, Candy=1 ) + P( Coin=heads, Card=black , Candy=2 ) + P( Coin=heads, Card=black , Candy=3 ) + e.g. P( Coin = heads OR Card = red ) P( Coin=tails, Card=red, Candy=1 ) + P( Coin=tails, Card=red , Candy=2 ) + P( Coin=tails, Card=red , Candy=3 ) + P( Coin=heads, Card=red, Candy=1 ) + P( Coin=heads, Card=red, Candy=2 ) + P( Coin=heads, Card=red, Candy=3 ) = 0.075 + 0.03 + 0.045 + 0.02 + 0.06 + 0.12 + 0.035 + 0.105 + 0.21 = 0.7 Marginalization Marginalization We can even calculate marginal probabilities Or even: (the probability distribution over a subset of the P( Card=black ) = variables) e.g.: P( Coin=heads, Card=black, Candy=1 ) + P( Coin=heads, Card=black, Candy=2 ) + P( Coin=tails, Card=red ) = P( Coin=heads, Card=black, Candy=3 ) + P( Coin=tails, Card=black , Candy=1 ) + P( Coin=tails, Card=red , Candy=1 ) + P( Coin=tails, Card=black, Candy=2 ) + P( Coin=tails, Card=red, Candy=2 ) + P( Coin=tails, Card=black, Candy=3 ) = 0.075 + 0.03 + 0.045 + 0.015 + 0.06 + 0.09 = 0.315 P( Coin=tails, Card=red, Candy=3 ) = 0.02 + 0.06 + 0.12 = 0.2 5 6 1

Marginalization Marginalization The general marginalization rule for any sets For continuous variables, marginalization of variables Y and Z : involves taking the integral:    ( ) ( , z ) P Y P Y  ( ) ( , ) P Y P Y z d z z z is over all possible combinations of values of Z or (remember Z is a set)   P ( Y ) P ( Y | z ) P ( z ) z 7 8 CW: Practice Conditional Probabilities Coin Card Candy P(Coin, Card, Candy) tails black 1 0.15 tails black 2 0.06 tails black 3 0.09 tails red 1 0.02 tails red 2 0.06 tails red 3 0.12 heads black 1 0.075 heads black 2 0.03 heads black 3 0.045 heads red 1 0.035 heads red 2 0.105 heads red 3 0.21 9 Conditional Probabilities Conditional Probabilities Note that 1/P( Card=black ) remains constant in the two equations. 2

Normalization CW: Practice Coin Card Candy P(Coin, Card, Candy) tails black 1 0.15 tails black 2 0.06 tails black 3 0.09 tails red 1 0.02 tails red 2 0.06 tails red 3 0.12 heads black 1 0.075 heads black 2 0.03 heads black 3 0.045 heads red 1 0.035 heads red 2 0.105 heads red 3 0.21 13 14 Inference Inference • Suppose you get a query such as • We will write the query as P ( X | e ) P( Card = red | Coin = heads ) This is a probability distribution hence the boldface Coin is called the evidence variable because we observe it. More generally, it’s a set of variables. X = Query variable (a single variable for now) Card is called the query variable (we’ll E = Set of evidence variables assume it’s a single variable for now) e = the set of observed values for the evidence variables Y = Unobserved variables There are also unobserved (aka hidden) variables like Candy 15 16 Inference Inference      ( | ) ( , ) ( , , ) P X e P X e P X e y We will write the query as P ( X | e ) y      P ( X | e ) P ( X , e ) P ( X , e , y ) Computing P ( X | e ) involves going through all possible entries of the full joint probability y Summation is over all possible distribution and adding up probabilities with X = x i , combinations of values of the E = e , and Y = y unobserved variables Y X = Query variable (a single variable for now) Suppose you have a domain with n Boolean variables. What is the space and time complexity of E = Set of evidence variables computing P( X | e )? e = the set of observed values for the evidence variables Y = Unobserved variables 18 3

Independence Independence • How do you avoid the exponential space We say that variables X and Y are and time complexity of inference? independent if any of the following hold: (note that they are all equivalent) • Use independence (aka factoring)  ( X | Y ) ( X ) P P or  ( | ) ( ) P Y X P Y or  ( , ) ( ) ( ) P X Y P X P Y 19 20 Independence Independence 21 22 Why is independence useful? Independence Another example: • Suppose you have n coin flips and you want to calculate the joint distribution P ( C 1 , …, C n ) • If the coin flips are not independent, you need 2 n This table has 2 values This table has 3 values values in the table • If the coin flips are independent, then n • You now need to store 5 values to calculate P ( Coin , Card ,   P ( C ,..., C ) P ( C ) Each P( C i ) table has 2 Candy ) 1 n i entries and there are n of  i 1 • Without independence, we needed 6 them for a total of 2 n values 23 24 4

Independence CW: Practice Coin Card Candy P(Coin, Card, Candy) • Independence is powerful! Are Coin and Card tails black 1 0.15 tails black 2 0.06 • It required extra domain knowledge. A independent in this tails black 3 0.09 different kind of knowledge than numerical distribution? tails red 1 0.02 probabilities. It needed an understanding of tails red 2 0.06 relationships among the random variables. tails red 3 0.12 Recall: heads black 1 0.075  ( | ) ( ) P X Y P X heads black 2 0.03 heads black 3 0.045  P ( Y | X ) P ( Y ) heads red 1 0.035  heads red 2 0.105 ( , ) ( ) ( ) P X Y P X P Y heads red 3 0.21 for independent X and Y 25 26 5

CS 331: Artificial Intelligence in the last column tails black 3 - PDF document

Full Joint Probability Distributions Coin Card Candy P(Coin, Card, Candy) tails black 1 0.15 tails black 2 0.06 The probabilities CS 331: Artificial Intelligence in the last column tails black 3 0.09 sum to 1 tails red 1 0.02

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

KY 331 Widening US 60 to Rinaldo Rd. MP 0.436 to 2.62 KY 331 Widening Daviess County PL &

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Shortest Paths Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331, Spring

Linear Programming Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331, Spring

All Pairs Shortest Paths Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331,

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

Statistical learning Chapter 20, Sections 13 Chapter 20, Sections 13 1 Outline

What s the probability that the next candy is lime? What is P(d i+1 | d 1 ,, d i ) =

The geometry of Boolean algebra Chris Heunen 1 / 22 Boolean algebra: example , ,

Market Trend and Production Challenge of Organic Tropical and Sub- Tropical Fruits Vitoon

Statistical Learning February 4, 2010 CS 489 / 698 University of Waterloo Outline

LEARNING PROBABILISTIC MODELS AIMA CHAPTER 20 Instructor: Sael Lee Materials form AIMA resources,

Unit2: Probabilityanddistributions 2. Bayes theorem and Bayesian inference If you

Example Suppose there are five kinds of bags of candies: 10% are h 1 : 100% cherry candies 20% are

CS 331: Artificial Intelligence in the last column tails black 3 - PDF document

Full Joint Probability Distributions Coin Card Candy P(Coin, Card, Candy) tails black 1 0.15 tails black 2 0.06 The probabilities CS 331: Artificial Intelligence in the last column tails black 3 0.09 sum to 1 tails red 1 0.02

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

KY 331 Widening US 60 to Rinaldo Rd. MP 0.436 to 2.62 KY 331 Widening Daviess County PL &amp;

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Shortest Paths Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331, Spring

Linear Programming Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331, Spring

All Pairs Shortest Paths Eric Price UT Austin CS 331, Spring 2020 Coronavirus Edition CS 331,

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

Statistical learning Chapter 20, Sections 13 Chapter 20, Sections 13 1 Outline

What s the probability that the next candy is lime? What is P(d i+1 | d 1 ,, d i ) =

The geometry of Boolean algebra Chris Heunen 1 / 22 Boolean algebra: example , ,

Market Trend and Production Challenge of Organic Tropical and Sub- Tropical Fruits Vitoon

Statistical Learning February 4, 2010 CS 489 / 698 University of Waterloo Outline

LEARNING PROBABILISTIC MODELS AIMA CHAPTER 20 Instructor: Sael Lee Materials form AIMA resources,

Unit2: Probabilityanddistributions 2. Bayes theorem and Bayesian inference If you

Example Suppose there are five kinds of bags of candies: 10% are h 1 : 100% cherry candies 20% are

KY 331 Widening US 60 to Rinaldo Rd. MP 0.436 to 2.62 KY 331 Widening Daviess County PL &