CS 188: Artificial Intelligence, Spring 2011
Lecture 19: Dynamic Bayes Nets, Naïve Bayes
4/6/2011, Pieter Abbeel – UC Berkeley
Slides adapted from Dan Klein.

Announcements
§ W4 out, due next week Monday
§ P4 out, due next week Friday
§ Mid-semester survey

P4: Ghostbusters 2.0
§ Plot: Pacman's grandfather, Grandpac, learned to hunt ghosts for sport.
§ He was blinded by his power, but could hear the ghosts' banging and clanging.
§ Transition Model: All ghosts move randomly, but are sometimes biased
§ Emission Model: Pacman knows a "noisy" distance to each ghost
[Plot: noisy distance probability, true distance = 8]

Announcements II
§ Course contest
§ Regular tournaments. Instructions have been posted!
§ First week extra credit for top 20, next week top 10, then top 5, then top 3.
§ First nightly tournament: tentatively Monday night

Today
§ Dynamic Bayes Nets (DBNs)
§ Demos:
§ Localization
§ Simultaneous Localization And Mapping (SLAM)
§ Start machine learning

Dynamic Bayes Nets (DBNs)
§ We want to track multiple variables over time, using multiple sources of evidence
§ Idea: Repeat a fixed Bayes net structure at each time
§ Variables from time t can condition on those from t-1
§ [sometimes called temporal Bayes nets]
[Figure: unrolled DBN for t = 1, 2, 3 with ghost variables G_t^a, G_t^b and evidence variables E_t^a, E_t^b]
§ Discrete valued dynamic Bayes nets are also HMMs
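Not part of the original slides: a minimal Python sketch of what "repeat a fixed Bayes net structure at each time" means for the two-ghost example, assuming a hypothetical random-walk transition model and a noisy-distance emission model (all function names here are ours, not the project's API).

```python
import random

def sample_transition(pos):
    """Hypothetical transition model: a ghost takes one random step (or stays put)."""
    x, y = pos
    dx, dy = random.choice([(1, 0), (-1, 0), (0, 1), (0, -1), (0, 0)])
    return (x + dx, y + dy)

def sample_emission(pacman, ghost):
    """Hypothetical emission model: noisy Manhattan distance to the ghost."""
    true_dist = abs(pacman[0] - ghost[0]) + abs(pacman[1] - ghost[1])
    return true_dist + random.choice([-1, 0, 1])

def sample_dbn(T, pacman=(0, 0)):
    """Forward-sample the DBN: the same local structure is repeated at every time step,
    and each time-t variable conditions only on time t-1 (plus its own emission)."""
    g_a, g_b = (3, 3), (5, 3)                  # G_1^a, G_1^b
    trajectory = []
    for t in range(1, T + 1):
        e_a = sample_emission(pacman, g_a)     # E_t^a depends on G_t^a
        e_b = sample_emission(pacman, g_b)     # E_t^b depends on G_t^b
        trajectory.append(((g_a, g_b), (e_a, e_b)))
        g_a = sample_transition(g_a)           # G_{t+1}^a depends on G_t^a
        g_b = sample_transition(g_b)           # G_{t+1}^b depends on G_t^b
    return trajectory
```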
Exact Inference in DBNs
§ Variable elimination applies to dynamic Bayes nets
§ Procedure: "unroll" the network for T time steps, then eliminate variables until P(X_T | e_1:T) is computed
[Figure: unrolled DBN for t = 1, 2, 3 with ghost variables G_t^a, G_t^b and evidence variables E_t^a, E_t^b]
§ Online belief updates: Eliminate all variables from the previous time step; store factors for current time only

DBN Particle Filters
§ A particle is a complete sample for a time step
§ Initialize: Generate prior samples for the t=1 Bayes net
§ Example particle: G_1^a = (3,3), G_1^b = (5,3)
§ Elapse time: Sample a successor for each particle
§ Example successor: G_2^a = (2,3), G_2^b = (6,3)
§ Observe: Weight each entire sample by the likelihood of the evidence conditioned on the sample
§ Likelihood: P(E_1^a | G_1^a) * P(E_1^b | G_1^b)
§ Resample: Select prior samples (tuples of values) in proportion to their likelihood
§ [Demo]
§ (A Python sketch of these steps, and of the two tricks below, follows after the SLAM slide.)

Trick I to Improve Particle Filtering Performance: Low Variance Resampling
§ Advantages:
§ More systematic coverage of space of samples
§ If all samples have same importance weight, no samples are lost
§ Lower computational complexity

Trick II to Improve Particle Filtering Performance: Regularization
§ If there is no or little noise in the transition model, all particles will start to coincide
§ à regularization: introduce additional (artificial) noise into the transition model

SLAM
§ SLAM = Simultaneous Localization And Mapping
§ We do not know the map or our location
§ Our belief state is over maps and positions!
§ Main techniques: Kalman filtering (Gaussian HMMs) and particle methods
§ [DEMOS] DP-SLAM, Ron Parr
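As referenced above, here is a minimal Python sketch of the DBN particle-filter steps (initialize, elapse time, observe, resample) for the two-ghost example. `sample_transition` and `emission_prob` are hypothetical stand-ins supplied by the caller, not the project's actual API.

```python
import random

def initialize(n, grid):
    """Initialize: generate n prior samples for the t=1 Bayes net (one position per ghost)."""
    return [(random.choice(grid), random.choice(grid)) for _ in range(n)]

def particle_filter_step(particles, evidence, sample_transition, emission_prob):
    """One DBN particle-filter update; each particle is a tuple (g_a, g_b) of ghost positions."""
    # Elapse time: sample a successor for each particle from the transition model.
    particles = [(sample_transition(g_a), sample_transition(g_b)) for g_a, g_b in particles]

    # Observe: weight each entire sample by the likelihood of the evidence,
    # i.e. P(E^a | G^a) * P(E^b | G^b).
    e_a, e_b = evidence
    weights = [emission_prob(e_a, g_a) * emission_prob(e_b, g_b) for g_a, g_b in particles]

    # Resample: select samples in proportion to their weights.
    return random.choices(particles, weights=weights, k=len(particles))
```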
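And a sketch of the two tricks, under two assumptions the slides do not spell out: that "low variance resampling" refers to the standard systematic-resampling scheme, and that "regularization" means injecting extra artificial noise into an otherwise (nearly) deterministic transition model. Names and noise levels are illustrative.

```python
import random

def low_variance_resample(particles, weights):
    """Systematic resampling: draw one random offset, then take evenly spaced picks.
    Runs in O(n), and if all weights are equal every particle survives exactly once."""
    n = len(particles)
    step = sum(weights) / n
    r = random.uniform(0, step)            # single random number for the whole sweep
    resampled, cumulative, i = [], weights[0], 0
    for m in range(n):
        target = r + m * step
        while cumulative < target:
            i += 1
            cumulative += weights[i]
        resampled.append(particles[i])
    return resampled

def regularize(sample_transition, noise=0.25):
    """Wrap a (nearly) deterministic transition model with extra artificial noise
    so particles do not all collapse onto the same value."""
    def noisy_transition(pos):
        x, y = pos
        if random.random() < noise:        # with probability `noise`, perturb to a neighbor
            dx, dy = random.choice([(1, 0), (-1, 0), (0, 1), (0, -1)])
            return (x + dx, y + dy)
        return sample_transition(pos)
    return noisy_transition
```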
Robot Localization
§ In robot localization:
§ We know the map, but not the robot's position
§ Observations may be vectors of range finder readings
§ State space and readings are typically continuous (works basically like a very fine grid) and so we cannot store B(X)
§ Particle filtering is a main technique
§ [Demos] Global-floor

SLAM
§ SLAM = Simultaneous Localization And Mapping
§ We do not know the map or our location
§ State consists of position AND map!
§ Main techniques: Kalman filtering (Gaussian HMMs) and particle methods

Particle Filter Example
§ 3 particles
[Figure: map of particle 1, map of particle 2, map of particle 3]

SLAM
§ DEMOS: fastslam.avi, visionSlam_heliOffice.wmv

Further readings
§ We are done with Part II: Probabilistic Reasoning
§ To learn more (beyond scope of 188):
§ Koller and Friedman, Probabilistic Graphical Models (CS281A)
§ Thrun, Burgard and Fox, Probabilistic Robotics (CS287)

Part III: Machine Learning
§ Up until now: how to reason in a model and how to make optimal decisions
§ Machine learning: how to acquire a model on the basis of data / experience
§ Learning parameters (e.g. probabilities)
§ Learning structure (e.g. BN graphs)
§ Learning hidden concepts (e.g. clustering)
Machine Learning
§ Today:
§ An ML Example: Parameter Estimation
§ Maximum likelihood
§ Smoothing
§ Applications
§ Main concepts
§ Naïve Bayes

Parameter Estimation
[Data: r g g r g g r g g r r g g g g]
§ Estimating the distribution of a random variable
§ Elicitation: ask a human (why is this hard?)
§ Empirically: use training data (learning!)
§ E.g.: for each outcome x, look at the empirical rate of that value: P_ML(x) = count(x) / total samples
§ Example (draws r, g, g): P_ML(r) = 1/3, P_ML(g) = 2/3
§ This is the estimate that maximizes the likelihood of the data
§ Issue: overfitting. E.g., what if we only observed 1 jelly bean?

Estimation: Smoothing
§ Relative frequencies are the maximum likelihood estimates
§ In Bayesian statistics, we think of the parameters as just another random variable, with its own distribution

Estimation: Laplace Smoothing
§ Laplace's estimate: Pretend you saw every outcome once more than you actually did (example data: H H T)
§ Can derive this as a MAP estimate with Dirichlet priors (see cs281a)

Estimation: Laplace Smoothing (extended)
§ Laplace's estimate (extended): Pretend you saw every outcome k extra times (example data: H H T)
§ What's Laplace with k = 0?
§ k is the strength of the prior
§ Laplace for conditionals: Smooth each condition independently

Example: Spam Filter
§ Input: email
§ Output: spam/ham
§ Setup:
§ Get a large collection of example emails, each labeled "spam" or "ham"
§ Note: someone has to hand label all this data!
§ Want to learn to predict labels of new, future emails
§ Features: The attributes used to make the ham / spam decision
§ Words: FREE!
§ Text Patterns: $dd, CAPS
§ Non-text: SenderInContacts
§ …
[Example spam: "Dear Sir. First, I must solicit your confidence in this transaction, this is by virture of its nature as being utterly confidencial and top secret. …"]
[Example spam: "TO BE REMOVED FROM FUTURE MAILINGS, SIMPLY REPLY TO THIS MESSAGE AND PUT \"REMOVE\" IN THE SUBJECT."]
[Example spam: "99 MILLION EMAIL ADDRESSES FOR ONLY $99"]
[Example ham: "Ok, I know this is blatantly OT but I'm beginning to go insane. Had an old Dell Dimension XPS sitting in the corner and decided to put it to use, I know it was working pre being stuck in the corner, but when I plugged it in, hit the power nothing happened."]
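A minimal sketch (not from the slides) of the relative-frequency, i.e. maximum-likelihood, estimate applied to the jelly-bean draws above, and of how a single observation overfits; function names are ours.

```python
from collections import Counter

def ml_estimate(samples):
    """Maximum-likelihood (relative-frequency) estimate: P_ML(x) = count(x) / total."""
    counts = Counter(samples)
    total = len(samples)
    return {x: c / total for x, c in counts.items()}

print(ml_estimate(['r', 'g', 'g']))   # {'r': 0.33..., 'g': 0.66...}
print(ml_estimate(['g']))             # {'g': 1.0} -- overfits: red gets probability 0
```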
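And a sketch of Laplace smoothing with strength k, plus the per-condition smoothing mentioned in the extended slide; the H H T coin data comes from the slide, everything else is illustrative.

```python
from collections import Counter

def laplace_estimate(samples, outcomes, k=1):
    """Laplace smoothing: pretend every outcome was seen k extra times.
    k = 0 recovers the maximum-likelihood estimate; larger k = stronger prior."""
    counts = Counter(samples)
    total = len(samples) + k * len(outcomes)
    return {x: (counts[x] + k) / total for x in outcomes}

def laplace_conditionals(pairs, x_outcomes, y_outcomes, k=1):
    """Laplace for conditionals: smooth P(x | y) for each condition y independently."""
    by_condition = {y: [x for (x, yy) in pairs if yy == y] for y in y_outcomes}
    return {y: laplace_estimate(xs, x_outcomes, k) for y, xs in by_condition.items()}

print(laplace_estimate(['H', 'H', 'T'], ['H', 'T'], k=1))   # {'H': 0.6, 'T': 0.4}
```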