Learning agents Performance standard Critic Sensors Learning from - PDF document

Learning agents Performance standard Critic Sensors Learning from Observations feedback Environment changes Chapter 18, Sections 1–3 Learning Performance element element knowledge learning goals experiments Problem generator Agent Effectors Chapter 18, Sections 1–3 1 Chapter 18, Sections 1–3 4 Outline Learning element ♦ Learning agents Design of learning element is dictated by ♦ what type of performance element is used ♦ Inductive learning ♦ which functional component is to be learned ♦ how that functional component is represented ♦ Decision tree learning ♦ what kind of feedback is available ♦ Measuring learning performance Example scenarios: Performance element Component Representation Feedback Alpha−beta search Eval. fn. Weighted linear function Win/loss Logical agent Transition model Successor−state axioms Outcome Utility−based agent Transition model Dynamic Bayes net Outcome Simple reflex agent Percept−action fn Neural net Correct action Supervised learning: correct answers for each instance Reinforcement learning: occasional rewards Chapter 18, Sections 1–3 2 Chapter 18, Sections 1–3 5 Learning Inductive learning (a.k.a. Science) Learning is essential for unknown environments, Simplest form: learn a function from examples ( tabula rasa ) i.e., when designer lacks omniscience f is the target function Learning is useful as a system construction method, O O X i.e., expose the agent to reality rather than trying to write it down An example is a pair x , f ( x ) , e.g., X , +1 Learning modifies the agent’s decision mechanisms to improve performance X Problem: find a(n) hypothesis h such that h ≈ f given a training set of examples ( This is a highly simplified model of real learning: – Ignores prior knowledge – Assumes a deterministic, observable “environment” – Assumes examples are given – Assumes that the agent wants to learn f —why? ) Chapter 18, Sections 1–3 3 Chapter 18, Sections 1–3 6

Inductive learning method Inductive learning method Construct/adjust h to agree with f on training set Construct/adjust h to agree with f on training set ( h is consistent if it agrees with f on all examples) ( h is consistent if it agrees with f on all examples) E.g., curve fitting: E.g., curve fitting: f(x) f(x) x x Chapter 18, Sections 1–3 7 Chapter 18, Sections 1–3 10 Inductive learning method Inductive learning method Construct/adjust h to agree with f on training set Construct/adjust h to agree with f on training set ( h is consistent if it agrees with f on all examples) ( h is consistent if it agrees with f on all examples) E.g., curve fitting: E.g., curve fitting: f(x) f(x) x x Chapter 18, Sections 1–3 8 Chapter 18, Sections 1–3 11 Inductive learning method Inductive learning method Construct/adjust h to agree with f on training set Construct/adjust h to agree with f on training set ( h is consistent if it agrees with f on all examples) ( h is consistent if it agrees with f on all examples) E.g., curve fitting: E.g., curve fitting: f(x) f(x) x x Ockham’s razor: maximize a combination of consistency and simplicity Chapter 18, Sections 1–3 9 Chapter 18, Sections 1–3 12

Attribute-based representations Hypothesis spaces How many distinct decision trees with n Boolean attributes?? Examples described by attribute values (Boolean, discrete, continuous, etc.) E.g., situations where I will/won’t wait for a table: Attributes Target Example Alt Bar Fri Hun Pat Price Rain Res Type Est WillWait X 1 T F F T Some $$$ F T French 0–10 T X 2 T F F T Full $ F F Thai 30–60 F X 3 F T F F Some $ F F Burger 0–10 T X 4 T F T T Full $ F F Thai 10–30 T X 5 T F T F Full $$$ F T French > 60 F X 6 F T F T Some $$ T T Italian 0–10 T F T F F None $ T F Burger 0–10 F X 7 X 8 F F F T Some $$ T T Thai 0–10 T F T T F Full $ T F Burger > 60 F X 9 X 10 T T T T Full $$$ F T Italian 10–30 F F F F F None $ F F Thai 0–10 F X 11 X 12 T T T T Full $ F F Burger 30–60 T Classification of examples is positive (T) or negative (F) Chapter 18, Sections 1–3 13 Chapter 18, Sections 1–3 16 Decision trees Hypothesis spaces One possible representation for hypotheses How many distinct decision trees with n Boolean attributes?? E.g., here is the “true” tree for deciding whether to wait: = number of Boolean functions Patrons? None Some Full F T WaitEstimate? >60 30−60 10−30 0−10 F Alternate? Hungry? T No Yes No Yes Reservation? Fri/Sat? T Alternate? No Yes No Yes No Yes Bar? T F T T Raining? No Yes No Yes F T F T Chapter 18, Sections 1–3 14 Chapter 18, Sections 1–3 17 Expressiveness Hypothesis spaces Decision trees can express any function of the input attributes. How many distinct decision trees with n Boolean attributes?? E.g., for Boolean functions, truth table row → path to leaf: = number of Boolean functions = number of distinct truth tables with 2 n rows A A B A xor B F T F F F B B F T T F T F T T F T T T F F T T F Trivially, there is a consistent decision tree for any training set w/ one path to leaf for each example (unless f nondeterministic in x ) but it probably won’t generalize to new examples Prefer to find more compact decision trees Chapter 18, Sections 1–3 15 Chapter 18, Sections 1–3 18

Hypothesis spaces Hypothesis spaces How many distinct decision trees with n Boolean attributes?? How many distinct decision trees with n Boolean attributes?? = number of Boolean functions = number of Boolean functions = number of distinct truth tables with 2 n rows = 2 2 n = number of distinct truth tables with 2 n rows = 2 2 n E.g., with 6 Boolean attributes, there are 18,446,744,073,709,551,616 trees How many purely conjunctive hypotheses (e.g., Hungry ∧ ¬ Rain )?? Each attribute can be in (positive), in (negative), or out 3 n distinct conjunctive hypotheses ⇒ More expressive hypothesis space – increases chance that target function can be expressed – increases number of hypotheses consistent w/ training set ⇒ may get worse predictions Chapter 18, Sections 1–3 19 Chapter 18, Sections 1–3 22 Hypothesis spaces Decision tree learning How many distinct decision trees with n Boolean attributes?? Aim: find a small tree consistent with the training examples = number of Boolean functions Idea: (recursively) choose “most significant” attribute as root of (sub)tree = number of distinct truth tables with 2 n rows = 2 2 n function DTL ( examples, attributes, default ) returns a decision tree E.g., with 6 Boolean attributes, there are 18,446,744,073,709,551,616 trees if examples is empty then return default else if all examples have the same classification then return the classification else if attributes is empty then return Mode ( examples ) else best ← Choose-Attribute ( attributes , examples ) tree ← a new decision tree with root test best for each value v i of best do examples i ← { elements of examples with best = v i } subtree ← DTL ( examples i , attributes − best , Mode ( examples )) add a branch to tree with label v i and subtree subtree return tree Chapter 18, Sections 1–3 20 Chapter 18, Sections 1–3 23 Hypothesis spaces Choosing an attribute How many distinct decision trees with n Boolean attributes?? Idea: a good attribute splits the examples into subsets that are (ideally) “all positive” or “all negative” = number of Boolean functions = number of distinct truth tables with 2 n rows = 2 2 n E.g., with 6 Boolean attributes, there are 18,446,744,073,709,551,616 trees Patrons? Type? How many purely conjunctive hypotheses (e.g., Hungry ∧ ¬ Rain )?? None Some Full French Italian Thai Burger Patrons ? is a better choice—gives information about the classification Chapter 18, Sections 1–3 21 Chapter 18, Sections 1–3 24

Learning agents Performance standard Critic Sensors Learning from - PDF document

Learning agents Performance standard Critic Sensors Learning from Observations feedback Environment changes Chapter 18, Sections 13 Learning Performance element element knowledge learning goals experiments Problem generator

Learning Agents Overview Learning important aspects Learning in Agents goal, types; individual

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents & environments

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

2015 OUTSTANDING YOUNG AGENTS COMMITTEE: Membership Development The Young Agents Council of the

Intelligent Driving Agents Intelligent Driving Agents Microscopic traffic simulation with

Innovative Ideas to Engage Agents Will Bickmore & Sarah-Lynne Rand Senior Account Managers

BABA is getting Social BECOME A BETTER AGENT Where good agents go to become great agents.

Intelligent Agents 2 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 2 1 2 Intelligence Agents

Intelligent Agents: acting rationally AIMA chapter 2 Summary Intelligent Agents: acting

1 Defining Agents 2 2 How Agents Should Act 3 2.1 Mapping from Percept Sequences to Actions

Single agent or multiple agents Many domains are characterized by multiple agents rather than a

Single agent or multiple agents Many domains are characterized by multiple agents rather than a

I ntelligent Agents I ntelligent Agents Rational Agent PEAS Types of Agents Some

Logical Agents Chapter 7 Why Do We Need Logic? Problem-solving agents were very inflexible:

Intelligent Agents Chapter 2 Some slide credits to Hwee Tou Ng (Singapore) Outline Agents

What is pandas ? IN TR OD U C TION TO DATA SC IE N C E IN P YTH ON Hillar y Green - Lerman Lead

libadjoint: a new abstraction for developing adjoint models S.W. Funke, P.E. Farrell and D.A. Ham

One System To Fit Them All: Shared MySQL Hosting At Facebook Andrew Regner Production

SIP URI Conventions for Media Servers IETF 51 8 August 2001 draft-burger-sipping-msuri-00.txt

I N A NUTSHELL ... Efficent visualization tool : for small sample continuous datasets

Analysis of the Occurrence of Laughter in Meetings Kornel Laskowski 1 , 2 & Susanne Burger 2 1

DG-KFVS schemes for convection-diffusion equations Praveen. C Center for Applicable Mathematics

The Strategist: Strategy from Combination Adam Brandenburger J.P . Valles Professor,

Learning agents Performance standard Critic Sensors Learning from - PDF document

Learning agents Performance standard Critic Sensors Learning from Observations feedback Environment changes Chapter 18, Sections 13 Learning Performance element element knowledge learning goals experiments Problem generator

Learning Agents Overview Learning important aspects Learning in Agents goal, types; individual

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents &amp; environments

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments

2015 OUTSTANDING YOUNG AGENTS COMMITTEE: Membership Development The Young Agents Council of the

Intelligent Driving Agents Intelligent Driving Agents Microscopic traffic simulation with

Innovative Ideas to Engage Agents Will Bickmore &amp; Sarah-Lynne Rand Senior Account Managers

BABA is getting Social BECOME A BETTER AGENT Where good agents go to become great agents.

Intelligent Agents 2 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 2 1 2 Intelligence Agents

Intelligent Agents: acting rationally AIMA chapter 2 Summary Intelligent Agents: acting

1 Defining Agents 2 2 How Agents Should Act 3 2.1 Mapping from Percept Sequences to Actions

Single agent or multiple agents Many domains are characterized by multiple agents rather than a

Single agent or multiple agents Many domains are characterized by multiple agents rather than a

I ntelligent Agents I ntelligent Agents Rational Agent PEAS Types of Agents Some

Logical Agents Chapter 7 Why Do We Need Logic? Problem-solving agents were very inflexible:

Intelligent Agents Chapter 2 Some slide credits to Hwee Tou Ng (Singapore) Outline Agents

What is pandas ? IN TR OD U C TION TO DATA SC IE N C E IN P YTH ON Hillar y Green - Lerman Lead

libadjoint: a new abstraction for developing adjoint models S.W. Funke, P.E. Farrell and D.A. Ham

One System To Fit Them All: Shared MySQL Hosting At Facebook Andrew Regner Production

SIP URI Conventions for Media Servers IETF 51 8 August 2001 draft-burger-sipping-msuri-00.txt

I N A NUTSHELL ... Efficent visualization tool : for small sample continuous datasets

Analysis of the Occurrence of Laughter in Meetings Kornel Laskowski 1 , 2 &amp; Susanne Burger 2 1

DG-KFVS schemes for convection-diffusion equations Praveen. C Center for Applicable Mathematics

The Strategist: Strategy from Combination Adam Brandenburger J.P . Valles Professor,

CSC421 Intro to Artificial Intelligence UNIT 01: Intelligent Agents Agents & environments

Innovative Ideas to Engage Agents Will Bickmore & Sarah-Lynne Rand Senior Account Managers

Analysis of the Occurrence of Laughter in Meetings Kornel Laskowski 1 , 2 & Susanne Burger 2 1