CS 486/686 Introduction to Artificial Intelligence, Alice Gao

1/22 CS 486/686 Introduction to Artificial Intelligence. Alice Gao. Lecture 2. Readings: R & N 2.1, 2.2, 2.3 (esp. 2.3.2). Based on work by K. Leyton-Brown, K. Larson, and P. van Beek.

2/22 Outline Learning goals Rational Agents Properties of Task Environments Revisiting the learning goals

3/22 Learning goals - CS 486/686 Lecture 2 By the end of the lecture, you should be able to: ▶ Give examples of sensors and actuators. ▶ Define rational agents. ▶ Given a task environment, describe its properties. ▶ Given a property, give examples of task environments that have this property.

4/22 Agents An agent can: ▶ Interact with the environment. ▶ Perceive the environment using sensors. ▶ Act on the environment using actuators. As a human, what sensors and actuators do we have? Consider a software agent. What sensors and actuators does it have?
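The perceive/act cycle on this slide can be sketched as a small loop. This is only an illustration: the `Room` environment, the temperature sensor, the heater actuator, and the 20-degree target are all hypothetical and do not come from the lecture.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Room:
    """Hypothetical environment: a room with a temperature in degrees Celsius."""
    temperature: float


def sense(room: Room) -> float:
    """Sensor: perceive the environment by reading the temperature."""
    return room.temperature


def choose_action(percept: float, target: float = 20.0) -> str:
    """Agent program: map the current percept to an action."""
    return "heat" if percept < target else "idle"


def act(room: Room, action: str) -> None:
    """Actuator: act on the environment (heating raises the temperature by 1)."""
    if action == "heat":
        room.temperature += 1.0


def run(room: Room, steps: int) -> List[str]:
    """Repeat the perceive -> decide -> act cycle and record the actions taken."""
    actions = []
    for _ in range(steps):
        percept = sense(room)
        action = choose_action(percept)
        act(room, action)
        actions.append(action)
    return actions
```

For a software agent like this one, the "sensor" and "actuator" are just function calls into the environment; for a human, they would be eyes, ears, hands, and so on.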

5/22 Definition of a rational agent For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever prior knowledge the agent has.
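One way to read this definition is as an expected-value maximization. The sketch below assumes the agent's beliefs (derived from its percept sequence and prior knowledge) have already been summarized as a probability distribution over outcomes for each action; the function names, the umbrella scenario, and all numbers are hypothetical.

```python
from typing import Dict


def expected_performance(outcome_probs: Dict[str, float],
                         performance: Dict[str, float]) -> float:
    """Expected performance measure of one action under the agent's beliefs."""
    return sum(p * performance[outcome] for outcome, p in outcome_probs.items())


def rational_action(beliefs: Dict[str, Dict[str, float]],
                    performance: Dict[str, float]) -> str:
    """Select the action whose expected performance is highest."""
    return max(beliefs, key=lambda a: expected_performance(beliefs[a], performance))


# Hypothetical beliefs after seeing a cloudy sky: P(outcome | action, percepts).
BELIEFS = {
    "take_umbrella": {"dry_but_encumbered": 1.0},
    "leave_umbrella": {"dry": 0.7, "wet": 0.3},
}
# Hypothetical performance measure over outcomes.
PERFORMANCE = {"dry": 10.0, "dry_but_encumbered": 8.0, "wet": -20.0}
```

Note that the agent maximizes expected performance, not actual performance: leaving the umbrella at home turns out better 70% of the time here, yet taking it is still the rational choice under these assumed numbers.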

6/22 Properties of Task Environments The problems: the task environments. The solutions: the rational agents. Properties of the task environment: ▶ Fully observable vs. partially observable ▶ Deterministic vs. stochastic ▶ Static vs. dynamic ▶ Episodic vs. sequential ▶ Known vs. unknown ▶ Single agent vs. multi-agent

7/22 Uncertainty Given the observations, can the agent determine the state? ▶ Fully observable: The agent knows the state of the world from the observations. ▶ Partially observable: Many states are possible given an observation.
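The distinction can be made concrete by asking how many states are consistent with a given observation. The sketch below uses a hypothetical toy card game in which a state is a pair (my card, opponent's card); the observation functions and helper names are assumptions for illustration only.

```python
from itertools import permutations
from typing import Any, Callable, List, Tuple

State = Tuple[int, int]  # (my_card, opponent_card) in a hypothetical card game


def consistent_states(states: List[State],
                      observe: Callable[[State], Any],
                      observation: Any) -> List[State]:
    """All states that would produce the given observation."""
    return [s for s in states if observe(s) == observation]


def is_fully_observable(states: List[State],
                        observe: Callable[[State], Any]) -> bool:
    """Fully observable if every observation pins down exactly one state."""
    return all(len(consistent_states(states, observe, observe(s))) == 1
               for s in states)


# Two distinct cards dealt from a three-card deck {1, 2, 3}.
STATES: List[State] = list(permutations([1, 2, 3], 2))


def see_both_cards(state: State) -> State:
    """Observation that reveals the whole state."""
    return state


def see_own_card(state: State) -> int:
    """Observation that reveals only the agent's own card."""
    return state[0]
```

With `see_own_card`, observing card 1 leaves two possible states, `(1, 2)` and `(1, 3)`, so the environment is partially observable under that observation function.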

8/22 CQ: Full versus Partial Observability CQ: Which pair of environments has different observability? (A) Poker and autonomous cars (B) Chess and medical diagnosis (C) Crossword puzzle and Go

9/22 Examples of Uncertainty Come up with some additional examples yourself. Fully observable: Partially observable:

10/22 Uncertain dynamics Given the current state and an action, can the agent predict the next state? ▶ Deterministic: The next state is completely determined given the current state and the action. ▶ Stochastic: The current state and an action can lead to multiple possible next states.
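A transition model makes the two cases easy to compare. In the sketch below, a model maps each (state, action) pair to a probability distribution over next states; the two example models, the state names, and the helper functions are hypothetical.

```python
from typing import Dict, List, Tuple

# (state, action) -> {next_state: probability}
Transitions = Dict[Tuple[str, str], Dict[str, float]]


def possible_next_states(transitions: Transitions,
                         state: str, action: str) -> List[str]:
    """Next states that have non-zero probability for this state and action."""
    return sorted(s for s, p in transitions[(state, action)].items() if p > 0)


def is_deterministic(transitions: Transitions) -> bool:
    """Deterministic if every (state, action) pair has exactly one next state."""
    return all(len(possible_next_states(transitions, s, a)) == 1
               for (s, a) in transitions)


# Hypothetical corridor: moving right always succeeds.
CORRIDOR: Transitions = {
    ("A", "right"): {"B": 1.0},
    ("B", "right"): {"C": 1.0},
}

# Hypothetical slippery corridor: moving right fails 20% of the time.
SLIPPERY_CORRIDOR: Transitions = {
    ("A", "right"): {"B": 0.8, "A": 0.2},
    ("B", "right"): {"C": 0.8, "B": 0.2},
}
```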

11/22 CQ: Deterministic versus Stochastic CQ: Consider Chess and Poker. Which of the following is correct? (A) Both are deterministic. (B) Both are stochastic. (C) Chess is deterministic. Poker is stochastic. (D) Chess is stochastic. Poker is deterministic.

12/22 Examples of uncertain dynamics Come up with some additional examples yourself. Deterministic: Stochastic:

13/22 An uncertain environment An environment is uncertain if ▶ It is not fully observable, or ▶ It is not deterministic.
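The rule on this slide is a simple disjunction over two of the properties. A minimal sketch, assuming each environment is described by two boolean flags (the class name, field names, and example environments are hypothetical):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskEnvironment:
    """Hypothetical record of the two properties that determine uncertainty."""
    name: str
    fully_observable: bool
    deterministic: bool

    @property
    def uncertain(self) -> bool:
        """Uncertain if not fully observable, or not deterministic."""
        return (not self.fully_observable) or (not self.deterministic)


# Placeholder environments covering all four combinations of the two flags.
ENV_A = TaskEnvironment("A", fully_observable=True, deterministic=True)
ENV_B = TaskEnvironment("B", fully_observable=False, deterministic=True)
ENV_C = TaskEnvironment("C", fully_observable=True, deterministic=False)
ENV_D = TaskEnvironment("D", fully_observable=False, deterministic=False)
```

Only an environment that is both fully observable and deterministic (`ENV_A` here) counts as certain; failing either condition is enough to make it uncertain.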

14/22 Can the environment change? Can the environment change while the agent interacts with it? ▶ Static: The environment does not change. ▶ Dynamic: The environment changes while the agent interacts with it.

15/22 CQ: Static versus dynamic CQ: Consider autonomous cars and medical diagnosis. Which of the following statements is correct? (A) Both are static. (B) Both are dynamic. (C) Autonomous cars are static. Medical diagnosis is dynamic. (D) Autonomous cars are dynamic. Medical diagnosis is static.

16/22 Examples of changing environments Come up with some additional examples yourself. Static: Dynamic:

17/22 Long-term consequences of actions Can the agent's current action affect future actions? ▶ Episodic: The current action does not affect future actions. ▶ Sequential: The current action could affect all future actions.

18/22 CQ: Episodic vs. Sequential CQ: Consider crossword puzzles and image classification. Which of the following statements is correct? (A) Both are episodic. (B) Both are sequential. (C) Crossword puzzle is episodic. Image classification is sequential. (D) Crossword puzzle is sequential. Image classification is episodic.

19/22 Learning the rules of the environment Does the agent know the rules of the environment? ▶ Known: The agent knows all the rules of the environment. ▶ Unknown: The agent does not know all the rules of the environment.

20/22 Number of agents Does the agent consider all other agents to be part of the environment? ▶ Single agent: The agent assumes that any other agents are part of the environment. ▶ Multi-agent: The agent explicitly models other agents and reasons strategically about the other agents.

21/22 CQ: Single agent or multi-agent CQ: Is the autonomous car environment single agent or multi-agent? (A) Definitely single agent. (B) Definitely multi-agent. (C) It depends.

22/22 Revisiting the learning goals By the end of the lecture, you should be able to: ▶ Give examples of sensors and actuators. ▶ Define rational agents. ▶ Given a task environment, describe its properties. ▶ Given a property, give examples of task environments that have this property.
