CSE 473: Artificial Intelligence Adversarial Search Dan Weld Based - PDF document

10/19/16 CSE 473: Artificial Intelligence Adversarial Search Dan Weld Based on slides from Dan Klein, Stuart Russell, Pieter Abbeel, Andrew Moore and Luke Zettlemoyer 1 (best illustrations from ai.berkeley.edu) Outline § Adversarial Search § Minimax search § α-β search § Evaluation functions § Expectimax § Reminder: § Project 2 due in 5 days 1

10/19/16 Types of Games stratego Number of Players? 1, 2, …? Deterministic Games § Many possible formalizations, one is: § States: S (start at s 0 ) § Players: P={1...N} (usually take turns) § Actions: A (may depend on player / state) § Transition Function: S x A à S § Terminal Test: S à {t,f} § Terminal Utilities: S x P à R § Solution for a player is a policy : S à A 2

10/19/16 Tic-tac-toe Game Tree Minimax Values States Under Agent’s Control: States Under Opponent’s Control: -8 -5 -10 + 8 Terminal States: Slide from Dan Klein & Pieter Abbeel - ai.berkeley.edu 3

10/19/16 Minimax Implementation Need Base case for recursion def max-value(state): def min-value(state): if leaf?(state), return U(state) if leaf?(state), return U(state) initialize v = - ∞ initialize v = + ∞ for each c in children(state) for each c in children(state) v = max(v, min-value(c)) v = min(v, max-value(c)) return v return v Slide from Dan Klein & Pieter Abbeel - ai.berkeley.edu a - b Pruning Example Max: ³ 3 £ 2 3 Min: ? ? Doesn’t matter! Progress of search… Don’t need to evaluate 4

10/19/16 Alpha-Beta Quiz Search depth-first Left to right Max: Order is important Do all nodes matter? Min: Slide from Dan Klein & Pieter Abbeel - ai.berkeley.edu Alpha-Beta Quiz 2 Search depth-first Left to right Max: Order is important Do all nodes matter? Min: Max: Slide from Dan Klein & Pieter Abbeel - ai.berkeley.edu 5

10/19/16 a - b Pruning § a is MAX’s best choice on path to root Player § If n becomes worse than a , α Opponent MAX will avoid it, so can stop considering n ’ s other children Player § Define b similarly for MIN Opponent n Min-Max Implementation def max-val(state ): def min-val(state ): if leaf?(state), return U(state) if leaf?(state), return U(state) initialize v = - ∞ initialize v = + ∞ for each c in children(state): for each c in children(state): v = max(v, min-val(c )) v = min(v, max-val(c )) return v return v Slide adapted from Dan Klein & Pieter Abbeel - ai.berkeley.edu 6

10/19/16 Alpha-Beta Implementation α: MAX’s best option on path to root β: MIN’s best option on path to root def max-val(state, α, β): def min-val(state , α, β): if leaf?(state), return U(state) if leaf?(state), return U(state) initialize v = - ∞ initialize v = + ∞ for each c in children(state): for each c in children(state): v = max(v, min-val(c, α, β)) v = min(v, max-val(c, α, β)) return v return v Slide adapted from Dan Klein & Pieter Abbeel - ai.berkeley.edu Alpha-Beta Implementation α: MAX’s best option on path to root β: MIN’s best option on path to root def max-val(state, α, β): def min-val(state, α, β): if leaf?(state), return U(state) if leaf?(state), return U(state) initialize v = - ∞ initialize v = + ∞ for each c in children(state): for each c in children(state): v = max(v, min-val(c, α, β)) v = min(v, max-val(c, α, β)) if v ≥ β return v if v ≤ α return v α = max(α, v) β = min(β, v) return v return v Slide adapted from Dan Klein & Pieter Abbeel - ai.berkeley.edu 7

10/19/16 Alpha-Beta Pruning Example α=- ¥ At max node: At min node: β=+ ¥ Prune if v ³b ; Prune if v £a ; 3 Else update a = max( a ,v) Else update b = min( b ,v) α=- ¥ α=3 α=3 α=3 β=+ ¥ β=+ ¥ β=+ ¥ β=+ ¥ 3 ≤2 ≤1 α=3 α=3 α=- ¥ α=- ¥ α=- ¥ α=- ¥ α=3 α=3 α=3 α=3 β=+ ¥ β=1 β=+ ¥ β=3 β=+ ¥ Β=+ ¥ β=3 β=3 β=14 β=5 3 12 2 14 5 1 ≥8 α is MAX ’ s best alternative here or above α=- ¥ α=- ¥ 8 β is MIN ’ s best alternative here or above β=3 β=3 Alpha-Beta Pruning Properties § This pruning has no effect on final result at the root § Values of intermediate nodes might be wrong! § but, they are correct bounds § Good child ordering improves effectiveness of pruning § With “ perfect ordering ” : § Time complexity drops to O(b m/2 ) § Doubles solvable depth! § (But complete search of complex games, e.g. chess, is still hopeless… 8

10/19/16 Resource Limits § Problem: In realistic games, cannot search to leaves! max 4 § Solution: Depth-limited search -2 4 min § Instead, search only to a limited depth in the tree -1 -2 4 9 § Replace terminal utilities with an evaluation function for non-terminal positions § Example: § Suppose we have 3 min/move, can explore 1M nodes / sec § So can check 200M nodes per move § a - b reaches about depth 10 à decent chess program § Guarantee of optimal play is gone § More plies makes a BIG difference ? ? ? ? Depth Matters § Evaluation functions are always imperfect § The deeper in the tree the evaluation function is buried, the less the quality of the evaluation function matters § Good example of the tradeoff between complexity of features and complexity of computation [Demo: depth limited (L6D4, L6D5)] 9

10/19/16 Iterative Deepening Iterative deepening uses DFS as a b subroutine: … 1. Do a DFS which only searches for paths of length 1 or less. (DFS gives up on any path of length 2) 2. If “ 1 ” failed, do a DFS which only searches paths of length 2 or less. 3. If “ 2 ” failed, do a DFS which only searches paths of length 3 or less. ….and so on. Creates an anytime algorithm Heuristic Evaluation Function § Function which scores non-terminals § Ideal function: returns the true utility of the position § In practice: need a simple, fast approximation § typically weighted linear sum of features: § e.g. f 1 ( s ) = (num white queens – num black queens), etc. 10

10/19/16 Evaluation for Pacman What features would be good for Pacman? Which algorithm? α - β , depth 4, simple eval fun QuickTime™ and a GIF decompressor are needed to see this picture. 11

10/19/16 Which algorithm? α - β , depth 4, better eval fun QuickTime™ and a GIF decompressor are needed to see this picture. Why Pacman Starves § He knows his score will go up by eating the dot now § He knows his score will go up just as much by eating the dot later on § There are no point-scoring opportunities after eating the dot § Therefore, waiting seems just as good as eating 12

CSE 473: Artificial Intelligence Adversarial Search Dan Weld Based - PDF document

10/19/16 CSE 473: Artificial Intelligence Adversarial Search Dan Weld Based on slides from Dan Klein, Stuart Russell, Pieter Abbeel, Andrew Moore and Luke Zettlemoyer 1 (best illustrations from ai.berkeley.edu) Outline Adversarial Search

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Yi-Shu Wei (TA) Hunter Whalen (TA)

CSE 473 Artificial Intelligence (AI) Rajesh Rao (Instructor) Jennifer Hanson (TA) Evan Herbst

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

1/29/10 CSE 3402: Intro to Artificial Intelligence CSE 3402: Intro to Artificial Intelligence

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

8th November 2019 Artificial Intelligence Finance Institute NYU Courant Artificial Intelligence

Games and Adversarial Search A: Mini-max, Cutting Off Search CS171, Summer Session I, 2018

How to write an editorial article Roger Graves Director, Writing Across the Curriculum Goals 1.

Pipeline Safety 20 Years After The Bellingham Tragedy Whats Changed? Are We Safer? New

Welcome!.. The Second International Workshop on Data-aware Distributed Computing DADC 2009 June

Formal Security Analysis of Cryptographic Protocol Code

Privacy as a tool for Robust Mechanism Design in Large Markets Aaron

Theory of Computer Games: Concluding Remarks Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Deconstructing Visualizations Maneesh Agrawala CS 448B: Visualization Fall 2018 Last Time: