Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 - PowerPoint PPT Presentation

B.Y. Choueiry ✫ ✬ Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 and 5.3) Introduction to Artificial Intelligence 1 CSCE 476-876, Fall 2017 URL: www.cse.unl.edu/˜choueiry/F17-476-876 Berthe Y. Choueiry (Shu-we-ri) Instructor’s notes #9 September 25, 2017 (402)472-5444 ✪ ✩

B.Y. Choueiry ✫ ✬ Outline • Introduction 2 • Minimax algorithm • Alpha-beta pruning Instructor’s notes #9 September 25, 2017 ✪ ✩

B.Y. Choueiry ✫ ✬ Context • In an MAS, agents affect each other’s welfare • Environment can be cooperative or competitive 3 • Competitive environments yield adverserial search problems (games) • Approaches: mathematical game theory and AI games Instructor’s notes #9 September 25, 2017 ✪ ✩

B.Y. Choueiry ✫ ✬ Game theory vs. AI • AI games: fully observable, deterministic environments, players alternate, utility values are equal (draw) or opposite (winner/loser) In vocabulary of game theory: deterministic, turn-taking, 4 two-player, zero-sum games of perfect information • Games are attractive to AI: states simple to represent, agents restricted to a small number of actions, outcome defined by simple rules Instructor’s notes #9 Not croquet or ice hockey, but typically board games September 25, 2017 Exception: Soccer (Robocup www.robocup.org/ ) ✪ ✩

B.Y. Choueiry ✫ ✬ Board game playing: an appealing target of AI research Board game: Chess (since early AI), Othello, Go, Backgammon, etc. - Easy to represent 5 - Fairly small numbers of well-defined actions - Environment fairly accessible - Good abstraction of an enemy, w/o real-life (or war) risks :—) Instructor’s notes #9 September 25, 2017 But also: Bridge, ping-pong, etc. ✪ ✩

B.Y. Choueiry ✫ ✬ Characteristics • ‘Unpredictable’ opponent: contingency problem (interleaves search and execution) • Not the usual type of ‘uncertainty’: no randomness/no missing information (such as in traffic) 6 but, the moves of the opponent expectedly non benign • Challenges: - huge branching factor - large solution space Instructor’s notes #9 - Computing optimal solution is infeasible September 25, 2017 - Yet, decisions must be made. Forget A*... ✪ ✩

B.Y. Choueiry ✫ ✬ Discussion • What are the theoretically best moves? 7 • Techniques for choosing a good move when time is tight √ Pruning: ignore irrelevant portions of the search space × Evaluation function: approximate the true utility of a state without doing search Instructor’s notes #9 September 25, 2017 ✪ ✩

B.Y. Choueiry ✫ ✬ Two-person Games - 2 player: Min and Max - Max moves first - Players alternate until end of game - Gain awarded to player/penalty give to loser Game as a search problem: 8 • Initial state: board position & indication whose turn it is • Successor function: defining legal moves a player can take Returns {(move, state) ∗ } • Terminal test: determining when game is over Instructor’s notes #9 September 25, 2017 states satisfy the test: terminal states • Utility function (a.k.a. payoff function): numerical value for outcome e.g., Chess: win=1, loss=-1, draw=0 ✪ ✩

B.Y. Choueiry ✫ ✬ Usual search Max finds a sequence of operators yielding a terminal goal scoring winner according to the utility function Game search • Min actions are significant 9 Max must find a strategy to win regardless of what Min does: → correct action for Max for each action of Min − • Need to approximate (no time to envisage all possibilities difficulty): a huge state space, an even more huge search space  Instructor’s notes #9 10 40 different legal positions September 25, 2017  e.g. , chess: Average branching factor=35, 50 moves/player= 35 100  • Performance in terms of time is very important ✪ ✩

B.Y. Choueiry ✫ ✬ Example : Tic-Tac-Toe Max has 9 alternative moves Terminal states’ utility: Max wins=1, Max loses = -1, Draw = 0 MAX ( X ) X X X MIN ( O ) X X X X X X 10 X O X O X . . . MAX ( X ) O X O X X O X O . . . MIN ( O ) X X Instructor’s notes #9 September 25, 2017 . . . . . . . . . . . . . . . X O X X O X X O X O X O O X X TERMINAL O X X O X O O Utility �–1 0 +1 ✪ ✩

B.Y. Choueiry ✫ ✬ Example : 2-ply game tree Max’s actions: a 1 , a 2 , a 3 Min’s actions: b 1 , b 2 , b 3 A MAX 3 a 1 a 3 11 a 2 B C D MIN 3 2 2 b 1 b 3 c 1 c 3 d 1 d 3 � b 2 c 2 d 2 Instructor’s notes #9 September 25, 2017 3 12 8 2 4 6 14 5 2 Minimax algorithm determines the optimal strategy for Max → decides which is the best move ✪ ✩

B.Y. Choueiry ✫ ✬ Minimax algorithm - Generate the whole tree, down to the leaves - Compute utility of each terminal state - Iteratively, from the leaves up to the root, use utility of nodes at depth d to compute utility of nodes at depth ( d − 1) : 12 MIN ‘row’: minimum of children MAX ‘row’: maximum of children Minimax-Value ( n )  Utility ( n ) if n is a terminal node    Instructor’s notes #9 max s ∈ Succ ( n ) Minimax-Value ( s ) if n is a Max node September 25, 2017   min s ∈ Succ ( n ) Minimax-Value ( s ) if n is a Min node  ✪ ✩

B.Y. Choueiry ✫ ✬ Minimax decision • MAX’s decision: minimax decision maximizes utility under the assumption that the opponent will play perfectly to his/her own advantage 13 • Minimax decision maximes the worst-case outcome for Max (which otherwise is guaranteed to do better) • If opponent is sub-optimal, other strategies may reach better outcome better than the minimax decision Instructor’s notes #9 September 25, 2017 ✪ ✩

B.Y. Choueiry ✫ ✬ Minimax algorithm: Properties • m maximum depth b legal moves 14 • Using Depth-first search, space requirement is: O ( bm ) : if generating all successors at once O ( m ) : if considering successors one at a time • Time complexity O ( b m ) Instructor’s notes #9 September 25, 2017 Real games: time cost totally unacceptable ✪ ✩

B.Y. Choueiry ✫ ✬ Multiple players games Utility ( n ) becomes a vector of the size of the number of players For each node, the vector gives the utility of the state for each player 15 to move A (1, 2, 6) B (1, 2, 6) (1, 5, 2) � X Instructor’s notes #9 C (1, 2, 6) (6, 1, 2) (1, 5, 2) (5, 4, 5) September 25, 2017 A (1, 2, 6) (4, 2, 3) (6, 1, 2) (7, 4,�1) (5,�1,�1) (1, 5, 2) (7, 7,�1) (5, 4, 5) ✪ ✩

B.Y. Choueiry ✫ ✬ Alliance formation in multiple players games How about alliances? • A and B in weak positions, but C in strong position A and B make an alliance to attack C (rather than each other 16 → Collaboration emerges from purely selfish behavior! • Alliances can be done and undone (careful for social stigma!) • When a two-player game is not zero-sum, players may end up automatically making alliances (for example when the terminal Instructor’s notes #9 September 25, 2017 state maximizes utility of both players) ✪ ✩

B.Y. Choueiry ✫ ✬ Alpha-beta pruning • Minimax requires computing all terminal nodes: unacceptable • Do we really need to do compute utility of all terminal nodes? 17 ... No, says John McCarthy in 1956: It is possible to compute the correct minimax decision without looking at every node in the tree, and yet get the correct Instructor’s notes #9 decision September 25, 2017 • Use pruning (eliminating useless branches in a tree) ✪ ✩

B.Y. Choueiry ✫ ✬ Example of alpha-beta pruning [−∞, +∞] [−∞, +∞] A A (a) (b) [−∞, 3 ] [−∞, 3 ] B B 3 3 12 [ 3 , +∞] [ 3 , +∞] A A (c) (d) 18 B B [−∞, 2] C [3, 3] [3, 3] 3 12 8 3 12 8 2 [3, 14] A [3, 3] A (e) (f) B [−∞, 2 ] C [−∞, 14 ] D [−∞, 2] Instructor’s notes #9 B C D [3, 3] [3, 3] [2, 2] September 25, 2017 3 12 8 2 14 3 12 8 2 14 5 2 Try 14, 5, 2, 6 below D ✪ ✩

B.Y. Choueiry ✫ ✬ General principal of Alpha-beta pruning  — a parent node of n  If Player has a better choice m at — any choice point further up  n will never be reached in actual play Player 19 m Opponent .. .. .. Player n Opponent Instructor’s notes #9 September 25, 2017 Once we have found enough about n ( e.g. , through one of it descendants), we can prune it ( i.e. , discard all its remaining descendants) ✪ ✩

B.Y. Choueiry ✫ ✬ Mechanism of Alpha-beta pruning α : value of best choice so far for MAX, (maximum) β : value of best choice so far for MIN, (minimum) Player 20 m Opponent .. .. .. Player n Opponent Instructor’s notes #9 September 25, 2017 Alpha-beta search: - updates the value of α , β as it goes along - prunes a subtree as soon as its worse then current α or β ✪ ✩

B.Y. Choueiry ✫ ✬ Effectiveness of pruning Effectiveness of pruning depends on the order of new nodes examined [−∞, +∞] [−∞, +∞] A A (a) (b) [−∞, 3 ] [−∞, 3 ] B B 21 3 3 12 [ 3 , +∞] [ 3 , +∞] A A (c) (d) B B [−∞, 2] C [3, 3] [3, 3] 3 12 8 3 12 8 2 Instructor’s notes #9 [3, 14] [3, 3] A A (e) (f) September 25, 2017 B [−∞, 2 ] C [−∞, 14 ] D B [−∞, 2] C D [3, 3] [3, 3] [2, 2] 3 12 8 2 14 3 12 8 2 14 5 2 ✪ ✩

Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 - PowerPoint PPT Presentation

B.Y. Choueiry Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 and 5.3) Introduction to Artificial Intelligence 1 CSCE 476-876, Fall 2017 URL: www.cse.unl.edu/choueiry/F17-476-876 Berthe Y. Choueiry (Shu-we-ri)

Informed Search strategies AIMA sections 3.5, 3.6 Summary Informed Search strategies

Informed search algorithms Chapter 4, Sections 12 Chapter 4, Sections 12 1 Outline

Informed search algorithms Chapter 3, Sections 56 of; based on AIMA Slides c Artificial

Uninformed Search strategies AIMA sections 3.4 Uninformed search strategies Uninformed Search

Local search algorithms Chapter 4, Sections 12 of; based on AIMA Slides c Artificial

Local search algorithms AIMA sections 4.1,4.2 Summary Local search algorithms Hill-climbing

Local search algorithms Chapter 4, Sections 34 Chapter 4, Sections 34 1 Outline

LEARNING PROBABILISTIC MODELS AIMA CHAPTER 20 Instructor: Sael Lee Materials form AIMA resources,

First-order logic Chapter 8, Sections 13 of; based on AIMA Slides c Artificial Intelligence,

Bayesian networks Chapter 14, Sections 14 of; based on AIMA Slides c Artificial Intelligence,

Artificial Intelligence Chapter 1, Sections 13 of; based on AIMA Slides c Artificial

Intelligent Agents Chapter 2, Sections 14 of; based on AIMA Slides c Artificial Intelligence,

Game playing Chapter 5, Sections 16 of; based on AIMA Slides c Artificial Intelligence,

Constraint Satisfaction Problems Chapter 6, Sections 15 of; based on AIMA Slides c Artificial

Planning Chapter 10, Sections 14 of; based on AIMA Slides c Artificial Intelligence, spring

Inference in first-order logic Chapter 9, Sections 15 of; based on AIMA Slides c Artificial

2. A Game Theoretic Analysis of the ESP Game Ming Yin and Steve Komarov Human Computation Today

Drafting Territories in the Board Game Risk Presenter: Richard Gibson Joint Work With: Neesha

A Case for Protecting Computer Games With SGX Erick Bauman and Zhiqiang Lin System and Software

Differential Game Logic Andr e Platzer Summer School Marktoberdorf 2017 0.5 0.4 0.3 0.2

Switchboard: A Matchmaking System for Multiplayer Mobile Games Justin Manweiler , Sharad Agarwal,

Roeder Interactive Game loop Interactive Game Loop Core Mechanics Physics, AI, etc. Update()

On the Bit Security of Cryptographic Primitives M E T R A S 7 . 7 M Daniele Micciancio

Double Your Contributors Using These 3 Simple Tricks! - or - Why Would Someone Work on Your