Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1

Introduction Alpha-beta pruning is the standard searching procedure used for solving 2-person perfect-information zero sum games exactly. Definitions: • A position p . • The value of a position p , f ( p ) , is a numerical value computed from evaluating p . ⊲ Value is computed from the root player’s point of view. ⊲ Positive values mean in favor of the root player. ⊲ Negative values mean in favor of the opponent. ⊲ Since it is a zero sum game, thus from the opponent’s point of view, the value can be assigned − f ( p ) . • A terminal position: a position whose value can be decided. ⊲ A position where win/loss/draw can be concluded. ⊲ A position where some constraints, e.g., time limit and depth limit, are met. • A position p has b legal moves p 1 , p 2 , . . . , p b . TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 2

Tree node numbering 1 2 3 2.2 3.1 3.2 1.1 1.3 2.1 1.2 3.1.1 3.1.2 From the root, number a node in a search tree by a sequence of integers a 1 .a 2 .a 3 .a 4 · · · • Meaning from the root, you first take the a 1 th branch, then the a 2 th branch, and then the a 3 th branch, and then the a 4 th branch · · · • The root is specified as an empty sequence. • The depth of a node is the length of the sequence of integers specifying it. This is called “Dewey decimal system.” TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 3

Mini-max formulation max min max 7 2 5 1 6 7 min 8 1 Mini-max formulation: • � f ( p ) if b = 0 F ′ ( p ) = max { G ′ ( p 1 ) , . . . , G ′ ( p b ) } if b > 0 • � f ( p ) if b = 0 G ′ ( p ) = min { F ′ ( p 1 ) , . . . , F ′ ( p b ) } if b > 0 • An indirect recursive formula with a bottom-up evaluation! • Equivalent to AND-OR logic. TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 4

Mini-max formulation max min 1 2 max 7 2 5 8 1 6 7 min 8 1 Mini-max formulation: • � f ( p ) if b = 0 F ′ ( p ) = max { G ′ ( p 1 ) , . . . , G ′ ( p b ) } if b > 0 • � f ( p ) if b = 0 G ′ ( p ) = min { F ′ ( p 1 ) , . . . , F ′ ( p b ) } if b > 0 • An indirect recursive formula with a bottom-up evaluation! • Equivalent to AND-OR logic. TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 5

Mini-max formulation max min 1 2 7 max 7 2 5 8 1 6 7 min 8 1 Mini-max formulation: • � f ( p ) if b = 0 F ′ ( p ) = max { G ′ ( p 1 ) , . . . , G ′ ( p b ) } if b > 0 • � f ( p ) if b = 0 G ′ ( p ) = min { F ′ ( p 1 ) , . . . , F ′ ( p b ) } if b > 0 • An indirect recursive formula with a bottom-up evaluation! • Equivalent to AND-OR logic. TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 6

Mini-max formulation max 7 min 1 2 7 max 7 2 5 8 1 6 7 min 8 1 Mini-max formulation: • � f ( p ) if b = 0 F ′ ( p ) = max { G ′ ( p 1 ) , . . . , G ′ ( p b ) } if b > 0 • � f ( p ) if b = 0 G ′ ( p ) = min { F ′ ( p 1 ) , . . . , F ′ ( p b ) } if b > 0 • An indirect recursive formula with a bottom-up evaluation! • Equivalent to AND-OR logic. TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 7

Algorithm: Mini-max Algorithm F ′ (position p ) // max node • determine the successor positions p 1 , . . . , p b • if b = 0 , then return f ( p ) else begin ⊲ m := −∞ ⊲ for i := 1 to b do t := G ′ ( p i ) ⊲ ⊲ if t > m then m := t // find max value • end; • return m Algorithm G ′ (position p ) // min node • determine the successor positions p 1 , . . . , p b • if b = 0 , then return f ( p ) else begin ⊲ m := ∞ ⊲ for i := 1 to b do t := F ′ ( p i ) ⊲ ⊲ if t < m then m := t // find min value • end; • return m TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 8

Mini-max: comments A brute-force method to try all possibilities! • May visit a position many times. Depth-first search • Move ordering is according to order the successor positions are gener- ated. • Bottom-up evaluation. • Post-ordering traversal. Q: • Iterative deepening? • BFS? • Other types of searching? TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 9

Mini-max: revised (1/2) Search a max-node position p with a depth of depth . Algorithm F ′ (position p , integer depth ) // max node • determine the successor positions p 1 , . . . , p b • if b = 0 // a terminal node or depth = 0 // remaining depth to search or time is running up // from timing control or some other constraints are met // add knowledge here then return f ( p ) // current board value else begin ⊲ m := −∞ // initial value ⊲ for i := 1 to b do // try each child ⊲ begin t := G ′ ( p i , depth − 1) ⊲ ⊲ if t > m then m := t // find max value ⊲ end end • return m TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 10

Mini-max: revised (2/2) Search a min-node position p with a depth of depth . Algorithm G ′ (position p , integer depth ) // min node • determine the successor positions p 1 , . . . , p b • if b = 0 // a terminal node or depth = 0 // remaining depth to search or time is running up // from timing control or some other constraints are met // add knowledge here then return f ( p ) // current board value else begin ⊲ m := ∞ // initial value ⊲ for i := 1 to b do // try each child ⊲ begin t := F ′ ( p i , depth − 1) ⊲ ⊲ if t < m then m := t // find min value ⊲ end end • return m TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 11

Nega-max formulation max min max 7 5 1 6 7 2 min −8 −1 Nega-max formulation: Let F ( p ) be the greatest possible value achievable from position p against the optimal defensive strategy. • � h ( p ) if b = 0 F ( p ) = max {− F ( p 1 ) , . . . , − F ( p b ) } if b > 0 ⊲ � f ( p ) if depth of p is 0 or even h ( p ) = − f ( p ) if depth of p is odd ⊲ h ( p ) is the position’s value from the point of view of the player of p . TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 12

Nega-max formulation max min −1 neg −2 neg neg neg neg max 8 7 5 1 6 7 2 neg neg min −8 −1 Nega-max formulation: Let F ( p ) be the greatest possible value achievable from position p against the optimal defensive strategy. • � h ( p ) if b = 0 F ( p ) = max {− F ( p 1 ) , . . . , − F ( p b ) } if b > 0 ⊲ � f ( p ) if depth of p is 0 or even h ( p ) = − f ( p ) if depth of p is odd ⊲ h ( p ) is the position’s value from the point of view of the player of p . TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 13

Nega-max formulation max min −1 neg −2 −7 neg neg neg neg neg neg max 8 7 5 1 6 7 2 neg neg min −8 −1 Nega-max formulation: Let F ( p ) be the greatest possible value achievable from position p against the optimal defensive strategy. • � h ( p ) if b = 0 F ( p ) = max {− F ( p 1 ) , . . . , − F ( p b ) } if b > 0 ⊲ � f ( p ) if depth of p is 0 or even h ( p ) = − f ( p ) if depth of p is odd ⊲ h ( p ) is the position’s value from the point of view of the player of p . TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 14

Nega-max formulation max neg 7 neg neg min −1 neg −2 −7 neg neg neg neg neg neg max 8 7 5 1 6 7 2 neg neg min −8 −1 Nega-max formulation: Let F ( p ) be the greatest possible value achievable from position p against the optimal defensive strategy. • � h ( p ) if b = 0 F ( p ) = max {− F ( p 1 ) , . . . , − F ( p b ) } if b > 0 ⊲ � f ( p ) if depth of p is 0 or even h ( p ) = − f ( p ) if depth of p is odd ⊲ h ( p ) is the position’s value from the point of view of the player of p . TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 15

Algorithm: Nega-max Algorithm F (position p , integer depth ) • determine the successor positions p 1 , . . . , p b • if b = 0 // a terminal node or depth = 0 // remaining depth to search or time is running up // from timing control or some other constraints are met // add knowledge here • then return h ( p ) else • begin ⊲ m := −∞ ⊲ for i := 1 to b do ⊲ begin ⊲ t := − F ( p i , depth − 1) // recursive call, the returned value is negated ⊲ if t > m then m := t // always find a max value ⊲ end • end • return m TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 16

Nega-max: comments Another brute-force method to try all possibilities. • Use h ( p ) instead of f ( p ) . ⊲ Zero-sum game: if one player thinks a position p has a value of w , then the other player thinks it is − w . ⊲ min { x, y, z } = − max {− x, − y, − z } . ⊲ max { x, y, z } = − min {− x, − y, − z } . • Watch out the code in dealing with search termination conditions. ⊲ Reach a given searching depth. ⊲ Timing control. ⊲ Other constraints such as the score is good or bad enough. Notations: • F ′ means the Mini-max version. ⊲ Need a G ′ companion. ⊲ Easy to explain. • F means the Negamax version. ⊲ Simpler code. ⊲ Maybe difficult to explain. TCG: α - β Pruning, 20191107, Tsan-sheng Hsu c � 17

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction Alpha-beta pruning is the standard searching procedure used for solving 2-person perfect-information zero sum

Alpha- -beta pruning beta pruning Example Alpha Example reduce the branching factor of

Natural Target Pruning Making Proper Pruning Cuts Natural Target Pruning In this lesson we

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to

Beta star measurement G. Wang and M.Bai Yellow beta star and chromatic beta beat measurement

BASICS Natural Target Pruning Terminology and Tools Reasons for Pruning Fruit Trees

Pruning for Cropload Management and Productivity 2013 Winter Pruning Workshop Dr. Mercy

ECE 4524 Artificial Intelligence and Engineering Applications Meeting 6: Alpha-Beta Pruning,

More on games (Ch. 5.4-5.6) Announcements Writing 2 posted Minimax Pruning in real life:

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

CS440/ECE448 Lecture 11: Alpha-Beta Pruning; Limited Horizon Slides by Mark Hasegawa-Johnson

Extensive Form Games 2/10/17 Alpha-Beta Pruning Exercise + + + + + + +

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Minimax strategies, alpha beta pruning Lirong Xia Reminder Project 1 due tonight Makes

Computing the eigenvalues of a Joint work with Raf Vandebril and Paul Van Dooren companion

systematics uncertainties in the determination of the local dark matter density Miguel Pato in

Stream Reasoning and Multi-Context Systems Thomas Eiter Institute of Logic and Computation

CCL Industries Inc. CCL Industries Inc. Investor Update 4th Quarter 2013 Review February 20,

Eric Duchne (Institut Fourier, Grenoble) Aviezri S. Fraenkel (Weizmann Institute, Rehovot)

Scheduling Plans Introductory Concepts CSCI [4|6] 730 Embellish on the introductory

Heavy Ion planning Jamie Boyd & Christoph Schwick (LPC)

Timeline for FEMB Development, System Tests and protoDUNE Run 2 Marco Verzocchi - Fermilab Cold

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction Alpha-beta pruning is the standard searching procedure used for solving 2-person perfect-information zero sum

Alpha- -beta pruning beta pruning Example Alpha Example reduce the branching factor of

Natural Target Pruning Making Proper Pruning Cuts Natural Target Pruning In this lesson we

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to

Beta star measurement G. Wang and M.Bai Yellow beta star and chromatic beta beat measurement

BASICS Natural Target Pruning Terminology and Tools Reasons for Pruning Fruit Trees

Pruning for Cropload Management and Productivity 2013 Winter Pruning Workshop Dr. Mercy

ECE 4524 Artificial Intelligence and Engineering Applications Meeting 6: Alpha-Beta Pruning,

More on games (Ch. 5.4-5.6) Announcements Writing 2 posted Minimax Pruning in real life:

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

CS440/ECE448 Lecture 11: Alpha-Beta Pruning; Limited Horizon Slides by Mark Hasegawa-Johnson

Extensive Form Games 2/10/17 Alpha-Beta Pruning Exercise + + + + + + +

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Minimax strategies, alpha beta pruning Lirong Xia Reminder Project 1 due tonight Makes

Computing the eigenvalues of a Joint work with Raf Vandebril and Paul Van Dooren companion

systematics uncertainties in the determination of the local dark matter density Miguel Pato in

Stream Reasoning and Multi-Context Systems Thomas Eiter Institute of Logic and Computation

CCL Industries Inc. CCL Industries Inc. Investor Update 4th Quarter 2013 Review February 20,

Eric Duchne (Institut Fourier, Grenoble) Aviezri S. Fraenkel (Weizmann Institute, Rehovot)

Scheduling Plans Introductory Concepts CSCI [4|6] 730 Embellish on the introductory

Heavy Ion planning Jamie Boyd &amp; Christoph Schwick (LPC)

Timeline for FEMB Development, System Tests and protoDUNE Run 2 Marco Verzocchi - Fermilab Cold

Heavy Ion planning Jamie Boyd & Christoph Schwick (LPC)