the multiple dimensions of mean payoff games
play

The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS - PowerPoint PPT Presentation

The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS & LSV, ENS Paris-Saclay RP 2017 About Basics about mean-payoff games Algorithms & Complexity Strategy complexity Memory Strategy complexity Memory


  1. The Multiple Dimensions of Mean-Payoff Games Laurent Doyen CNRS & LSV, ENS Paris-Saclay RP 2017

  2. About Basics about mean-payoff games • Algorithms & Complexity • Strategy complexity – Memory • Strategy complexity – Memory Focus • Equivalent game forms • Techniques for memoryless proofs

  3. Mean-Payoff Mean-Payoff

  4. Mean-Payoff

  5. Mean-Payoff

  6. Mean-Payoff

  7. Mean-Payoff Switching policy to get average power (1,1) ?

  8. Mean-Payoff Switching policy to get average power (1,1) ?

  9. Mean-Payoff Mean-payoff value = limit-average of the visited weights

  10. Mean-Payoff Switching policy • Infinite memory: (1,1) vanishing frequency in q 0 • Infinite memory: (1,1) vanishing frequency in q 0

  11. Mean-Payoff limit ?

  12. Mean-Payoff limit ? limsup liminf Mean-payoff is prefix-independent

  13. Mean-Payoff Switching policy • Infinite memory: (1,1) for liminf • Infinite memory: (1,1) for liminf

  14. Mean-Payoff Switching policy • Infinite memory: (1,1) for liminf & (2,2) for limsup • Infinite memory: (1,1) for liminf & (2,2) for limsup

  15. Games Games

  16. Two-player games

  17. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration

  18. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  19. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  20. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  21. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  22. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  23. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  24. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  25. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  26. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  27. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  28. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Play:

  29. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Strategies = recipe to extend the play prefix Player 1: Player 2:

  30. Two-player games Player 1 (maximizer) Player 2 (minimizer) • Turn-based • Turn-based • Infinite duration Strategies = recipe to extend the play prefix Player 1: outcome of two strategies is a play Player 2:

  31. Mean-payoff games Mean-payoff games

  32. Mean-payoff games Mean-payoff game: positive and negative weights (encoded in binary) Decision problem: Decide if there exists a player-1 strategy to ensure mean-payoff value ≥ 0 Value problem:

  33. Mean-payoff games Key ingredients: • identify memory requirement: infinite vs. finite vs. memoryless • solve 1-player games (i.e., graphs) Key arguments for memoryless proof: • backward induction • shuffle of plays • nested memoryless objectives

  34. Reduction to Reachability Games

  35. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  36. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  37. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0)

  38. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0) If player 1 wins � only positive cycles are formed � mean-payoff value ≥ 0 If player 2 wins � only negative cycles are formed � mean-payoff value < 0 (Note: limsup vs. liminf does not matter)

  39. Reduction to Reachability Games Reachability objective: positive cycles (v ≥ 0) positive cycles (v ≥ 0) Mean-payoff game � Ensuring positive cycles Memoryless strategy transfers to finite-memory mean-payoff winning strategy

  40. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ?

  41. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ?

  42. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  43. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  44. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  45. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Progress measure: minimum initial credit to stay always positive

  46. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Choose successor to stay above minimum credit minimum credit such that Progress measure: minimum initial credit to stay always positive In choose such that

  47. Strategy Synthesis Memoryless mean-payoff winning strategy ? winning strategy ? Choose successor to stay above minimum credit minimum credit such that Progress measure: minimum initial credit to stay always positive In choose such that

  48. Memoryless proofs Key arguments for memoryless proof: • backward induction • backward induction • shuffle of plays • nested memoryless objectives

  49. Energy Games Energy: min-value of the prefix. (if positive cycle; otherwise ∞) Mean-payoff: average-value of the cycle. Mean-payoff: average-value of the cycle.

  50. Energy Games Winning strategy ?

  51. Energy Games Winning strategy ? Follow the minimum initial credit !

  52. Multi-dimension Multi-dimension games

  53. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) • Mean-payoff:

  54. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) same ? • Mean-payoff: same as positive cycles ?

  55. Multi-dimension games Multiple resources • Energy: initial credit to stay always above (0,0) • Energy: initial credit to stay always above (0,0) same ? • Mean-payoff: same as positive cycles ? If player 1 can ensure positive simple cycles, then energy and mean-payoff are satisfied. Not the converse !

  56. Multi-dimension games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient

  57. Multi-dimension games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient Then σ’ 1 is winning Let σ 1 be winning On each branch and finite memory L L 1 stop and play ... ... ... ... L 2 as from L 1 ! ... ... ... ... ... ... ... ... With L 1 ≤L 2 ( ℕ d ,≤) is well-quasi ordered wqo + Koenig’s lemma

  58. Multi-energy games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient For player 2 ?

  59. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. ‘left’ game ‘right’ game

  60. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. c l c r c l+ c r ⇒ ‘left’ game ‘right’ game Play is a shuffle of left-game play and right-game play Energy is sum of them

  61. Multi-energy games For player 2, memoryless strategies are sufficient • induction on player-2 states • if ∃ initial credit against all memoryless strategies, then ∃ initial credit against all arbitrary strategies. c l c r c l+ c r ⇒ ‘left’ game ‘right’ game Value against Value against memoryless Play is a shuffle of left-game play arbitrary strategies and right-game play strategies Energy is sum of them In general, we need

  62. Memoryless proofs Key arguments for memoryless proof: • backward induction • backward induction • shuffle of plays • nested memoryless objectives

  63. Multi-energy games If player 1 has initial credit to stay always positive (Energy) then finite-memory strategies are sufficient For player 2, memoryless strategies are sufficient coNP ?

Recommend


More recommend