energy and mean payoff games
play

Energy and Meanpayoff Games Laurent Doyen LSV, ENS Cachan & - PowerPoint PPT Presentation

Energy and Meanpayoff Games Laurent Doyen LSV, ENS Cachan & CNRS joint work with Aldric Degorre, Raffaella Gentilini, JeanFranois Raskin, Szymon Torunczyk ACTS 2010, Chennai Synthesis problem Specification avoid failure,


  1. Energy and Mean�payoff Games Laurent Doyen LSV, ENS Cachan & CNRS joint work with Aldric Degorre, Raffaella Gentilini, Jean�François Raskin, Szymon Torunczyk ACTS 2010, Chennai

  2. Synthesis problem Specification avoid failure, ensure progress, etc. Correctness relation

  3. Synthesis problem System - Model Specification avoid failure, ensure progress, etc. Correctness relation Solved as a game – system vs. environment solution = winning strategy This talk: quantitative games (resource-constrained systems)

  4. Energy games (staying alive)

  5. Energy games (CdAHS03,BFLM08) Maximizer Minimizer positive weight = reward play: (1,4) (4,1) (1,4) (4,1) … weights: �1 +2 �1 +2 … energy level: � 0 2 1 3 2 4 3 …

  6. Energy games (CdAHS03,BFL+08) Maximizer Minimizer positive weight = reward play: (1,4) (4,1) (1,4) (4,1) … weights: �1 +2 �1 +2 … energy level: � 0 2 1 3 2 4 3 … � Initial credit

  7. Energy games Strategies: Maximizer Minimizer play: Infinite sequence of edges consistent with strategies and outcome is winning if: Energy level

  8. Energy games Decision problem: Decide if there exist an initial credit c 0 and a strategy of the maximizer to maintain the energy level always nonnegative.

  9. Energy games Decision problem: Decide if there exist an initial credit c 0 and a strategy of the maximizer to maintain the energy level always nonnegative. For energy games, memoryless strategies suffice.

  10. Energy games c 0 =2 c 0 =2 Decision problem: Decide if there exist an initial credit c 0 and a strategy of the maximizer to maintain the energy level always nonnegative. For energy games, memoryless strategies suffice. c 0 =1 c 0 =0 A memoryless strategy is winning if all cycles are nonnegative when is fixed.

  11. Energy games c 0 =2 c 0 =2 Decision problem: Decide if there exist an initial credit c 0 and a strategy of the maximizer to maintain the energy level always nonnegative. For energy games, memoryless strategies suffice. c 0 =1 c 0 =0 A memoryless strategy is winning if all cycles are nonnegative when is fixed.

  12. Algorithm

  13. Algorithm for energy games Initial credit is useful to survive before a cycle is formed Length(AcyclicPath) ≤ Q Q: #states E: #edges W: maximal weight

  14. Algorithm for energy games Initial credit is useful to survive before a cycle is formed Length(AcyclicPath) ≤ Q Q: #states E: #edges Minimum initial credit is at most Q—W W: maximal weight

  15. Algorithm for energy games The minimum initial credit is such that: in Maximizer state q: in Minimizer state q: Compute successive under�approximations of the minimum initial credit.

  16. Algorithm for energy games 0 0 Fixpoint algorithm: � start with 0 0

  17. Algorithm for energy games 0 1 0 2 Fixpoint algorithm: � start with � iterate at Maximizer states: 0 1 0 0 at Minimizer states:

  18. Algorithm for energy games 0 1 2 0 2 2 Fixpoint algorithm: � start with � iterate at Maximizer states: 0 1 1 0 0 0 at Minimizer states:

  19. Algorithm for energy games 0 1 2 0 2 2 Fixpoint algorithm: � start with � iterate at Maximizer states: 0 1 1 0 0 0 at Minimizer states: Termination argument: monotonic operators, and finite codomain Complexity: O(E—Q—W)

  20. Mean�payoff games

  21. Mean�payoff games (EM79) Maximizer Minimizer positive weight = reward play: (1,4) (4,1) (1,4) (4,1) … weights: �1 +2 �1 +2 … mean�payoff value: (limit of weight average)

  22. Mean�payoff games (EM79) Mean�payoff value: either or Decision problem: Given a rational threshold , decide if there exists a strategy of the maximizer to ensure mean�payoff value at least . Note: we can assume e.g. by shifting all weights by .

  23. Mean�payoff games Mean�payoff value: either or Decision problem: Given a rational threshold , decide if there exists a strategy of the maximizer to ensure mean�payoff value at least . Assuming A memoryless strategy is winning if all cycles are nonnegative when is fixed.

  24. Mean�payoff games Mean�payoff value: either or Decision problem: log�space equivalent to Given a rational threshold , decide energy games [BFL+08] if there exists a strategy of the maximizer to ensure mean�payoff value at least . Assuming A memoryless strategy is winning if all cycles are nonnegative when is fixed.

  25. Complexity Energy games Mean�payoff games O(E—Q—W) (this talk) Decision problem O(E—Q—W) O(E—Q 2 —W) [ZP96] Deterministic Pseudo�polynomial algorithms

  26. Outline ► Perfect information • Mean�payoff games • Energy games • Algorithms ► Imperfect information • Energy with fixed initial credit • Energy with unknown initial credit • Mean�payoff

  27. Imperfect information (staying alive in the dark)

  28. Imperfect information – Why ? System - Model Specification avoid failure, ensure progress, etc. Correctness relation • Private variables/internal state • Noisy sensors Strategies should not rely on hidden information

  29. Imperfect information – How ? � � • Coloring of the state space observations = set of states with the same color

  30. Imperfect information – How ? � � � ��� � � Maximizer states only Playing the game: 1. Maximizer chooses an action (a or b) 2. Minimizer chooses successor state (compatible with Maximizer’s action) 3. The color of the next state is visible to Maximizer

  31. Imperfect information – How ? � � ��� � Actions Observations

  32. Imperfect information – How ? Observation�based strategies � � ��� Goal: all outcomes have � � nonnegative energy level, � or nonnegative mean�payoff value Actions Observations

  33. Complexity Energy games Mean�payoff games O(E—Q—W) (this talk) Perfect O(E—Q—W) information O(E—Q 2 —W) [ZP96] Imperfect ? ? information

  34. Imperfect information Observation�based strategies � � ��� Goal: all outcomes have � nonnegative energy level, � � or nonnegative mean�payoff value Two variants for Energy games: � fixed initial credit � unknown initial credit

  35. Fixed initial credit Can you win with initial credit = 3 ? Actions Observations

  36. Fixed initial credit Can you win with initial credit = 3 ? Keep track of � which can be the current state, and � what is the worst�case energy level Initially: (3, ⊥ , ⊥ )

  37. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2)

  38. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2) � � � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ )

  39. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2) � � � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ ) Stop search whenever � negative value, or � comparable ancestor

  40. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2) � � � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ ) � � � � (4, ⊥ , ⊥ ) ( ⊥ ,1,0) ( ⊥ ,1,4) (2, ⊥ , ⊥ ) Stop search whenever: � negative value, or � comparable ancestor

  41. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2) � � � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ ) � � � � (4, ⊥ , ⊥ ) ( ⊥ ,1,0) ( ⊥ ,1,4) (2, ⊥ , ⊥ ) Initial credit = 3 is not sufficient !

  42. Example (3, ⊥ , ⊥ ) ��� ( ⊥ ,2,2) � � � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ ) � � � � (4, ⊥ , ⊥ ) ( ⊥ ,1,0) ( ⊥ ,1,4) (2, ⊥ , ⊥ ) Search will terminate because is well�quasi ordered.

  43. Example Upper bound: non�primitive recursive (3, ⊥ , ⊥ ) ��� Lower bound: EXPSPACE�hard ( ⊥ ,2,2) Proof (not shown in this talk): reduction from � � the infinite execution problem of Petri Nets. � � (3, ⊥ , ⊥ ) ( ⊥ ,2,1) ( ⊥ ,1,3) (3, ⊥ , ⊥ ) � � � � (4, ⊥ , ⊥ ) ( ⊥ ,1,0) ( ⊥ ,1,4) (2, ⊥ , ⊥ ) Search will terminate because is well�quasi ordered.

  44. Complexity Energy games Mean�payoff games (unknown initial credit) O(E—Q—W) (this talk) Perfect O(E—Q—W) information O(E—Q 2 —W) [ZP96] Imperfect r.e. ? information

  45. Memory requirement With imperfect information: Corollary: Finite�memory strategies suffice in energy games

  46. Memory requirement With imperfect information: Corollary: Finite�memory strategies suffice in energy games In mean�payoff games: • �������� memory may be required • limsup vs. liminf definition do ��� coincide

  47. Memory requirement Energy games Mean�payoff games Perfect memoryless memoryless information Imperfect finite memory infinite memory information

  48. Unknown initial credit Theorem The unknown initial credit problem for energy games is undecidable. (even for blind games) Proof: Using a reduction from the halting problem of 2�counter machines.

  49. 2�counter machines • 2 counters c 1 , c 2 • increment, decrement, zero test q1: inc c 1 goto q2 q2: inc c 1 goto q3 q3: if c 1 == 0 goto q6 else dec c 1 goto q4 q4: inc c 2 goto q5 q5: inc c 2 goto q3 q6: halt

Recommend


More recommend