approximating values of generalized reachability
play

Approximating Values of Generalized-Reachability Stochastic Games - PowerPoint PPT Presentation

Approximating Values of Generalized-Reachability Stochastic Games Maximilian Weininger joint work with Pranav Ashok, Krishnendu Chatterjee, Jan Kretnsk, Tobias Winkler HIGHLIGHTS 2020 (Paper appeared at LICS 2020) Model The problem The


  1. Approximating Values of Generalized-Reachability Stochastic Games Maximilian Weininger joint work with Pranav Ashok, Krishnendu Chatterjee, Jan Kretínský, Tobias Winkler HIGHLIGHTS 2020 (Paper appeared at LICS 2020)

  2. Model

  3. The problem

  4. The problem

  5. The problem

  6. The problem

  7. The problem

  8. The problem Want: Pareto frontier

  9. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  10. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  11. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  12. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  13. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  14. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  15. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? [CFK+13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  16. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  17. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  18. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  19. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  20. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  21. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation

  22. The problem Want: Pareto frontier How: Value iteration from below [CFK+13] Problem: When to stop? Solution: Convergent over-approximation Approximate values of generalized-reachability stochastic games for arbitrarily small precision.

  23. Our contribution Over-approximation need not converge (multiple fixpoints)

  24. Our contribution Over-approximation need not converge (multiple fixpoints) - Consider single directions - Apply single-dimensional solution

  25. Our contribution Over-approximation need not converge (multiple fixpoints) - Consider single directions - Apply single-dimensional solution - Group directions into regions

  26. Context Single-dim SG Multi-dim MDP Multi-dim SG Computational NP ∩ coNP [Con92] PSPACE-complete [RRS15] complexity Strategy complexity [Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015.

  27. Context Single-dim SG Multi-dim MDP Multi-dim SG Computational NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open complexity Strategy complexity [Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015.

  28. Context Single-dim SG Multi-dim MDP Multi-dim SG Computational NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open complexity Strategy complexity Memoryless Randomized memoryless deterministic [Con92] (absorbing) [EKVY07]; Finite mem. in general [RRS15] [Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015. [EKVY07] Etessami, K., Kwiatkowska, M., Vardi, M. Y., & Yannakakis, M. Multi-objective model checking of Markov decision processes. TACAS 2007.

  29. Context Single-dim SG Multi-dim MDP Multi-dim SG Computational NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open complexity Strategy complexity Memoryless Randomized memoryless Inf. mem. (absorbing) deterministic [Con92] (absorbing) [EKVY07]; [CFK+13] Finite mem. in general [RRS15] [Con92] Condon, A. (1992). The complexity of stochastic games [RRS15] Randour, M., Raskin, J. F., & Sankur, O. Percentile queries in multi-dimensional Markov decision processes. CAV 2015. [EKVY07] Etessami, K., Kwiatkowska, M., Vardi, M. Y., & Yannakakis, M. Multi-objective model checking of Markov decision processes. TACAS 2007. [CFK + 13] Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., & Wiltsche, C. On stochastic games with multiple objectives. MFCS 2013.

  30. Context Single-dim SG Multi-dim MDP Multi-dim SG Computational NP ∩ coNP [Con92] PSPACE-complete [RRS15] Decidability open complexity Strategy complexity Memoryless Randomized memoryless Inf. mem. (absorbing) deterministic [Con92] (absorbing) [EKVY07]; [CFK+13] Finite mem. in general [RRS15] Over-approximation need not converge (multiple fixpoints)

Recommend


More recommend