PAC Statistical Model Checking for Markov Decision Processes and - PowerPoint PPT Presentation

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games 1 Pranav Ashok, Jan Kˇ ret´ ınsk´ y, Maximilian Weininger Technical University of Munich Highlights of Logic, Automata and Games Warsaw, Poland September 19, 2019 1 based on paper presented at CAV 2019

Stochastic Game Reachability 0 . 2 a b c 0 . 8 Objective player: maximize P(F ) player: minimize P(F ) Reachability in limited information stochastic games 2/6

This work: Black-box (limited information setting) Unknown successor distribution Problem statement Compute V ( s ) = max σ min τ P σ,τ ( F ) = min τ max σ P σ,τ ( F ) s s with guarantees Reachability in limited information stochastic games 3/6

Background ◮ Seminal paper on Stochastic Games [ Condon 90 ] quadratic programming, strategy iteration, value iteration Reachability in limited information stochastic games 4/6

Background ◮ Seminal paper on Stochastic Games [ Condon 90 ] quadratic programming, strategy iteration, value iteration ◮ Algos not directly applicable on general SG ◮ First practical algorithm for general SG giving guarantees [ Kelmendi et. al. 2018 ] Reachability in limited information stochastic games 4/6

Background ◮ Seminal paper on Stochastic Games [ Condon 90 ] quadratic programming, strategy iteration, value iteration ◮ Algos not directly applicable on general SG ◮ First practical algorithm for general SG giving guarantees [ Kelmendi et. al. 2018 ] ◮ This work: first algorithm for limited information SG Reachability in limited information stochastic games 4/6

The Algorithm Similar to Kelmendi et. al. 2018 while U − L is large 1. Simulate and estimate 2. Back-propagate Reachability in limited information stochastic games 5/6

The Algorithm Similar to Kelmendi et. al. 2018 while U − L is large 1. Simulate and estimate 2. Back-propagate The how ◮ Simulation finds important parts of state space Reachability in limited information stochastic games 5/6

The Algorithm Similar to Kelmendi et. al. 2018 while U − L is large 1. Simulate and estimate 2. Back-propagate The how ◮ Simulation finds important parts of state space ◮ Simulation computes Hoeffding confidence intervals ball around estimate such that real prob. falls in the ball with high confidence Reachability in limited information stochastic games 5/6

The Algorithm Similar to Kelmendi et. al. 2018 while U − L is large 1. Simulate and estimate 2. Back-propagate The how ◮ Simulation finds important parts of state space ◮ Simulation computes Hoeffding confidence intervals ball around estimate such that real prob. falls in the ball with high confidence ◮ Information conservatively back-propagated Reachability in limited information stochastic games 5/6

The Algorithm Similar to Kelmendi et. al. 2018 while U − L is large 1. Simulate and estimate 2. Back-propagate The how ◮ Simulation finds important parts of state space ◮ Simulation computes Hoeffding confidence intervals ball around estimate such that real prob. falls in the ball with high confidence ◮ Information conservatively back-propagated ◮ Other tricks to ensure fixpoint convergence Reachability in limited information stochastic games 5/6

Conclusion ◮ Algorithm for reachability in limited information MDP/SG result ∈ [0 . 6 − ǫ, 0 . 6 + ǫ ] with prob of going wrong 10 − 8 ◮ Implemented and benchmarked in PRISM Model Checker ◮ First algorithm to do so for SG ◮ First practical algorithm for MDPs Reachability in limited information stochastic games 6/6

PAC Statistical Model Checking for Markov Decision Processes and - PowerPoint PPT Presentation

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games 1 Pranav Ashok, Jan K ret nsk y, Maximilian Weininger Technical University of Munich Highlights of Logic, Automata and Games Warsaw, Poland September

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

Markov Chains Markov Processes Discrete-time Markov Chains Continuous-time Markov Chains Dr

Hidden Markov Models Discrete Markov Processes 1 Hidden Markov Models Hidden Markov Models 2

Guiding Financial Controls and Practices for PACs and PAC Treasurers PAC Treasurers Workshop

Model Repair for Markov Decision Model Repair for Markov Decision Model Repair for Markov

Markov chains and Hidden Markov Models 9000 Markov chains and HMMs We will discuss: Markov

CSCE 471/871 Lecture 3: Markov Chains Markov Chains and and Hidden Markov Models Hidden

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

NAPSLO PAC Contributions How contributing to the NAPSLO PAC will benefit you, your company and the

WELCOME June 2011 PAC Presentation Opening Remarks Introductions June 2011 PAC

AAOS Orthopaedic PAC The Orthopaedic PAC is the only national political action committee

LArIAT Fermilab PAC Meeting November 11, 2016 Jen Raaf PAC Charge Fermilab PAC Meeting, J.

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

L ECTURE 15: Regrade requests: L EARNING T HEORY Send us email, and come and see me next

SecLabel: Enhancing RISC-V Platform Security with Labelled Architecture Zhenyu Ning 1,2 , Yinqian

Protocol for Carrying Authentication for Network Access (PANA) (draft-ietf-pana-pana-00.txt)

Albany Medical Center Hospital PPS PAC Meeting July 18, 2016 Meeting Attendance Please email

Meeting January 10, 2018 2 Agenda Networking 1. CCB Update: Where Weve Been and Where

Computational Learning Theory Based on Machine Learning, T. Mitchell, McGRAW Hill, 1997, ch.

La th eorie PAC-Bayes en apprentissage supervis e Pr esentation au LRI de luniversit

Money 101 2019 PAC Conference March 30, 2019 Background Owner of Green Mountain Financial

Sambuz

Useful Links

Newsletter

Mail Us