Continuous-time Markov Decisions based on Partial Exploration - PowerPoint PPT Presentation

Continuous-time Markov Decisions based on Partial Exploration Pranav Ashok Technical University of Munich Highlights 2018, Berlin Joint work with Yuliya Butkova 1 , Holger Hermanns 1 and Jan Kretinsky 2 1 Saarland University, Germany 2 Technical University of Munich, Germany 1

Motivation By Gareth Jones [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0), from Wikimedia Commons 2

Motivation - n students mail @ λ 1 , λ 2 ,..., λ n /day - you pick a student’s mail to process it - if processed: remove from queue - else : put it back into queue By Gareth Jones [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0), from Wikimedia Commons 3

Motivation - n students mail @ λ 1 , λ 2 ,..., λ n /day - you pick a student’s mail to process it - if processed: remove from queue - else : put it back into queue Q1: What is the max. prob. (over all strategies) that all queues are empty at the end of the week? 4 By Gareth Jones [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0), from Wikimedia Commons

Motivation - n students mail @ λ 1 , λ 2 ,..., λ n /day - you pick a student’s mail to process it - if processed: remove from queue - else : put it back into queue Q1: What is the max. prob. (over all strategies) that all queues are empty at the end of the week? Q2: What is the min. prob. that student X quits your group after a semester? 5 By Gareth Jones [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0), from Wikimedia Commons

Continuous-time Markov Decision Process (CTMDP) Time-bounded Reachability Maximal probability (over all strategies) of reaching some goal state within T time units max � P � ( ♢ ≤T G) 6

Challenge Existing reachability algorithms sometimes perform extremely bad in practice even though in PTIME Can we improve them? 7

Contributions Framework for time-bounded reachability (TBR) analysis ➔ Use simulations to identify important parts of state-space ➔ Instantiate with standard algorithms to show speed up ➔ 8

Key Idea Partial Exploration Suffices Not necessary to explore all states to get � -optimal solution 9

What can we do with a partial model? 10

What can we do with a partial model? lo��-bo�� m��e� up��-bo�� m��e� 13

The Framework Compute Expand partial Use any solver Initialize lower/upper model to get L and U models U - L > � 14

Partial model through simulations using � sim 15

Experiments I Explored States Size of partial models by π sim Benchmark States % 1,479k 105 0.01 597k 296 0.05 1,000k 559 0.06 7,562k 23309 0.31 2k 2537 93.86 119k - - 16

Experiments I Explored States Size of partial models by π sim Benchmark States % 1,479k 105 0.01 597k 296 0.05 1,000k 559 0.06 7,562k 23309 0.31 2k 2537 93.86 119k - - 17

Experiments II Runtimes TO → > 1800s (30 min) 1,000k 71 1 4 1 1,479k -TO- 2 -TO- 2 597k 251 10 114 15 7,562k 507 -TO- 171 105 18k 6 99 2 -TO- 119k 1475 -TO- 826 -TO- 18

Experiments II Runtimes TO → > 1800s (30 min) 1,000k 71 1 4 1 1,479k -TO- 2 -TO- 2 597k 251 10 114 15 7,562k 507 -TO- 171 105 18k 6 99 2 -TO- 119k 1475 -TO- 826 -TO- 19

Conclusion CTMDP TBR analysis framework based on partial exploration ➔ Partial model through simulations ➔ Usable with any TBR solver* ➔ Good on models with many unimportant/improbable states ➔ *conditions apply, based on simulation strategy 20

Continuous-time Markov Decision Processes (CTMDP) C = (S, A, R , Goal) ● S: finite set of states ; A: finite set of non-det choices ● s Each choice → multiple transitions ● a Each transition has a rate λ = R (s, a, s’) ● Time t at which transition fired ← exp. dist ( λ ) ● λ Next state chosen by a race between transitions ● s’ 22

Continuous-time Markov Decisions based on Partial Exploration - PowerPoint PPT Presentation

Continuous-time Markov Decisions based on Partial Exploration Pranav Ashok Technical University of Munich Highlights 2018, Berlin Joint work with Yuliya Butkova 1 , Holger Hermanns 1 and Jan Kretinsky 2 1 Saarland University, Germany 2 Technical

Markov Chains Markov Processes Discrete-time Markov Chains Continuous-time Markov Chains Dr

Overview Verifying Continuous-Time Markov Chains Negative exponential distributions 1 Lecture

Overview Motivation Verifying Continuous-Time Markov Chains 1 Lecture 1+2: Discrete-Time Markov

Imprecise Markov chains From basic theory to applications II prof. Jasper De Bock Imprecise

Hidden Markov Models Discrete Markov Processes 1 Hidden Markov Models Hidden Markov Models 2

Discrete Time Markov Chains Discrete-Time Markov Chains Books - Introduction to Stochastic

Markov chains and Hidden Markov Models 9000 Markov chains and HMMs We will discuss: Markov

CSCE 471/871 Lecture 3: Markov Chains Markov Chains and and Hidden Markov Models Hidden

Continuous Time Markov Chain Birth and Death Process IE 502: Probabilistic Models Jayendran

Rate-Based Stochastic Fusion Calculus and Angelo Troina Continuous Time Markov Chains Fusion

Overview Partial Constituent Fronting in German The phenomenon: Partial constituent fronting

Stochastic Processes Markov Processes Hamid R. Rabiee 1 Overview o Markov Property o Markov

Discrete time Markov chains Today: Discrete Time Markov Chains, Limiting Discrete time Markov

Chapter 5. Continuous-Time Markov Chains Prof. Shun-Ren Yang Department of Computer Science,

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University Markov Chains and

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University 2 Markov Chains

All Units...Pick up your Charter Renewal Packet & Schedule your Charter Turn-in (about 15

APPROACHES TO MANAGEMENT Dr.M. Thenmozhi Professor Department of Management Studies Indian

Resul sults ts Base sed d Accounta countability bility Are we really making a difference?

Decision making A decision is an intention to use/not to use a factor at a given level:

Making Complex Decisions Paolo Turrini Department of Computing, Imperial College London

Collective Decision Making with Incomplete Individual Opinions Zoi Terzopoulou Institute for

Mathematical Fuzzy Logic in Reasoning and Decision Making under Uncertainty Hykel Hosni

Supporting Robust Decisions with Classification and Data-Mining Algorithms Benjamin Bryant

Sambuz

Useful Links

Newsletter

Mail Us

Continuous-time Markov Decisions based on Partial Exploration - PowerPoint PPT Presentation

Continuous-time Markov Decisions based on Partial Exploration Pranav Ashok Technical University of Munich Highlights 2018, Berlin Joint work with Yuliya Butkova 1 , Holger Hermanns 1 and Jan Kretinsky 2 1 Saarland University, Germany 2 Technical

Markov Chains Markov Processes Discrete-time Markov Chains Continuous-time Markov Chains Dr

Overview Verifying Continuous-Time Markov Chains Negative exponential distributions 1 Lecture

Overview Motivation Verifying Continuous-Time Markov Chains 1 Lecture 1+2: Discrete-Time Markov

Imprecise Markov chains From basic theory to applications II prof. Jasper De Bock Imprecise

Hidden Markov Models Discrete Markov Processes 1 Hidden Markov Models Hidden Markov Models 2

Discrete Time Markov Chains Discrete-Time Markov Chains Books - Introduction to Stochastic

Markov chains and Hidden Markov Models 9000 Markov chains and HMMs We will discuss: Markov

CSCE 471/871 Lecture 3: Markov Chains Markov Chains and and Hidden Markov Models Hidden

Continuous Time Markov Chain Birth and Death Process IE 502: Probabilistic Models Jayendran

Rate-Based Stochastic Fusion Calculus and Angelo Troina Continuous Time Markov Chains Fusion

Overview Partial Constituent Fronting in German The phenomenon: Partial constituent fronting

Stochastic Processes Markov Processes Hamid R. Rabiee 1 Overview o Markov Property o Markov

Discrete time Markov chains Today: Discrete Time Markov Chains, Limiting Discrete time Markov

Chapter 5. Continuous-Time Markov Chains Prof. Shun-Ren Yang Department of Computer Science,

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University Markov Chains and

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University 2 Markov Chains

All Units...Pick up your Charter Renewal Packet &amp; Schedule your Charter Turn-in (about 15

APPROACHES TO MANAGEMENT Dr.M. Thenmozhi Professor Department of Management Studies Indian

Resul sults ts Base sed d Accounta countability bility Are we really making a difference?

Decision making A decision is an intention to use/not to use a factor at a given level:

Making Complex Decisions Paolo Turrini Department of Computing, Imperial College London

Collective Decision Making with Incomplete Individual Opinions Zoi Terzopoulou Institute for

Mathematical Fuzzy Logic in Reasoning and Decision Making under Uncertainty Hykel Hosni

Supporting Robust Decisions with Classification and Data-Mining Algorithms Benjamin Bryant

Sambuz

Useful Links

Newsletter

Mail Us

All Units...Pick up your Charter Renewal Packet & Schedule your Charter Turn-in (about 15