Statistical Model Checking and Rare Events
Paolo Zuliani
Joint work with Edmund M. Clarke
Computer Science Department, CMU
Probabilistic Verification
Verification of stochastic system models via statistical model checking.
Temporal logic specification: "the amount of p53 exceeds 10^5 within 20 minutes"
If Φ = "p53 exceeds 10^5 within 20 minutes", then: Probability(Φ) = ?
Equivalently
A biased coin (Bernoulli random variable): Prob(Heads) = p, Prob(Tails) = 1 − p, with p unknown.
Question: what is p?
A solution: flip the coin a number of times, collect the outcomes, and use statistical estimation.
Statistical Model Checking
Key idea (Håkan Younes, 2001): system behavior w.r.t. a property Φ can be modeled by a Bernoulli random variable of parameter p, i.e., the system satisfies Φ with (unknown) probability p.
Question: what is p?
Draw a sample of system simulations and use statistical estimation: return "p lies in the interval (a, b)" with high probability. A sketch of this step follows below.
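To make the estimation step concrete, here is a minimal sketch (not part of the original slides). `simulate` is a hypothetical stand-in for "run one system simulation and check Φ"; the interval uses the standard normal approximation.

```python
import math
import random

def estimate_probability(simulate, n_samples=10_000, z=2.58):
    """Estimate p = Prob(system satisfies Phi) from n_samples i.i.d.
    simulations; returns the estimate and an approximate 99%
    confidence interval (z = 2.58), via the normal approximation."""
    successes = sum(simulate() for _ in range(n_samples))
    p_hat = successes / n_samples
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n_samples)
    return p_hat, (p_hat - half_width, p_hat + half_width)

# Hypothetical stand-in for a simulator: a biased coin with p = 0.3.
p_hat, ci = estimate_probability(lambda: random.random() < 0.3)
```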
Statistical Model Checking
Statistical model checking is a Monte Carlo method.
Problems arise when p is very small (a rare event): the number of simulations (coin flips) needed to estimate p accurately grows too large.
Need to deal with this …
Rare Events
Estimate Prob(X ≥ t) = p_t, when p_t is small (say 10^-9).
Standard (crude) Monte Carlo: generate K i.i.d. samples X_1, …, X_K of X and return the estimator
  e_K = (1/K) Σ_{i=1}^K I(X_i ≥ t)
Prob(e_K → p_t) = 1 as K → ∞ (strong law of large numbers). A sketch of this estimator follows below.
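A minimal sketch of the crude estimator (not from the slides; the exponential distribution is a hypothetical example chosen so p_t is known in closed form), illustrating why it fails on rare events:

```python
import random

def crude_mc(sample_x, t, K):
    """Crude Monte Carlo estimator e_K = (1/K) * sum_i I(X_i >= t)."""
    return sum(sample_x() >= t for _ in range(K)) / K

# Hypothetical example: X ~ Exponential(1), so p_t = exp(-t).
# For t = 20, p_t ~ 2e-9: with any feasible K the estimator below
# almost surely returns 0 -- the rare-event problem.
e_K = crude_mc(lambda: random.expovariate(1.0), t=20.0, K=100_000)
```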
Rare Events
E[e_K] = p_t and Var[e_K] = p_t(1 − p_t)/K.
By the Central Limit Theorem (CLT), the distribution of e_K converges to a normal distribution with mean p_t and variance p_t(1 − p_t)/K.
Relative Error (RE) = sqrt(Var[e_K]) / E[e_K] = sqrt(p_t(1 − p_t)/K) / p_t
Rare Events
RE = sqrt(p_t(1 − p_t)/K) / p_t = sqrt((1 − p_t)/(p_t · K))
Fix K; then RE is unbounded as p_t → 0. More accuracy requires more samples.
Want a confidence interval of relative accuracy δ and coverage probability c, i.e., the estimate e_K must satisfy: Prob(|e_K − p_t| < δ·p_t) ≥ c.
How many samples do we need?
Rare Events
From the CLT, a 99% (approximate) confidence interval of relative accuracy δ needs about
  K ≈ (1 − p_t)/(δ² · p_t) samples.
Then Prob(|e_K − p_t| < δ·p_t) ≈ 0.99.
Example: for p_t = 10^-9 and δ = 10^-2 (i.e., 1% relative accuracy) we need about 10^13 samples!
(For comparison: Bayesian estimation requires about 6×10^6 samples with p_t = 10^-4 and δ = 10^-1.)
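A quick check of the slide's sample-size formula (a sketch that transcribes the formula exactly as stated, with the coverage-level constant absorbed into the approximation):

```python
def samples_needed(p_t, delta):
    """CLT-based sample size K ~ (1 - p_t) / (delta**2 * p_t)
    for relative accuracy delta, as on the slide."""
    return (1 - p_t) / (delta**2 * p_t)

print(f"{samples_needed(1e-9, 1e-2):.2e}")  # ~1.00e+13, matching the slide
```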
A Solution
Importance Sampling (1940s): a variance-reduction technique.
Can result in a dramatic reduction in sample size.
Importance Sampling
The fundamental Importance Sampling identity:
  p_t = E_f[I(X ≥ t)] = ∫ I(x ≥ t) f(x) dx = ∫ I(x ≥ t) (f(x)/f*(x)) f*(x) dx = E_{f*}[I(X ≥ t) W(X)]
where f is the density of X, f* is any density that is positive wherever I(x ≥ t) f(x) > 0, and W(x) = f(x)/f*(x) is the likelihood ratio.
Importance Sampling
Estimate p_t = E_f[I(X ≥ t)] = Prob(X ≥ t).
Take a sample X_1, …, X_K i.i.d. as f (sampling from f).
The crude Monte Carlo estimator is e_K = (1/K) Σ_{i=1}^K I(X_i ≥ t).
Importance Sampling
Define a biasing density f* and compute the IS estimator (now sampling from f*!):
  e_K = (1/K) Σ_{i=1}^K I(X_i ≥ t) W(X_i), with X_1, …, X_K i.i.d. as f*
where W(x) = f(x)/f*(x) is the likelihood ratio. A sketch follows below.
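A minimal sketch of the IS estimator (not from the slides), assuming the classic example X ~ N(0, 1) with a mean-shifted Gaussian as biasing density; for this pair the likelihood ratio has the closed form used in the comment:

```python
import math
import random

def is_estimate(t, K, mu_star):
    """Importance-sampling estimate of p_t = Prob(X >= t) for X ~ N(0,1),
    using the mean-shifted biasing density f* = N(mu_star, 1).
    For this pair, W(x) = f(x)/f*(x) = exp(-mu_star*x + mu_star**2/2)."""
    total = 0.0
    for _ in range(K):
        x = random.gauss(mu_star, 1.0)                       # sample from f*, not f
        if x >= t:                                           # I(x >= t)
            total += math.exp(-mu_star * x + mu_star**2 / 2) # times W(x)
    return total / K

# Prob(N(0,1) >= 6) ~ 1e-9: crude MC is hopeless at feasible K, but
# IS with mu_star = t concentrates the samples on the rare event.
print(is_estimate(t=6.0, K=100_000, mu_star=6.0))
```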
Importance Sampling
Need to choose a "good" biasing density (low variance).
Optimal density: f*(x) = I(x ≥ t) f(x) / p_t  — but p_t is unknown!
With this choice every term of the IS estimator is constant:
  e_K = (1/K) Σ_{i=1}^K I(X_i ≥ t) W(X_i) = (1/K) Σ_{i=1}^K I(X_i ≥ t) f(X_i) · p_t / (I(X_i ≥ t) f(X_i)) = p_t
Zero variance! (But p_t is unknown …)
Cross-Entropy Method (R. Rubinstein)
Suppose the density of X lies in a parametric family of densities {f(·; v)}, and the "nominal" density f is f(·; u).
Key idea: choose a parameter v such that the distance between f* and f(·; v) is minimal.
The Kullback-Leibler divergence (cross-entropy) is a measure of "distance" between two densities.
First used for rare-event simulation by Rubinstein (1997).
Cross-Entropy Method
The KL divergence (cross-entropy) of densities g, h is
  D(g, h) = E_g[ln(g(X)/h(X))] = ∫ g(x) ln g(x) dx − ∫ g(x) ln h(x) dx
D(g, h) ≥ 0, with D(g, h) = 0 iff g = h; note D(g, h) ≠ D(h, g) in general.
Idea: within the family {f(·; v)}, find the member closest to the optimal density f*, i.e., solve min_v D(f*, f(·; v)).
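A tiny numeric check of the asymmetry (a sketch, using Bernoulli densities as a hypothetical example):

```python
import math

def kl_bernoulli(p, q):
    """D(g, h) for g = Bernoulli(p), h = Bernoulli(q)."""
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

# D(g, h) != D(h, g): prints ~0.368 and ~0.511.
print(kl_bernoulli(0.1, 0.5), kl_bernoulli(0.5, 0.1))
```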
Cross-Entropy Method
The Cross-Entropy Method has two basic steps:
1. find v* = argmin_v D(f*, f(·; v))
2. run importance sampling with biasing density f(·; v*)
Step 2 is "easy"; step 1 is not so easy. A sketch of the standard iterative scheme for step 1 follows below.
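A minimal sketch of the standard multilevel CE iteration (not from the slides), assuming again X ~ N(0, 1) and the family f(·; v) = N(v, 1); the intermediate level gamma and elite fraction rho are the usual CE machinery, not notation from the talk:

```python
import math
import random

def ce_find_mu(t, K=10_000, rho=0.1, iters=20):
    """Multilevel CE sketch for X ~ N(0,1), family f(.; v) = N(v, 1),
    nominal u = 0. Step 1 is approximated iteratively: raise an
    intermediate level gamma toward t and update v by the
    likelihood-ratio-weighted mean of the elite samples
    (the CE update for a Gaussian mean)."""
    v = 0.0                                       # start at the nominal u = 0
    for _ in range(iters):
        xs = sorted(random.gauss(v, 1.0) for _ in range(K))
        gamma = min(t, xs[int((1 - rho) * K)])    # (1 - rho)-quantile, capped at t
        elite = [x for x in xs if x >= gamma]
        # W(x) = f(x; u)/f(x; v) = exp(-v*x + v*v/2) for u = 0
        w = [math.exp(-v * x + v * v / 2) for x in elite]
        v = sum(wi * x for wi, x in zip(w, elite)) / sum(w)
        if gamma >= t:                            # level has reached t: done
            break
    return v

# Step 2: run importance sampling (e.g., is_estimate above) with
# biasing density f(.; v*) = N(ce_find_mu(6.0), 1).
```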
Cross-Entropy Method
Step 1:
  v* = argmin_v D(f*, f(·; v))
     = argmin_v E_{f*}[ln(f*(X)/f(X; v))]
     = argmin_v [ ∫ f*(x) ln f*(x) dx − ∫ f*(x) ln f(x; v) dx ]
The first integral does not depend on v, so
  v* = argmax_v ∫ f*(x) ln f(x; v) dx
     = argmax_v ∫ (I(x ≥ t) f(x; u)/p_t) ln f(x; v) dx
     = argmax_v ∫ I(x ≥ t) f(x; u) ln f(x; v) dx
(the unknown constant 1/p_t does not affect the argmax).
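The final argmax is an expectation under the nominal density, argmax_v E_u[I(X ≥ t) ln f(X; v)], so it can be approximated by a sample average. A sketch (not from the slides) for the Gaussian family, where the maximizer has a closed form; it also shows why the multilevel iteration above is needed in practice:

```python
import random

def v_star_sample_average(t, K=1_000_000, u=0.0):
    """Stochastic counterpart of step 1 for the family f(.; v) = N(v, 1):
    argmax_v (1/K) sum_i I(X_i >= t) ln f(X_i; v), with X_i ~ f(.; u).
    For a Gaussian mean this is just the average of the samples that
    exceed t (a weighted maximum-likelihood estimate)."""
    hits = [x for x in (random.gauss(u, 1.0) for _ in range(K)) if x >= t]
    # If t is very rare, no sample hits it -- exactly why the multilevel
    # CE iteration sketched earlier raises the level gradually.
    return sum(hits) / len(hits) if hits else None
```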