Centre for Research in Statistical Methodology (CRiSM)
http://www2.warwick.ac.uk/fac/sci/statistics/crism/
• Conferences and workshops
• Research Fellow positions
• PhD studentships
• Academic visitor programme
Sequential Importance Sampling for Irreducible Multivariate Diffusions
(or: Importance sampling for mutually singular measures)

Gareth Roberts, University of Warwick
MCQMC, February 2012

Part of ongoing work with Paul Fearnhead, Krys Latuszynski, Omiros Papaspiliopoulos, and Giorgos Sermaidis.
Diffusions

A d-dimensional diffusion is a continuous-time strong Markov process with continuous sample paths. We can define a diffusion as the solution of the Stochastic Differential Equation (SDE):

$$dX_t = \mu(X_t)\,dt + \sigma(X_t)\,dB_t,$$

where B denotes d-dimensional Brownian motion, $\sigma$ is a $d \times d$ matrix and $\mu$ is a d-vector.

Often understood intuitively and constructively via its dynamics over small time intervals. Approximately, for small h,

$$X_{t+h} \mid X_t = x_t \;\sim\; x_t + h\,\mu(x_t) + h^{1/2}\,\sigma(x_t)\,Z,$$

where Z is a d-dimensional standard normal random variable.
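The small-h dynamics above are the Euler–Maruyama scheme. As a minimal illustrative sketch (not part of the original slides), the following simulates an approximate path of a 2-dimensional diffusion; the particular choices of mu and sigma are placeholders.

```python
import numpy as np

def euler_maruyama(x0, mu, sigma, T, h, rng):
    """Approximate a path of dX_t = mu(X_t) dt + sigma(X_t) dB_t on [0, T] using the
    small-h Gaussian dynamics X_{t+h} ~ x_t + h mu(x_t) + h^{1/2} sigma(x_t) Z."""
    d = len(x0)
    n = int(T / h)
    path = np.empty((n + 1, d))
    path[0] = x0
    for i in range(n):
        z = rng.standard_normal(d)
        path[i + 1] = path[i] + h * mu(path[i]) + np.sqrt(h) * sigma(path[i]) @ z
    return path

# Hypothetical example: a 2-d mean-reverting diffusion with constant unit volatility.
mu = lambda x: -x
sigma = lambda x: np.eye(2)
rng = np.random.default_rng(1)
path = euler_maruyama(np.zeros(2), mu, sigma, T=1.0, h=0.01, rng=rng)
```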
Transition Densities

We will denote the transition density of the diffusion by

$$p(y \mid x, h)\,dy = p(X_{t+h} \in dy \mid X_t = x).$$

It satisfies Kolmogorov's forward equation:

$$\frac{\partial}{\partial t}\, p(x \mid y, t) = K_x\, p(x \mid y, t),$$

for some forward operator $K_x$ which acts on x.

Generally the transition density is intractable, with the usual exceptions: constant or linear drifts and a few others ...
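As a concrete instance of the tractable linear-drift case (a standard illustration added here, not from the slides), the scalar Ornstein–Uhlenbeck process $dX_t = -\theta X_t\,dt + \sigma\,dB_t$ with $\theta > 0$ has a Gaussian transition density:

$$X_{t+h} \mid X_t = x \;\sim\; N\!\left( x\,e^{-\theta h},\; \frac{\sigma^2}{2\theta}\big(1 - e^{-2\theta h}\big) \right),$$

so $p(y \mid x, h)$ is an explicit Gaussian density, in contrast with the generic nonlinear-drift case.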
The Exact Algorithm

Generally simulation and inference for diffusions is performed by approximating the diffusion by a discrete-time Markov process.

However, work by Beskos, Papaspiliopoulos and Roberts demonstrates how to simulate from a class of diffusion models where:
• The volatility can be transformed to be constant via the Lamperti transform: i.e. we can find a 1-1 function $\eta$ satisfying the matrix-valued differential equation $\nabla\eta\,\sigma = I_d$.
• The drift of the transformed diffusion is the gradient of a potential: $\mu(x) = \nabla A(x)$.

This can be applied to almost all 1-d diffusions for which the Cameron–Martin–Girsanov (CMG) theorem holds, but only to certain classes of d-dimensional ones.
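For a scalar diffusion the Lamperti transform has an explicit form (a standard fact included here as an illustration, not taken from the slides): setting

$$\eta(x) = \int^x \frac{1}{\sigma(u)}\,du,$$

Itô's formula gives, for $Y_t = \eta(X_t)$,

$$dY_t = \left( \frac{\mu(\eta^{-1}(Y_t))}{\sigma(\eta^{-1}(Y_t))} - \tfrac{1}{2}\,\sigma'(\eta^{-1}(Y_t)) \right) dt + dB_t,$$

so the transformed diffusion has unit volatility.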
Current Approaches: The Exact Algorithm

The Exact Algorithm is a rejection sampler based on proposing paths from a driftless version of the diffusion (with the same volatility). The acceptance probability for the path is (for $\sigma(x) = I_d$) proportional to:

$$\exp\left\{ \int_0^T \mu(X_t)\cdot dX_t - \frac{1}{2}\int_0^T |\mu(X_t)|^2\,dt \right\}
= \exp\left\{ A(X_T) - A(X_0) - \frac{1}{2}\int_0^T \Big( |\mu(X_t)|^2 + \nabla\cdot\mu(X_t) \Big)\,dt \right\}.$$

Whilst this cannot be evaluated, events with this probability can be simulated.
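The phrase "events with this probability can be simulated" refers to Poisson thinning: writing $\phi(x) = \tfrac{1}{2}\big(|\mu(x)|^2 + \nabla\cdot\mu(x)\big)$ and assuming $0 \le \phi \le M$ on the proposed path, the path-integral part of the acceptance probability, $\exp\{-\int_0^T \phi(X_t)\,dt\}$, occurs exactly when no point of a unit-rate Poisson process on $[0,T]\times[0,M]$ falls below the graph of $t \mapsto \phi(X_t)$ (the endpoint term $A(X_T)-A(X_0)$ is handled separately by biasing the proposal's end-point). The sketch below is an illustration of the thinning step only, under the simplifying assumption of a scalar Brownian-motion proposal and a user-supplied bound M; it is not the authors' implementation.

```python
import numpy as np

def thinning_event(phi, M, T, x0, rng):
    """Return True with probability exp(-int_0^T phi(X_t) dt), where X is a scalar
    Brownian motion started at x0 (the driftless proposal). Assumes 0 <= phi <= M.
    Reject iff some unit-rate Poisson point on [0,T]x[0,M] lies below phi(X_t)."""
    kappa = rng.poisson(M * T)                    # number of Poisson points
    times = np.sort(rng.uniform(0.0, T, kappa))
    marks = rng.uniform(0.0, M, kappa)
    x, t = x0, 0.0
    for tau, v in zip(times, marks):
        x = x + np.sqrt(tau - t) * rng.standard_normal()   # Brownian increment to time tau
        t = tau
        if v < phi(x):                            # a point fell below the graph: reject
            return False
    return True

# Toy usage: phi(x) = x^2 / (1 + x^2) is bounded by M = 1.
rng = np.random.default_rng(0)
acc = np.mean([thinning_event(lambda x: x**2 / (1 + x**2), 1.0, 1.0, 0.0, rng)
               for _ in range(2000)])             # estimates E[exp(-int_0^1 phi(B_t) dt)]
```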
The condition $\mu(x) = \nabla A(x)$ is used to remove the stochastic integral. It ensures that Girsanov's formula is bounded for bounded sample paths. The condition that $\sigma(x)$ is constant is so that we can simulate from the driftless diffusion.

• Importance sampling seems doomed if we cannot sample from an absolutely continuous distribution. Consider two diffusions with different diffusion coefficients, $\sigma_1$ and $\sigma_2$: their laws are NOT mutually absolutely continuous ... even though their finite-dimensional distributions typically are.
Avoiding time-discretisation errors: Why?

Beskos, Papaspiliopoulos, Roberts and Fearnhead (2006) extend the rejection sampler to an importance sampler, and show how this can be used to perform inference for diffusions which avoids time-discretisation approximations.

Why may these methods be useful?
• Errors in estimates are purely Monte Carlo, so it is easier to quantify the error.
• Time-discretisation methods may tend to use substantially finer discretisations than are necessary: possible computational gains?
• We want methods which are robust as h → 0.
• Error is $O(C^{-1/2})$, where C is CPU cost. Alternative approaches have errors that can be e.g. $O(C^{-1/3})$ or worse (though see the multilevel Monte Carlo work by Giles).
Our Aim

Our aim was to try and extend the ability to perform inference without time-discretisation approximations to a wider class of diffusions. The key is to be able to unbiasedly estimate expectations, such as $E(f(X_t))$ or $E(f(X_{t_1},\ldots,X_{t_m}))$.

The approach we have developed can be applied to general continuous-time Markov processes, and is a continuous-time version of sequential importance sampling. We construct a signed measure-valued stochastic process (which is non-Markov) $\{\xi_t, t \ge 0\}$ with

$$E(\xi_t(f)) = E(f(X_t)).$$
Importance Sampling

Importance Sampling (IS) is a Monte Carlo integration technique. Consider the integral

$$I = \int f(x)\,p(x)\,dx = \int \frac{h(x)}{q(x)}\,q(x)\,dx,$$

where p(x) and q(x) are densities, f(x) is arbitrary and $p(x) > 0 \Rightarrow q(x) > 0$. Here we are setting $h(x) = f(x)p(x)$. We can view this as an expectation with respect to q(x). Thus:

1. Sample $x_i$, $i = 1,\ldots,N$, iid from q(x);
2. Estimate the integral by the unbiased, consistent estimator:

$$\hat{I} = \frac{1}{N}\sum_{i=1}^{N} \frac{h(x_i)}{q(x_i)}.$$
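A minimal numerical sketch of the estimator above (an illustration, not from the slides): estimate $E_p(f(X))$ for a hypothetical target $p = N(1,1)$ and proposal $q = N(0, 2^2)$, with $f(x) = x^2$.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
N = 100_000

f = lambda x: x**2
p = lambda x: norm.pdf(x, loc=1.0, scale=1.0)   # target density
q = lambda x: norm.pdf(x, loc=0.0, scale=2.0)   # proposal density (heavier-tailed)

x = rng.normal(0.0, 2.0, N)                     # sample from q
I_hat = np.mean(f(x) * p(x) / q(x))             # (1/N) sum h(x_i)/q(x_i), with h = f*p
# True value: E_p(X^2) = Var + mean^2 = 1 + 1 = 2.
```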
Sequential Importance Sampling (SIS)

As this gives an estimate of the expectation of f(X) for arbitrary functions f, we can think of the sample from q(x), and the corresponding weights, as giving an approximation to the distribution defined by p(x).

This idea can be extended to Markov processes:

$$p(x_1,\ldots,x_n) = p(x_1)\prod_{i=2}^{n} p(x_i \mid x_{i-1}),$$

with a proposal process defined by $q(x_1)$ and $q(x_i \mid x_{i-1})$.
Sequential Importance Sampling (SIS)

To obtain one weighted sample:

1. Simulate $X_1^{(i)}$ from $q(x_1)$; assign a weight $\tilde{w}_1^{(i)} = p(x_1^{(i)})/q(x_1^{(i)})$.
2. For $t = 2,\ldots,n$: simulate $X_t^{(i)} \mid x_{t-1}^{(i)}$ from $q(x_t \mid x_{t-1}^{(i)})$, and set

$$\tilde{w}_t^{(i)} = \tilde{w}_{t-1}^{(i)}\, \frac{p(x_t^{(i)} \mid x_{t-1}^{(i)})}{q(x_t^{(i)} \mid x_{t-1}^{(i)})}.$$
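A minimal sketch of these two steps (an illustration with hypothetical Gaussian kernels, not from the slides): the target is a Gaussian AR(1) chain and the proposal is a Gaussian random walk, so both transition densities are available in closed form.

```python
import numpy as np
from scipy.stats import norm

def sis_one_sample(n, rng):
    """One weighted SIS sample: Gaussian AR(1) target p, random-walk proposal q."""
    p1 = lambda x: norm.pdf(x, 0.0, 1.0)                  # target initial density
    p_trans = lambda x, xp: norm.pdf(x, 0.8 * xp, 1.0)    # target transition density
    q1 = lambda x: norm.pdf(x, 0.0, 2.0)                  # proposal initial density
    q_trans = lambda x, xp: norm.pdf(x, xp, 2.0)          # proposal transition density

    x = rng.normal(0.0, 2.0)                              # step 1: simulate X_1 from q
    w = p1(x) / q1(x)
    xs = [x]
    for _ in range(2, n + 1):                             # step 2: propagate and reweight
        x_new = rng.normal(x, 2.0)
        w *= p_trans(x_new, x) / q_trans(x_new, x)
        x = x_new
        xs.append(x)
    return np.array(xs), w

rng = np.random.default_rng(1)
samples, weights = zip(*[sis_one_sample(10, rng) for _ in range(5000)])
```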
New Approach: CIS

We now derive a continuous-time importance sampling (CIS) procedure for unbiased inference for general continuous-time Markov models. We will describe the CIS algorithm for generating a single realisation. So at any time t we will have $x_t$ and $w_t$, realisations of random variables $X_t, W_t$ such that

$$E_p(f(X_t)) = E_q(f(X_t)\,W_t).$$

The former expectation is with respect to the target diffusion, the latter with respect to the CIS procedure. We will use a proposal process with tractable transition density $q(x \mid y, t)$ (and forward operator $K_x^{(1)}$).
A discrete-time SIS procedure

First consider a discrete-time SIS method aimed at inference at times $h, 2h, 3h, \ldots$

(0) Fix $x_0$; set $w_0 = 1$, and $i = 1$.
(1) Simulate $X_{ih} = x_{ih}$ from $q(x_{ih} \mid x_{(i-1)h})$.
(2) Set

$$w_i = w_{i-1}\, \frac{p(x_{ih} \mid x_{(i-1)h}, h)}{q(x_{ih} \mid x_{(i-1)h}, h)}.$$

(3) Let $i = i + 1$ and go to (1).
Problems: we cannot calculate the weights, and often the efficiency degenerates as h → 0 for fixed T.

As h → 0, where q and p are discretisations of mutually absolutely continuous diffusions, the limit is given by Girsanov's formula. We want it to work in the case where q and p are mutually singular also!
Random weight SIS

It is valid to replace the weight in the SIS procedure by a random variable whose expectation is equal to the weight. A simple way to do this here is to define

$$r(y, x, h) = 1 + \frac{1}{\lambda h}\left( \frac{p(y \mid x, h)}{q(y \mid x, h)} - 1 \right),$$

and introduce a Bernoulli random variable $U_i$ with success probability $\lambda h$. Then

$$\frac{p(y \mid x, h)}{q(y \mid x, h)} = E\big\{ (1 - U_i)\cdot 1 + U_i\, r(y, x, h) \big\}.$$
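To see why this works (a one-line check added here, not on the slide): since $U_i$ is independent of the proposed points,

$$E\big\{ (1-U_i)\cdot 1 + U_i\, r(y,x,h) \big\} = (1-\lambda h) + \lambda h\left[ 1 + \frac{1}{\lambda h}\left( \frac{p(y\mid x,h)}{q(y\mid x,h)} - 1 \right) \right] = \frac{p(y\mid x,h)}{q(y\mid x,h)},$$

so replacing the exact incremental weight by this random weight leaves expectations of the form $E(f(X_T)W_T)$ unchanged.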
Random weight SIS

Now we can have a random weight SIS algorithm:

(0) Fix $x_0$; set $w_0 = 1$, and $i = 1$.
(1) Simulate $X_{ih} = x_{ih}$ from $q(x_{ih} \mid x_{(i-1)h})$.
(2) Simulate $U_i$. If $U_i = 1$ then set $w_i = w_{i-1}\, r(x_{ih}, x_{(i-1)h}, h)$, otherwise $w_i = w_{i-1}$.
(3) Let $i = i + 1$ and go to (1).

This is a less efficient algorithm than the previous one, but it enables us to use two tricks: retrospective sampling and Rao-Blackwellisation.
Retrospective Sampling

We only need to update the weights at time-points where $U_i = 1$. At these points we need to simulate $X_{ih}, X_{(i-1)h}$ to calculate the new weights.

If j is the most recent time when $U_j = 1$, then the distribution of $X_{ih}$ is given by $q(x_{ih} \mid x_{jh}, (i-j)h)$ (assuming time-homogeneity for simplicity). Given $x_{jh}$ and $x_{ih}$, the conditional distribution of $X_{(i-1)h}$ is

$$q(x_{(i-1)h} \mid x_{jh}, x_{ih}) = \frac{q(x_{(i-1)h} \mid x_{jh}, (i-j-1)h)\; q(x_{ih} \mid x_{(i-1)h}, h)}{q(x_{ih} \mid x_{jh}, (i-j)h)}.$$
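For instance, if the proposal is d-dimensional Brownian motion (a standard special case, not spelled out on the slide), this conditional distribution is the usual Brownian-bridge density:

$$X_{(i-1)h} \mid x_{jh}, x_{ih} \;\sim\; N\!\left( x_{jh} + \frac{i-1-j}{i-j}\,(x_{ih} - x_{jh}),\; \frac{(i-1-j)\,h}{i-j}\, I_d \right),$$

so both the forward draw of $X_{ih}$ and the retrospective draw of $X_{(i-1)h}$ are exact Gaussian simulations.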
New SIS algorithm

Using these ideas we get:

(0) Fix $x_0$; set $w_0 = 1$, $j = 0$ and $i = 1$.
(1) Simulate $U_i$; if $U_i = 0$ go to (3).
(2) [$U_i = 1$] Simulate $X_{ih}$ from $q(x_{ih} \mid x_{jh}, (i-j)h)$ and $X_{(i-1)h}$ from $q(x_{(i-1)h} \mid x_{jh}, x_{ih})$. Set

$$w_i = w_j\, r(x_{ih}, x_{(i-1)h}, h),$$

and let $j = i$.
(3) Let $i = i + 1$ and go to (1).

If we stop the SIS at a time point t, then $X_t$ can be drawn from $q(x_t \mid x_{jh}, t - jh)$; and the weight is $w_j$.
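A toy sketch of this algorithm (an illustration, not the authors' code): to make the weight r computable and the answer checkable, take the target p to be a scalar Ornstein–Uhlenbeck transition density (tractable) and the proposal q to be Brownian motion, with the retrospective draw of $X_{(i-1)h}$ given by the Brownian-bridge formula above. The parameter values are arbitrary.

```python
import numpy as np
from scipy.stats import norm

# Target: OU, dX = -theta X dt + dB, so p(y|x,t) = N(x e^{-theta t}, (1 - e^{-2 theta t})/(2 theta)).
theta = 0.5
p = lambda y, x, t: norm.pdf(y, x * np.exp(-theta * t),
                             np.sqrt((1 - np.exp(-2 * theta * t)) / (2 * theta)))
# Proposal: Brownian motion, q(y|x,t) = N(x, t).
q = lambda y, x, t: norm.pdf(y, x, np.sqrt(t))

def r(y, x, h, lam):
    return 1.0 + (p(y, x, h) / q(y, x, h) - 1.0) / (lam * h)

def retrospective_sis(x0, T, h, lam, rng):
    """Retrospective random-weight SIS: the path is only simulated where U_i = 1."""
    n = int(T / h)
    j, xj, w = 0, x0, 1.0
    for i in range(1, n + 1):
        if rng.uniform() < lam * h:                               # U_i = 1
            x_i = xj + np.sqrt((i - j) * h) * rng.standard_normal()   # X_{ih} ~ q(.|x_{jh},(i-j)h)
            frac = (i - 1 - j) / (i - j)                          # Brownian bridge for X_{(i-1)h}
            x_im1 = xj + frac * (x_i - xj) + np.sqrt(frac * h) * rng.standard_normal()
            w *= r(x_i, x_im1, h, lam)
            j, xj = i, x_i
        # else: U_i = 0, weight unchanged, nothing simulated
    xT = xj + np.sqrt((n - j) * h) * rng.standard_normal()        # final draw from the proposal
    return xT, w

rng = np.random.default_rng(2)
draws = [retrospective_sis(x0=1.0, T=1.0, h=0.01, lam=2.0, rng=rng) for _ in range(20000)]
xs, ws = map(np.array, zip(*draws))
est = np.mean(xs * ws)   # noisy Monte Carlo estimate of E_p(X_T) = x0 exp(-theta T) ~ 0.607
```

Note that the weights can be negative here, consistent with the signed measure-valued process described earlier; the estimator remains unbiased.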
Example

[Figure: trace plots of the proposal path X_t (top panel) and the weight W_t (bottom panel) against time, 0 to 100.]
Rao-Blackwellisation

At time ih, the incremental weight depends on $x_{ih}$ and $x_{(i-1)h}$. Rather than simulating both, we simulate $x_{ih}$ and use an expected incremental weight

$$\rho_h(x_{ih}, x_{jh}, (i-j)h) = E\big[ r(x_{ih}, X_{(i-1)h}, h) \mid x_{jh} \big],$$

with expectation with respect to the conditional distribution of $X_{(i-1)h}$ given $x_{jh}, x_{ih}$ under the proposal:

$$E\big[ r(x_{ih}, X_{(i-1)h}, h) \mid x_{jh} \big] = \int r(x_{ih}, x_{(i-1)h}, h)\; q(x_{(i-1)h} \mid x_{jh}, x_{ih})\, dx_{(i-1)h}.$$
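The justification (a standard argument, added here for completeness) is the tower property together with the conditional variance decomposition: replacing r by its conditional expectation $\rho_h$ leaves the weight unbiased and cannot increase its variance,

$$E[\rho_h] = E\big[ E[r \mid x_{jh}, x_{ih}] \big] = E[r], \qquad
\operatorname{Var}(r) = \operatorname{Var}(\rho_h) + E\big[ \operatorname{Var}(r \mid x_{jh}, x_{ih}) \big] \ge \operatorname{Var}(\rho_h).$$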