sequential monte carlo methods for state and and
play

Sequential Monte Carlo Methods for State and and Parameter - PDF document

Sequential Monte Carlo Methods for State and and Parameter Estimation (with application to ocean biogeochemistry) Michael Dowd Dept of Mathematics and Statistics Dalhousie University Halifax, N.S., Canada BIRS Data Assimilation Workshop,


  1. Sequential Monte Carlo Methods for State and and Parameter Estimation (with application to ocean biogeochemistry) Michael Dowd Dept of Mathematics and Statistics Dalhousie University Halifax, N.S., Canada BIRS Data Assimilation Workshop, February 3-8, 2008 Outline 1. Motivation: observations and dynamic models for ocean biogeochemistry 2. The state space model for nonlinear and nonGaussian systems: filtering and smoothing 3. Sequential Monte Carlo approaches: resampling/bootstrap and MCMC 4. Static parameter estimation for stochastic dynamics: likelihood and state augmentation

  2. Statistical Estimation for Nonlinear NonGaussian Dynamic Systems Approaches … • Statistical emulators/ computer experiments for studying large scale dynamical model and perhaps DA. • Functional data analysis applied to estimating differential equations • Hierarchical Bayes and Markov Chain Monte Carlo • Sequential Monte Carlo approaches* Data

  3. Ocean Biogeochemical Time Series • Ocean Observatory at Lunenburg, Canada obs � gamma( � t , � ) P t obs � lognormal( � t , f ( � t )) N t Long Term Ocean Time - Space Series Bermuda Atlantic Time Series: • 15 years, monthly cruises • measure depth profiles of biogeochemical variables • use CTD and bottle samples Temperature Chlorophyll

  4. Dynamic Models A Physical - Biogeochemical Model dP { } � P � � P P � g P ; k P { } IZ + n P dt = g N ; k N dZ { } IZ � � Z Z + n Z dt = � g P ; k P dN { } IZ � g N ; k N { } � P + �� Z Z + n N dt = � D + � g P ; k P Physics dD { } IZ + (1 � � ) � Z Z + n D dt = � � D + � P P + (1 � � � � ) g P ; k P � � � X � t � � � X Sample Output � = SMS ( X ) � � K � z � t � z � � S � t � � � � S � � � � � � u u � � � � � = � � = F S K f v F K � � t � u � � � z � t � z � t z z � � � � � v � t � � � v � T � t � � � T � + f u = F � � = F � � � z K t K � � v � t � T � z � z � z Biogeochemistry where turbulence sub-model computes K t and K t ’ from u , v , T and S

  5. Incorporating Stochasticity (stochastic photosynthetic parameter + system noise) O-D ODE based PZND model • Frequent transitioning across bifurcation • aperiodic/episodic • dynamical dependencies maintained • note (high freq) forcing versus (low freq) response Describe ensemble properties as a distribution The estimation problem for system state and parameters …

  6. The State Space Model dynamics equation measurement equation = y h x e ( , ) t t t t x t = f ( x t � 1 , � , n t ) y t = h ( x t , � , � t ) (measurement equation) or or y t : measurements h t : obs operator y t � p ( y t | x t , � ) x t � p ( x t | x t � 1 , � ) e t : measurement error • Given Y � = ( � y 1 ,...., � y � � ) � want to jointly estimate the state x t and static parameters � and � General Case (Nonlinear Stochastic Dynamics): Filtering and Smoothing for State Estimation* Filtering: � p ( x t | Y t , � , � ) � p ( y t | x t , � ) p ( x t | x t � 1 , � ) p ( x t � 1 | Y t � 1 , � , � ) dx t � 1 for t=1, …, T , given p(x 0 ) Smoothing: T T p ( x 1: T | Y T , � , � ) = p ( x 0 ) � p ( x t | x t � 1 , � ) � p ( y t | x t , � ) t = 1 t = 1 � nonlinear, non-Gaussian case can be treated with sampling based solutions (via sequential MC methods) * treat parameter estimation later on …

  7. Sequential Monte Carlo Approaches 1. Stochastic dynamic prediction : numerical integration of stochastic dynamic system (generate forecast ensemble) = f ( x t � 1| t � 1 i = 1,...., n ( i ) ( i ) ( i ) , � ), x t | t � 1 , n t { } � p ( x t | Y t � 1 , � , � ) ( i ) x t | t � 1 � Ensemble must cover the part of state space with non- negligible values of the predictive density 2. Bayesian blending of measurements and numerical � model predictions (e.g. resampling, MCMC). Sequential Bayesian Monte Carlo { } � p ( x t | Y t � 1 , � , � ) { } � p ( x t | Y t , � , � ) ( i ) ( i ) x t | t � 1 x t | t ( i ) = w t � 1 ( i ) p ( y t | x t | t � 1 ( i ) ), i = 1,...., n (a) SIR - compute: w t { } � x t | t { } � p ( x t | Y t ) ( i ) , w t ( i ) ( i ) - weighted resample of x t | t � 1 or (b) Sequential Metropolis Hastings MCMC (c) Resample - Move (SIR/MCMC) (d) Ensemble/unscented Kalman filter (approximate) + …

  8. Sequential Metropolis-Hastings { } � p ( x t � 1 | Y t � 1 , � , � ), i = 1,...., n ( i ) Basic Idea: Given x t � 1| t � 1 1. Generate candidate from predictive density: loop over k � p ( x t | Y t � 1 , � , � ) = f ( x t � 1| t � 1 * * * * , � ) x t | t � 1 via: x t | t � 1 , n t 2. Evaluate acceptance probability � � , � , � ) * � = min 1, p ( y t | x t | t � 1 * ( k ) � , choose x t | t � 1 or x t | t � ( k ) , � , � ) � � p ( y t | x t | t { } � p ( x t | Y t , � , � ) ( i ) x t | t sample from target: • Flexible and configurable, e.g adaptive ensemble • EFFICIENT PROPOSALS ARE KEY, e.g prior, or from EnKF? Filter State Estimates Time series of median and percentiles Distributions

  9. Example SIR Results from Physical Biogeochemical Model Comparison of SMC Methods : Convergence of Distributions p ( x t | y 1: t )log p ( x t | y 1: t ) � <K-L divergence> = p ( x t | y 1: t ) dx t � Figure : Convergence to “exact” solution for different SMC methods

  10. Convergence of Moments (M-H MCMC) Parameter Estimation via Likelihood The likelihood arising from the state space model* is T T � � � L ( � | Y T ) = p ( Y T | � ) = p ( y t | Y t � 1 , � ) = p ( y t | x t , � ) p ( x t | Y t � 1 , � ) dx t t = 1 t = 1 From sequential MC filter we can compute predictive density ( i ) } ~ p ( x t | Y t � 1 , � ) { x t | t � 1 and so compute the likelihood as ( ) T L ( � | Y T ) � 1 � ( i ) , � n � p ( y t | x t | t � 1 i = 1 n t = 1 *assume � is given, and suppress the explicit dependence)

  11. Distributions for Parameters Likelihood Posterior: using prior info Parameter identifiability issues priors ‘focus’ the likelihood (sample based) likelihood surface is ‘rough’ � challenge for optimizers (stochastic gradients) Parameter Estimation via State Augmentation � � x t = x t � � � Idea : Append state to include parameters, � t � � � t = � t � 1 Specify p( � 0 ) and allow parameter to evolve as Choose � T as estimate for parameter For practical implementation with finite sample, we must { � 0 ( i ) } ~ p ( � ) (1) Specify initial ensemble (including dependence structure between the parameters) { � t ( i ) } (2) At each t , introduce smoothed bootstrap of (with dispersion correction) to generate diversity in parameter ensemble, while maintaining distributional properties. Easy and seems to work in practice, little theoretical guidance on convergence. Does not seem to work well with EnKF

  12. State Augmentation Example Parameter values from Trace plot of one realization 2000 realizations State Reconstruction by Smoother • Fixed interval smoother using optimal parameters. • Uses forward M-H MCMC filter • Smoother realizations provided by backwards sweep smoother algorithm of Godsill et al (2004)

  13. Remarks and Outstanding Issues on Fully Bayesian DA via Sequential Monte Carlo 1. Sequential MC approaches allow for state and parameter estimation in nonlinear nonGaussian dynamic systems. Wide variety available (bootstrap or MCMC) and easy to implement, but computationally …. 2. Static parameter estimation in SDEs outstanding statistical issue. Likelihood (via predictive density). State augmentation (via filter density). EM algorithm (via smoother density) 3. Effective stochastic simulation (integration) and specification of model errors a key feature. 4. Adaptation for (large dimension) dynamical systems! � need small ensembles (100-1000) to represent large dimensional state space. Efficient proposal distribution is paramount , e.g use information flow via dynamics. 5. Methods for computationally efficient smoothing also needed. 6. Information based metrics for assessing improvements and comparing approaches

Recommend


More recommend