The 7th International Symposium on Data Assimilation (ISDA2019) - PowerPoint PPT Presentation

The 7th International Symposium on Data Assimilation (ISDA2019) Efficient Implementations of Ensemble Based Methods In Sequential Data Assimilation: Accounting for Localization Elias D. Ni˜ no-Ruiz Applied Math and Computer Science Laboratory (AML-CS) Department of Computer Science Universidad del Norte BAQ 080001, Colombia E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Outline I Data Assimilation Components Ensemble Based Methods The Stochastic Ensemble Kalman Filter Localization Methods Precision Matrix Localization Efficient EnKF-MC Shrinkage Covariance Matrix Estimation Ensemble Kalman filter based on RBLW Efficient Implementation of the RBLW EnKF-RBLW EnKF-MC and EnKF-RBLW with the SPEEDY Model Accuracy of the EnKF-MC Local Estimation of B − 1 Accuracy of the EnKF-RBLW Parallel Implementations of Ensemble Based Methods Recent References References E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Components in DA [BS12] I � 10 8 � ◮ We want to estimate x ∗ ∈ ❘ n × 1 . n ∼ O . ◮ Imperfect numerical model: x next = M t current → t next ( x current ) , where x ∈ ❘ n × 1 . ◮ Noisy observations: y = H ( x ) + ǫ ∈ ❘ m × 1 , � 10 6 � where H : ❘ n → ❘ m and ǫ ∼ N ( 0 m , R ). m ∼ O . ◮ Prior estimate x b ∈ ❘ n × 1 with errors following N ( 0 , B ). E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Components in DA [BS12] II 0.5 0.6 20 20 0.7 40 0.8 40 0.9 60 60 1 1.1 80 80 1.2 100 100 1.3 1.4 120 120 1.5 20 40 60 80 100 120 2000 4000 6000 8000 10000 12000 14000 16000 20 40 60 80 100 120 (c) y = H · x ∗ + ǫ (a) x ∗ (b) H 20 40 60 80 100 120 20 40 60 80 100 120 (d) x b E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Components in DA [BS12] III ◮ By Bayes’ Theorem we know that: P ( x | y ) ∝ P ( x ) · L ( x | y ) where � � � � x − x b � − 1 2 � � P ( x ) ∝ exp 2 · � B − 1 � � − 1 2 · � y − H · x � 2 L ( x | y ) ∝ exp R − 1 and therefore, x a = arg max P ( x | y ) , x E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Components in DA [BS12] IV ◮ It can be easily shown that: � � x b + A · H T · R − 1 · d = A · B − 1 · x b + H T · R − 1 · y x a = � R + H · B · H T � − 1 x b + B · H T · · d = � � − 1 ∈ ❘ n × n , and B − 1 + H T · R − 1 · H where A = d = y − H · x b ∈ ❘ m × 1 . ◮ Posterior distribution: x ∼ N ( x a , A ) . ◮ How do we estimate x b and B ?. E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Ensemble Based Methods ◮ We can make use of an ensemble of model realizations: � x b [1] , x b [2] , . . . , x b [ N ] � X b = ∈ ❘ n × N ◮ Empirical moments of the ensemble: 1 x b ≈ x b N · X b · 1 N ∈ ❘ n × n , = 1 N − 1 · δ X · δ X T , B ≈ P b = and δ X = X b − x b · 1 T N ∈ ❘ n × N . E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

The Lorenz 96 Model - Toy Model I ◮ The Lorenz 96 model:   ( x 2 − x n − 1 ) · x n − x 1 + F for i = 1 ,  dx j dt = ( x i +1 − x i − 2 ) · x i − 1 − x i + F for 2 ≤ i ≤ n − 1 , (1)   ( x 1 − x n − 2 ) · x n − 1 − x n + F for i = n , where x i stands for the i -th model component, for 1 ≤ i ≤ n , usually n = 40. ◮ Each model component stands for a particle which fluctuates in the atmosphere. ◮ Exhibits chaotic behaviour when the external force F is set to 8. E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

The Lorenz 96 Model - Toy Model II 15 15 15 10 10 10 5 5 5 0 0 0 -5 -5 -5 -10 -10 -10 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 (e) x 5 (f) x 10 (g) x 20 15 15 15 10 10 10 5 5 5 0 0 0 -5 -5 -5 -10 -10 -10 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 (h) x 30 (i) x 35 (j) x 40 E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Estimation of B via N = 10 5 . 5 10 15 20 25 30 35 40 5 10 15 20 25 30 35 40 (a) Structure (b) Surf Figure: Estimation of B via N = 10 5 . E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

The Stochastic Ensemble Kalman Filter [Eve03, Eve06] I ◮ Sequential Monte Carlo method for parameter and state estimation. ◮ Analysis ensemble (posterior ensemble): � � X b + P b · H T · R + H · P b · H X a · ∆Y = X b + P a · H T · R − 1 · ∆Y ∈ ❘ n × N , X a = � � � P b � − 1 P a · H T · R − 1 · Y s + X a · X b ∈ ❘ n × N , = � P b � − 1 � − 1 � H T · R − 1 · H + where P a = ∈ ❘ n × n , and the e -th column of ∆Y ∈ ❘ m × N and Y s ∈ ❘ n × N are: � x b [ e ] � d [ e ] = y + ǫ [ e ] − H ∈ ❘ m × 1 , and y s [ e ] = y + ǫ [ e ] , respectively, for 1 ≤ e ≤ N , and ǫ [ e ] ∼ N ( 0 m , R ). E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

L − 2 Error Norms in Time, N = 10 5 2 1.5 1 1 0 0.5 -1 0 -0.5 -2 0 5 10 15 0 5 10 15 (a) p = 50% (b) p = 100% Figure: L − 2 error norms in time, N = 10 5 . But too many samples!!! In practice, model realizations are constrained by the hundreds... E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

L − 2 error norms in time, N = 10 1.5 1.5 1.48 1.46 1.45 1.44 1.4 1.42 1.4 1.38 1.35 0 5 10 15 0 5 10 15 (a) p = 50% (b) p = 100% Figure: L − 2 error norms in time, N = 10. What is going on here? ... E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Estimation of B via N = 10 5 10 15 20 25 30 35 40 5 10 15 20 25 30 35 40 (a) Structure (b) Surf Figure: Estimation of B via N = 10. What can we do? Localization methods... E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Localization Methods ◮ Avoid the impact of spurious correlations. ◮ Increase the rank of P b . ◮ Three different flavors: 1. Covariance Matrix Localization. (Precision Localization) [NRSD15, NRSD17, NR17, NRSD18]. 2. Spatial Domain Localization [OHS + 04]. 3. Observation Localization [AND07, AND09]. E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Precision Matrix Localization I ◮ Component-wise products are prohibitive in high-dimensional spaces. ◮ When two model components are conditional independent, their corresponding entry in the precision covariance matrix is zero . (a) r = 0 (b) r = 1 (c) r = 3 E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Precision Matrix Localization II ◮ Modified Cholesky Decomposition [BL + 08]: B − 1 = T T · D − 1 · T � where the non-zero elements from T ∈ ❘ n × n are given by fitting models of the form: � x [ i ] = x [ q ] · {− T } i , q + ǫ [ i ] ∈ ❘ N × 1 , for 1 ≤ i ≤ n , q ∈ P ( i , r ) � ǫ [ i ] � and { D } i , i = var . (a) N (6 , 1) (b) P (6 , 1) E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Precision Matrix Localization III ◮ An estimate: 0 0 5 5 10 10 15 15 20 20 25 25 30 30 35 35 40 40 0 10 20 30 40 0 10 20 30 40 nz = 160 nz = 40 (a) P b (b) T (c) D 0 5 10 15 20 25 30 35 40 0 10 20 30 40 nz = 298 B − 1 Str (d) � (e) � B − 1 (f) � B Results: E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Precision Matrix Localization IV 2 2 2 1 1 1 0 0 0 -1 -1 -1 -2 -2 -2 0 5 10 15 0 5 10 15 0 5 10 15 (a) N = 30, r = 1, (b) N = 30, r = 3, (c) N = 30, r = 5, p = 100% p = 100% p = 100% 1.4 1.5 1.4 1.2 1 1.2 1 0.5 0.8 1 0 0.6 0.8 -0.5 0.4 0.2 -1 0.6 0 5 10 15 0 5 10 15 0 5 10 15 (d) N = 30, r = 1, (e) N = 30, r = 3, (f) N = 30, r = 5, p = 50% p = 50% p = 50% E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Efficient EnKF-MC I Consider � � − 1 X b + B − 1 + H T · R − 1 / 2 · R − 1 / 2 · H · H T · R − 1 · ∆Y � X a = � B − 1 + Z · Z T � − 1 · H T · R − 1 · ∆Y X b + � =   − 1 � z [ j ] � T m � X b + B − 1 + z [ j ] · · H T · R − 1 · ∆Y , �  = j =1 z [ j ] ∈ ❘ n × 1 is the j -th column of Z = H T · R − 1 / 2 ∈ ❘ n × m . E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Efficient EnKF-MC II � T (0) � T � T (0) � · D (0) · = T T · D · T = � B − 1 , A (0) = � z [1] � T � T (1) � T � T (1) � A (0) + z [1] · · D (1) · A (1) = = , � z [2] � T � T (2) � T � T (2) � A (1) + z [2] · · D (2) · A (2) = = , . . . � z [ m ] � T A ( m − 1) + z [ m ] · A ( m ) = � T ( m ) � T � T ( m ) � · D ( m ) · T T · � T = A − 1 , = � D · � = E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

Efficient EnKF-MC III at any intermediate step j , for 1 ≤ j ≤ m , we have, � T ( j − 1) � T � T ( j − 1) � � z [ j ] � T · D ( j − 1) · + z [ j ] · A ( j ) = � p ( j ) � T � � T ( j − 1) � T � � T ( j − 1) � D ( j − 1) + p ( j ) · · · = , � T ( j − 1) � T · p ( j ) = z [ j ] ∈ ❘ n × 1 . By computing the Cholesky where decomposition of, � p ( j ) � T � T ( j − 1) � T � T ( j − 1) � D ( j − 1) + p ( j ) · · D ( j ) · � � = , therefore, � T ( j − 1) · T ( j − 1) � T � T ( j − 1) · T ( j − 1) � · D ( j ) · � � A ( j ) = � T ( j ) � T � T ( j ) � · D ( j ) · = , E. Ni˜ no-Ruiz, ISDA2019 - RIKEN R-CCS

The 7th International Symposium on Data Assimilation (ISDA2019) - PowerPoint PPT Presentation

The 7th International Symposium on Data Assimilation (ISDA2019) Efficient Implementations of Ensemble Based Methods In Sequential Data Assimilation: Accounting for Localization Elias D. Ni no-Ruiz Applied Math and Computer Science Laboratory

Assimilation of 3D Radar Data and Derived Objects on the Convective Scale with an Ensemble-based

Assimilation of AIRS Data at NRL Assimilation of AIRS Data at NRL Benjamin Ruston, Clay

The Joint Effort for Data assimilation Integration (JEDI) OOPS Structure, Data flow Joint Center

Soil moisture data assimilation: Soil moisture data assimilation: Error modeling, adaptive

Use of AIRS data in the Joint Center for Satellite Data Assimilation Lars Peter Riishojgaard

Functional Data Assimilation with White-Noise Data Error and Applications to Assimilation of

Introduction to Data Assimilation Ma elle Nodet team.inria.fr/moise/maelle Universit e de

Data assimilation and observing Data assimilation and observing strategies working group

or Flow-Adaptive Background-Error-Covariance Modeling for Data Assimilation using Information

Carlos Geijo 6th WMO Symposium on Data Assimilation. Maryland 7-11 October 2013 INTRODUCTION

7th GRADE FORECASTING The process by which you choose your elective classes for 7th grade

Assimilation of Advanced Assimilation of Advanced InfraRed nfraRed Sounder (AIRS) observations

The effects of data selection on The effects of data selection on the assimilation of AIRS data

A Hybrid Variational-Ensemble Data Assimilation Method with an Implicit Optimal Hessian

Coupled 4D-variational physical and biological data assimilation in the California Current System

Development of an Ensemble-Based Data Assimilation System with a Coupled Atmosphere Ocean GCM

Utah DOT Randy Park Peer Exchange - Integra/ng Risk Management

Federal Tax Legislative Update Here We Go Again! Will Tax Reform Finally Happen? Hank Gutman

Advanced Anti-Deobfuscation Bjorn De Sutter ISSISP 2017 Paris 1 About me Research

Second Quarter 2019 Earnings Presentation Tuesday, August 13, 2019 Agenda Prepared Remarks

Data Assimilation for Numerical Weather Prediction [NWP] Project Ahmed Attia Statistical and

Joint Effort for Data assimilation Integration Why OOPS/JEDI? Joint Center for Satellite Data

assimilation to model te Amsterdam urban climate 18 July 2018, S. Koopmans , N.E. Theeuwes, R. van

Time parallelisation for optimal control and data assimilation Julien Salomon Joint work with M.