Efficient Matrix-Free Ensemble Kalman Filter Implementations: Accounting for Localization

Elias David Niño Ruiz; Rolando Beltrán Arrieta; Alfonso Manuel
Mancilla Herrera

doi:10.5772/intechopen.72465

Abstract

This chapter discusses efficient and practical matrix-free implementations of the ensemble Kalman filter (EnKF) in order to account for localization during the assimilation of observations. In the EnKF context, an ensemble of model realizations is utilized in order to estimate the moments of its underlying error distribution. Since ensemble members come at high computational costs (owing to current operational model resolutions) ensemble sizes are constrained by the hundreds while, typically, their error distributions range in the order of millions. This induces spurious correlations in estimates of prior error correlations when these are approximated via the ensemble covariance matrix. Localization methods are commonly utilized in order to counteract this effect. EnKF implementations in this context are based on a modified Cholesky decomposition. Different flavours of Cholesky-based filters are discussed in this chapter. Furthermore, the computational effort in all formulations is linear with regard to model resolutions. Experimental tests are performed making use of the Lorenz 96 model. The results reveal that, in terms of root-mean-square-errors, all formulations perform equivalently.

Keywords

ensemble Kalman filter
modified Cholesky decomposition
sampling methods

Author Information

Show +

Elias David Niño Ruiz*
- Applied Math and Computer Science Laboratory, Department of Computer Science, Universidad Del Norte, Barranquilla, Colombia
Rolando Beltrán Arrieta
- Applied Math and Computer Science Laboratory, Department of Computer Science, Universidad Del Norte, Barranquilla, Colombia
Alfonso Manuel Mancilla Herrera
- Applied Math and Computer Science Laboratory, Department of Computer Science, Universidad Del Norte, Barranquilla, Colombia

*Address all correspondence to: enino@uninorte.edu.co

1. Introduction

The ensemble Kalman filter (EnKF) is a sequential Monte Carlo method for parameter and state estimation in highly nonlinear models. The popularity of the EnKF owes to its simple formulation and relatively easy implementation. In the EnKF, an ensemble of model realizations is utilized in order to, among other things, estimate the moments of the prior error distribution (forecast distribution). During this estimation and the assimilation of observations, many challenges are present in practice. Some of them are addressed in this chapter: the first one is related to the proper estimation of background error correlations, which has a high impact on the quality of the analysis corrections while the last one is related to the proper estimation of the posterior ensemble when the observation operator is nonlinear. In all cases, background error correlations are estimated based on the Bickel and Levina estimator [1].

2. Preliminaries

We want to estimate the state of a dynamical system x ∗ ∈ R n × 1 , which (approximately) evolves according to some (imperfect) numerical model operators x k ∗ = M t k − 1 → t k x k − 1 ∗ , for instance, a model which mimics the behavior of the atmosphere and/or the ocean where n is the model dimension (typically associated with the model resolution). The estimation process is based on two sources of information: a prior estimate of x ∗ , x b = x ∗ + ξ with ξ ∼ N 0 n B , and a noisy observation y = H x ∗ + ε with ε ∼ N 0 m R , where m is the number of observed model components, R m × m is the estimated data error covariance matrix, H : R n × 1 → R m × 1 is the (nonlinear) observation operator that maps the information from the model domain to the observation space, 0 n is the nth dimensional vector whose components are all zeros. In practice, m ≪ n or m < n . The assimilation process can be performed by using sequential data assimilation methods, and one of the most widely used methods is the ensemble Kalman filter, which is detailed in the following paragraphs.

2.1. The ensemble Kalman filter

In the ensemble Kalman filter (EnKF) [2], an ensemble of model realizations

X b = x b 1 x b 2 … x b N ∈ R n × N , E1

is utilized in order to estimate the moments of the background error distribution,

x b i ∼ N x b B , E2

via the empirical moments of the ensemble (1) and therefore,

x b ≈ x ¯ b = 1 N ⋅ ∑ i = 1 N x b i ∈ R n × 1 , E3

and

B ≈ P b = 1 N − 1 ⋅ Δ X b ⋅ Δ X b T ∈ R n × n , E4

where N is the ensemble size, x b i ∈ R n × 1 is the ith ensemble member, for 1 ≤ i ≤ N , x b ∈ R n × 1 is well known as the background state, B ∈ R n × n stands for the background error covariance matrix, x ¯ b is the ensemble mean, and P b is the ensemble covariance matrix. Likewise, the matrix of member deviations Δ X b ∈ R n × N reads,

Δ X b = X b − x ¯ b ⋅ 1 N T ∈ R n × N . E5

A variety of EnKF implementations have been proposed in the current literature, most of them rely on basic assumptions over the moments of two probability distributions: the background (prior) and the analysis (posterior) distributions. In most of EnKF formulations, normal assumptions are done over both error distributions.

In general, depending on how the assimilation process is performed, the EnKF implementation will fall in one of two categories: stochastic filter [3] or deterministic filter [4]. In the context of stochastic filters, for instance, the assimilation of observations can be performed by using any of the next formulations,

X a = A ⋅ P b − 1 ⋅ X b + H T ⋅ R − 1 ⋅ Y s ∈ R n × N , E6

X a = X b + A ⋅ H T ⋅ R − 1 ⋅ Δ Y ∈ R n × N , E7

and

X a = X b + P b ⋅ H T ⋅ R − 1 + H ⋅ P b ⋅ H T − 1 ⋅ Δ Y ∈ R n × N , E8

where the analysis covariance matrix A ∈ R n × n is given by A = P b − 1 + H T ⋅ R − 1 ⋅ H − 1 , the matrix of innovations (on observations) is denoted by Δ Y = Y s − H ⋅ X b ∈ R n × N , and the matrix of perturbed observations reads Y s = y ⋅ 1 N T + E ∈ R m × N , where the columns of E ∈ R m × N are formed by samples from a normal distribution with moments N 0 R . In practice, one of the most utilized formulations is given by (8).

2.2. Cholesky and modified Cholesky decomposition

The forms of the Cholesky and the modified Cholesky decomposition were discussed by Golub and Van Loan in [5]. If A ∈ R n × n is symmetric positive definite, then there exist a unique lower triangular L ∈ R n × n with positive diagonal entries such that A = L ⋅ L T . If all the leading principal submatrices of A ∈ R n × n are non-singular, then there exist unique lower triangular matrices L and M and a unique diagonal matrix D = diag d 1 d 2 … d n such that A = L ⋅ D ⋅ M T . If A is symmetric, then L = M and A = L ⋅ D ⋅ L T . Golub and Van Loan provide proofs of those decompositions, as well as various ways to compute them. We can use the Cholesky factor of a covariance matrix to quickly solve linear systems that involve a covariance matrix. Note that if we compute the Cholesky factorization and solve the triangular system L ⋅ y = b and L T ⋅ x = y , then

A ⋅ x = L ⋅ L T ⋅ x = L ⋅ L T ⋅ x = L ⋅ y = b . E9

For a symmetric matrix A ∈ R n × n , the modified Cholesky decomposition involves finding a non-negative diagonal matrix E such that A + E is positive definite. In particular, E = 0 whenever A is positive definite, otherwise E should be sufficiently small. This allows applying the usual Cholesky factorization to A + E , i.e. finding matrices L , D ∈ R n × n such that A + E = L ⋅ D ⋅ L T .

Recall Δ X b = X b − x ¯ b ⊗ 1 N T ∈ R n × N . Thus, Δ x b i ∼ N 0 n B for 1 ≤ i ≤ n . Let x b i ∈ R N × 1 , the vector holding the ith row across all columns of Δ X b for 1 ≤ i ≤ n . Bickel and Levina in [1] discussed a modified Cholesky decomposition for the estimation of precision covariance matrix, and this allows in our context to estimate of B − 1 by fitting regressions of the form:

x b i = ∑ j = 1 i − 1 x b j ⋅ β i , j + ξ i ∈ R N × 1 , E10

then, L ij = − β i , j , for 1 ≤ i < j ≤ n .

3. Filters based on modified Cholesky decomposition

3.1. Ensemble Kalman filter based on modified Cholesky

In the ensemble Kalman filter based on a modified Cholesky decomposition (EnKF-MC) [6], the analysis ensemble is estimated by using the following equations,

X a = X b + B − 1 + H T R − 1 ⋅ H − 1 ⋅ H T ⋅ R − 1 ⋅ Δ Y , E11

and

Δ Y = Y S − H . X b ∈ R n × N . E12

In this context, error correlations are estimated via a modified Cholesky decomposition. This provides an estimate of the inverse background error covariance matrix of the form,

B − 1 ≈ B ̂ − 1 = L T ⋅ D − 1 ⋅ L ∈ R n × n , E13

or equivalently

B ≈ B ̂ = L − 1 ⋅ D ⋅ L − T ∈ R n × n , E14

where L ∈ R n × n is a lower-triangular matrix whose diagonal elements are all ones, and D ∈ R n × n is a diagonal matrix. Even more, when only local effects are considered during the estimation of B ̂ − 1 , sparse estimators of the precision background can be obtained. Furthermore, the matrix L can be sparse with only a few non-zero elements per row. Typically, the number of non-zero elements depends on the radius of influence during the estimation of background error correlations. For instance, in the one-dimensional case, the radius of influence denotes the maximum number of non-zero elements, per row, in L . The EnKF-MC is then obtained by plugging in the estimator (3) in (6). Given the structure of the Cholesky factors, the EnKF-MC can be seen as a matrix-free implementation of the EnKF.

Recall that the precision analysis covariance matrix reads,

A − 1 = B − 1 + H T ⋅ R − 1 ⋅ H ⋅ R n × n , E15

moreover, since H T ⋅ R − 1 ⋅ H ∈ R n × n can be written as a sum of m rank-one matrices, the factors (3) can be updated in order to obtain an estimate of the inverse analysis covariance matrix. The next section discussed such approximation in detail.

3.1.1. Posterior ensemble Kalman filter stochastic (PEnKF-S)

In the stochastic posterior ensemble Kalman filter (PEnKF-S) [7, 8], we want to estimate the moments of the analysis distribution,

x ∼ N x a A , E16

based on the background ensemble X b , where x a is the analysis state and A ∈ R n × n is the analysis covariance matrix. Consider the estimate of the inverse background error covariance matrix Eq. (3), the precision analysis covariance matrix Eq. (4) can be approximated as follows:

A − 1 ≈ A ̂ − 1 = B ̂ − 1 + X ⋅ X T , E17

where X = H T ⋅ R − 1 2 ∈ R n × m . The matrix (17) can be written as follows:

A ̂ − 1 = L T ⋅ D − 1 ⋅ L + ∑ i = 1 m x i ⋅ x i T ,

where x i denotes the ith column of matrix X , for 1 ≤ i ≤ m . Consider the sequence of factor updates,

L i T ⋅ D i ⋅ L i = L i − 1 T ⋅ D i − 1 ⋅ L i − 1 + x i ⋅ x i T L i T ⋅ D i ⋅ L i = L i − 1 T ⋅ D i − 1 + p i ⋅ p i T ⋅ L i − 1 L i T ⋅ D i ⋅ L i = L ˜ i − 1 ⋅ L i − 1 T ⋅ D ˜ i − 1 ⋅ L ˜ i − 1 ⋅ L i − 1 ,

where L i − 1 ⋅ p i = x i ∈ R n × 1 , for 1 ≤ i ≤ m , B ̂ − 1 = L 0 T ⋅ D 0 ⋅ L 0 and

D i + p i ⋅ p i T = L ˜ i T ⋅ D ˜ i ⋅ L ˜ i ∈ R n × n , E18

We can use of Dolittle’s method in order to compute the factors D ˜ i and L ˜ i in (9), and it is enough to note that,

1 l ˜ 21 l ˜ 31 ⋯ l ˜ n 1 0 1 l ˜ 32 ⋯ l ˜ n 2 0 0 1 ⋯ l ˜ n 3 ⋮ ⋮ ⋮ ⋱ ⋮ 0 0 0 ⋯ 1 ︸ L ˜ i T d ˜ 1 0 0 0 0 0 d ˜ 2 0 0 0 0 0 d ˜ 3 0 0 ⋮ ⋮ ⋮ ⋱ 0 0 0 0 0 d ˜ n ︸ D ˜ i 1 0 0 … 0 l ˜ 21 1 0 … 0 l ˜ 31 l ˜ 21 1 … 0 ⋮ ⋮ ⋮ ⋱ 0 l ˜ n 1 l ˜ n 2 l ˜ n 3 … 1 ︸ L ˜ i = d 1 + p 1 2 p 1 ⋅ p 2 p 1 ⋅ p 3 … p 1 ⋅ p n p 2 ⋅ p 1 d 2 + p 2 2 p 2 ⋅ p 3 … p 2 ⋅ p n p 3 ⋅ p 1 p 3 ⋅ p 2 d 3 + p 3 2 … p 3 ⋅ p n ⋮ ⋮ ⋮ ⋱ ⋮ p n ⋅ p 1 p n ⋅ p 2 p n ⋅ p 3 … d n + p n 2 ︸ D i + p i ⋅ p i T

After some math simplifications, the next equations are obtained,

d ˜ k = p k 2 + d k − ∑ q = k + 1 n d ˜ q ⋅ l ˜ qi 2 , E19

and

l ˜ kj = 1 d ˜ k ⋅ p k ⋅ p j − ∑ q = k + 1 n d ˜ q ⋅ l ˜ qi ⋅ l ˜ qj , E20

for 1 ≤ k < n and 1 ≤ j ≤ k − 1 . The set of Eqs. (19) and (20) can be used in order to derive an algorithm for rank-one update of Cholesky factors, and the updating process is shown in Algorithm 1.

Algorithm 1. Rank-one update for the factors L i − 1 and D i − 1

1. function UPD_CHOLESKY_FACTORS L i − 1 D i − 1 x i

2. Compute p i from L i T ⋅ p i = x i

3. for k = n ∼ 1 do

4. Compute d ˜ k via Eq. (10).

5. Set l kk ← 1 .

6. for j = 1 → k − 1 do

7. Compute l ˜ kk according to (11).

8. end for

9. end for

10. Set L i ← L i − 1 ⋅ L i − 1 and D i ← D ˜ i .

11. return L i , D i

12. end function

Algorithm 1 can be used in order to update the factors of B ̂ − 1 for all column vectors in X , and this process is detailed in Algorithm 2.

Algorithm 2. Computing the factors L m and D m of L m T ⋅ D m ⋅ L m

13. function COMPUTE_ANALYSIS_FACTOR L 0 D 0 H R

14. Set X ← H T ⋅ R − 1 2 .

15. for i = 1 → m do

16. Set L i D i ← UPD _ CHOLESKY _ FACTORS L i − 1 D i − 1 x i

17. end for

18. return L m , D m

19. end function

Once the updating process has been performed, the resulting factors form an estimate of the inverse analysis covariance matrix,

A ̂ − 1 = L m T ⋅ D m ⋅ L m ∈ R n × n , E21

From this covariance matrix, the posterior mode of the distribution can be approximated as follows:

x ¯ a = x ¯ b + z ∈ R n × 1 , E22

where

L m T ⋅ D m ⋅ L m ⋅ z = q , E23

with q = H T ⋅ R − 1 ⋅ y − H ⋅ x ¯ b ∈ R n × 1 . Note that the linear system (23) involves lower and upper triangular matrices, and therefore, x ¯ a can be estimated without the need of matrix inversion. Once the posterior mode is computed, the analysis ensemble is built about it. Note that A ̂ reads A ̂ = L m − 1 ⋅ D m − 1 ⋅ L m − T , and therefore, a square root of A ̂ can be approximated as follows:

A ̂ 1 2 = L m ‐ 1 ⋅ D m − 1 2 ∈ R n × n , E24

which can be utilized in order to build the analysis ensemble,

X a = x ¯ a ⋅ 1 N T + Δ X a , E25

where Δ X a ∈ R n × N is given by the solution of the linear system,

L m ⋅ D m − 1 2 ⋅ Δ X a = W ∈ R n × N , E26

In addition, the columns of W ∈ R n × N are formed by samples from a multivariate standard normal distribution. Again, since L m is lower triangular, the solution of (26) can be obtained readily.

3.1.2. Posterior ensemble Kalman filter deterministic (PEnKF-D)

The deterministic posterior ensemble Kalman filter (PEnKF-D) is a square root formulation of the PEnKF-S. The main difference between them lies on the use of synthetic data. In both methods, the matrices A − 1 and B − 1 are computed similarly (i.e., by using Dolittle’s method and the algorithms described in the previous section).

In the PEnKF-D, the computation of B ̂ − 1 is performed (13) based on the empirical moments of the ensemble. The analysis update is performed by using the perturbations

B ̂ − 1 2 ⋅ X b − x ¯ b ⋅ 1 N T ∼ N 0 I , E27

which are consistent with the dynamics of the numerical model, for instance, samples from (27) are driven by the physics and the dynamics of the numerical model. Finally, the analysis equation of the PEnKF-D

X a = x ¯ a ⋅ 1 N T + A ̂ 1 2 ⋅ B ̂ − 1 2 ⋅ Δ X b ∈ R n × N , E28

where

x ¯ a = x ¯ b + A ⋅ H T ⋅ R − 1 ⋅ y − H ⋅ x ¯ b , E29

A 1 2 ≈ A ̂ 1 2 = L ˜ T ⋅ D ˜ 1 2 − 1 , E30

and

B − 1 2 ≈ B ̂ − 1 2 = L ⋅ D 1 2 , E31

Since the moments of the posterior distribution are unchanged, we expect both methods PEnKF-S and PEnKF-D to perform equivalently in terms of errors during the estimation process.

3.2. Markov Chain Monte Carlo-based filters

Definitely, the EnKF represents a breakthrough in the data assimilation context, perhaps its biggest appeal is that we can obtain a closed form expression for the analysis members. Nevertheless, as can be noted, during the derivation of the analysis equations, some constrains are imposed, for instance, the observation operator is assumed linear, and therefore, the likelihood P y x obeys a Gaussian distribution. Of course, in practice, this assumption can be easily broken, and therefore, bias can be introduced on the posterior prediction; this is notoriously significant if one considers the fields in which data assimilation lives are submerged. From a statistical point of view, more specifically, under the Bayesian framework, this situation can be summarized as follows:

The prediction is obtained from the posterior distribution: P x y .
The information from numerical model is known like the prior of background: P b x .
The information from the observations is incorporated through the likelihood: P y x .
The posterior distribution is calculated: P x y ∝ P b x ⋅ P y x .
If P b x and P y x are Gaussian, P y x is Gaussian too, and the analysis is equal to the mean of P y x .

A significant number of approaches have been proposed in the current literature in order to deal with these constraints; for instance, the particle filters (PFs) and the maximum likelihood ensemble filter (MLEF) are ensemble-based methods, which deal with nonlinear observation operators. Unfortunately, in the context of PF, its use is questionable under realistic weather forecast scenarios since the number of particles (ensemble members) increases exponentially regarding the number of components in the model state. Anyways, an extended analysis of these methods exceeds the scope of this document, but its exploration is highly recommended.

Taking samples directly from the posterior distribution is a strategy that can help to remove the bias induced by wrong assumptions on the posterior distribution (i.e., normal assumptions). We do not put the sights on finding the mode of the posterior distribution; instead of this, we want a set of state vectors that allow us to create a satisfactory representation of the posterior error distribution, then, based on these samples, it is possible to estimate moments of the posterior distribution from which the analysis ensemble can be obtained [9].

Hereafter, we will construct a modification of the sequential scheme of data assimilation; first, we will describe how to compute the analysis members by using variations in Markov Chain Monte Carlo (MCMC)-based methods, and then, we will include the modified Cholesky decomposition in order to estimate a precision background covariance.

An overview of the proposed method is as follows:

The forecast step is unchanged; the forecast ensemble is obtained by applying the model to the ensemble of states of the previous assimilation cycle.
The analysis step is modified, so that the analysis is not obtained anymore by, for instance (28), but we perform k iterations of an algorithm from the Markov Chain Monte Carlo (MCMC) family in order to obtain samples from the posterior distribution.

In order to be more specific in the explanation, let us define Metropolis-Hastings (MH) as the selected algorithm from the MCMC family. Now let us focus on MCMC in the most intuitive way possible. Let us define J x y equal to the posterior pdf obtained from the expression P x y ∝ P b x ⋅ P y x . MH explores the state space in order to include model states in a so-called Markov Chain. The selection of candidates is based on the condition x c y > J x t − 1 y , that is, if the value of J ⋅ for a candidate x c is greater than that for a previous vector x t − 1 . Candidates are generated from a pre-defined proposal distribution Q ⋅ ; generally, a Gaussian multivariate distribution with mean x t − 1 is chosen, and a covariance matrix is defined in order to handle the step sizes. Concisely, MCMC methods proceed as follows: at first iteration, the chain is started with x b vector. At end, the sample is obtained by extracting the last t vectors on the chain, and the other vectors are dismissed or “burned”. In (32), we can see the expression to calculate the acceptance probability that relates the target pdf, J ⋅ , with the proposal Q ⋅

α = J x c y Q x t − 1 x c J x t − 1 y Q x c x t − 1 , E32

The terms that involve Q ⋅ can be canceled if a symmetric distribution is chosen as the proposal one, for instance, in the Gaussian distribution case. The procedure for the calculation of the analysis applying MH is described below:

Initialize the Markov Chain, C , assigning the background value x b to C 0 .
Generate a candidate vector state x c , from Q .
Obtain U from a uniform (0, 1) distribution.
If U ≤ α Then: C t + 1 = x c Else: C t + 1 = C t
Repeat Steps 2 through 4, for t = 1 until t = k − 1
Remove the first p vectors of the chain, the burned ones.

The analysis is computed over the sample:

x a = 1 k − p ⋅ ∑ i = k − p k C i ,

MCMC methods are straightforward to implement; when an enough number of iterations is performed, the behavior of the target function is captured on the chain [10]. This is true for even, complex density functions such as those with multiple modes. Briefly, let us focus on the fact that, generally, simulation-based methods such as MCMC explore a discretized grid, and as the mesh is refined, a huge number of iterations are needed before a high probability zone of the posterior distribution is reached [11]. Concretely, Hu et al. [12] proposed a family of modified MCMC dimension-independent algorithms under the name of preconditioned Crank-Nicolson (pCN) MCMC. These methods are robust regarding the curse of dimensionality in the statistical context. Initially, the Crank-Nicolson discretization is applied to a stochastic partial differential equation (SPDE) in order to obtain a new expression for the proposal distribution:

x c = x t − 1 − 1 2 δ K L x t − 1 + x c + 2 K δ ε 0 , E33

where L is the precision matrix of B , K is a preconditioning matrix, ε 0 is white noise, if K = B and δ ∈ 0 2 in (33), we get the pCN proposal described in (34):

x c = x t − 1 − 1 2 δ K L x t − 1 + x c + 2 K δ , E34

where ω ~ N 0 B and β = 8 δ / 2 + δ 2 . Of course, we use P b like a computationally efficient estimation of B . The acceptance probability is defined in (35):

α = min 1 exp J x t − 1 y − J x c y , E35

The procedure for the calculation of the analysis applying pCN Metropolis-Hastings is described as follows:

Initialize the Markov Chain, C , assigning the background value x b to C 0 .
Generate a candidate vector state using (3): x c = 1 − β 2 1 2 x t − 1 + β ω .
Obtain U from a uniform (0, 1) distribution.
If U ≤ α , then C t + 1 = x c . Else: C t + 1 = C t .
Repeat Steps 2 through 4, for t = 1 until t = k − 1 .
Remove the first p vectors of the chain, the burned ones.
The analysis is calculated over the sample: x a = 1 k − p ⋅ ∑ i = k − p k C i .

Finally, in the data assimilation context, by using the modified Cholesky decomposition described in the earlier sections, the pCN-MH filter reads:

The forecast step remains unchanged; the forecast ensemble is obtained by applying the model to the ensemble of states of the previous iteration, but the estimation of background covariance matrix is calculated by modified Cholesky.
The update step is modified, so that the analysis is obtained by run k iterations of pCN-MH, to obtain a sample of the posterior error distribution.

Now we are ready to numerically test the methods discussed in this chapter.

4. Experimental results

4.1. Stochastic filter PEnKF-S

We assess the accuracy of the PEnKF-S and compare it against that of the LETFK implementation proposed by Hunt [13]. The numerical model is the Lorenz 96 model [11], which mimics the behavior of the atmosphere:

dx k dt = − x k − 1 ⋅ x k − 2 − x k + 1 − x k + F , for 1 ≤ k ≤ n , E36

where n is the number of components in the model and F is an external force; with the value of the parameter, F = 8 , the model presents a great entropy.

The experimental settings are described below:

An initial random solution x − 3 + is propagated for a while in time by using Lorenz 96 model and a fourth-order Runge Kutta method in order to obtain a vector state x − 2 + whose physics are consistent with the dynamics of such numerical model. This vector state serves as our reference solution.
The reference solution is perturbed by using samples from a normal distribution with parameters N 0 n σ B ⋅ I . Three different values for σ B are considered during the numerical experiments σ B ∈ 0.05 0.10 0.15 . This perturbed state is propagated in time in order to make it consistent with the physics and dynamics of the numerical model. From here, an initial background state x − 1 b is obtained.
A similar procedure is performed in order to build a perturbed ensemble about x − 1 b . The ensemble members are propagated in time from where an ensemble of model realizations x 0 b i i = 1 N of model is obtained.
The assimilation windows consist of 15 equidistant observations. The frequency of observations is 0.5 time units, which represents 3.5 days in the atmosphere.
The dimension of the vector state is n = 40 . The external force of the numerical model is set to F = 80 .
The number of observed components is 50% of the dimension of the vector state.
Three ensemble sizes are tried during the experiments ∈ 20 40 60 .
As a measure of quality, the L 2 norm of the analysis state and the reference solution are computed across assimilation steps.
A total of 100 runs are performed for each pair N σ B . For each run, a different initial random vector is utilized in order to build the initial perturbed reference solution x − 3 + (before the model is applied x − 2 ∗ ). This yields to different initial ensembles as well as synthetic data for the different runs of each configuration (pair).

The average of the error norms of each pair N σ B for the LETKF and the PEnKF implementations is shown in Table 1 . As can be seen, on average across 100 runs, the performance of the proposed EnKF implementation outperforms that of the LETKF in terms of L 2 norm of the error. Even more, the PEnKF-S seems to be invariant to the initial background error σ B since, in all cases, when the ensemble size is increased a better estimation of the reference state x ∗ at different observation times is obtained. This can also obey to the estimation of background error correlations via the modified Cholesky decomposition since it is drastically improved whenever the ensemble size is increased as pointed out by Bickel and Levina in [1]. In such case, the error decreases by O log n / N . This is crucial in the PEnKF-S formulation since estimates of the precision analysis covariance matrix are obtained by rank-one updates on the inverse background error covariance matrix. On the other hand, in the LETKF context, increasing the ensemble size can improve the accuracy of the method but that is not better than the one shown by the PEnKF.

σ B	N	LETKF	PEnKF-S
0.05	20	22,6166	21,2591
	40	20,5671	18,2548
	60	20,0567	17,8824
0.10	20	23,1742	21,0725
	40	20,9513	18,3542
	60	18,5048	17,8240
0.15	20	24,8201	20,9059
	40	21,1314	18,1731
	60	20,8487	17,7590

Table 1.

Average of L-2 norm of errors for 100 runs of each configuration σ B N for the compared filter implementations.

Some plots of the L 2 norm of error for the PEnKF and the LETKF across different configurations and runs are shown in Figure 1 . Note that the error of the PEnKF decreases aggressively since the earlier iterations. In the LETKF context, the accuracy is similar to that of the PEnKF only at the end of the assimilation window.

Figure 1.
Local ensemble transform Kalmar filter (LETKF).

4.2. Deterministic filter PEnKF-D

Holding the settings from the previous section, experiments are performed by using the PEnKF-D. The results have similar behavior to that of the LETFK and the PEnKF-S implementation proposed by Niño et al. in [8]. This can be appreciated in Figures 1 – 3 . As can be seen, the error distributions reveal similar behavior across all compared filters.

Figure 3.
Posterior ensemble Kalmar filter deterministic (PEnKF-D).

Figure 2.
Posterior ensemble Kalmar filter stochastic (PEnKF-S).

4.3. MCMC filter based on modified Cholesky decomposition

We will describe experiments of our proposed filter based on a modified Cholesky decomposition and the preconditioned Crank-Nicolson Metropolis-Hastings. Again, the numerical model is the Lorenz 96 model. In this section, we are mainly interested on describing the behavior of the method, especially when the observational operator is nonlinear.

The experimental settings are as follows:

The observational operator is quadratic: H x i = x i 2 , where x i denotes the ith component of the vector state x .
The vector of states has n = 40 components.
The ensemble size is N = 20 .
The length of the Markov Chain is 1000.
The proposal is pCN with β = 0.09 .

Figure 4 describes the distribution of six of the 40 components of the state vector after 1000 iterations of the algorithm, attempting to visualize if the actual value is within or near the region of the highest probability density described by the sample contained in the Markov Chain.

Figure 4.
Histogram of six components of the state vector after 1000 iterations of the algorithm, the dashed lines indicate the position of the true value for the respective component.

Note that, not in all cases, the actual values of the model components were located at the peaks of the histogram, but most of them were within or near to zones of high probability described by the sample. This result is important if we take into account that probably the posterior distribution is not normal as is the case of quadratic observation operators.

5. Conclusions

In this chapter, efficient EnKF implementations were discussed. All of them are based on a modified Cholesky decomposition wherein a precision background covariance is obtained in terms of Cholesky factors. In the first filter, the PEnKF-S, synthetic data are utilized in order to compute the posterior members, as done in stochastic formulations of the filter. Even more, a sequence of rank-one updates can be applied over the factors of the prior precision matrix in order to estimate those of the posterior precision. In the second filter, the PEnKF-D, synthetic data are avoided by using perturbations obtained from the physics and the dynamics of the numerical model. Finally, a MCMC-based filter is obtained in order to reduce the impact of bias when Gaussian assumptions are broken during the assimilation of observations, for instance, when the observation operator is nonlinear. Numerical experiments with the Lorenz 96 model reveal that the proposed filters are comparable to filters from the specialized literature.

Acknowledgments

This work was supported by the Applied Math and Computational Science Lab (AML-CS) at Universidad del Norte in Barranquilla, Colombia.

References

1. Bickel PJ, Levina E. Covariance regularization by thresholding. The Annals of Statistics. 2008;36(6):2577-2604. DOI: 10.1214/08-AOS600
2. Evensen G. The ensemble Kalman filter: Theoretical formulation and practical implementation. Ocean Dynamics. 2003;53(4):343-367
3. Burgers G, Jan van Leeuwen P, Evensen G. Analysis scheme in the ensemble Kalman filter. Monthly Weather Review 1998;126(6):1719-1724
4. Sakov P, Oke PR. A deterministic formulation of the ensemble Kalman filter: An alternative to ensemble square root filters. Tellus A. 2008;60(2):361-371
5. Coleman TF, Van Loan C. Handbook for matrix computations. Society for Industrial and Applied Mathematics, 1988
6. Nino-Ruiz ED, Sandu A, Deng X. A parallel implementation of the ensemble Kalman filter based on modified Cholesky decomposition. Journal of Computational Science. 2017
7. Nino-Ruiz ED, Mancilla A, Calabria JC. A posterior ensemble Kalman filter based on a modified Cholesky decomposition. Procedia Computer Science. 2017;108:2049-2058
8. Nino-Ruiz ED. A matrix-free posterior ensemble Kalman filter implementation based on a modified Cholesky decomposition. Atmosphere. 2017;8(7):125
9. Attia A, Sandu A. A hybrid Monte Carlo sampling filter for non-gaussian data assimilation. AIMS Geosciences. 2015;1:41-78
10. David H, Shane Reese C, David Moulton J, Vrugt JA, Colin F. Posterior exploration for computationally intensive. In: Steve B, Andrew G, Jones GL, Meng X-L, editors. Handbook of Markov Chain Monte Carlo. Chapman & Hall; 2011. p. 401-418
11. Cotter SL, Roberts GO, Stuart AM, White D, et al. MCMC methods for functions: Modifying old algorithms to make them faster. Statistical Science. 2013;28(3):424-446
12. Hu Z, Yao Z, Li J. On an adaptive preconditioned Crank--Nicolson MCMC algorithm for infinite dimensional Bayesian inference. Journal of Computational Physics. 2017;332:492-503
13. Fatkullin I, Vanden-Eijnden E. A computational strategy for multiscale systems with applications to Lorenz 96 model. Journal of Computational Physics. 2004;200(2):605

[1] 1. Bickel PJ, Levina E. Covariance regularization by thresholding. The Annals of Statistics. 2008;36(6):2577-2604. DOI: 10.1214/08-AOS600

[2] 2. Evensen G. The ensemble Kalman filter: Theoretical formulation and practical implementation. Ocean Dynamics. 2003;53(4):343-367

[3] 3. Burgers G, Jan van Leeuwen P, Evensen G. Analysis scheme in the ensemble Kalman filter. Monthly Weather Review 1998;126(6):1719-1724

[4] 4. Sakov P, Oke PR. A deterministic formulation of the ensemble Kalman filter: An alternative to ensemble square root filters. Tellus A. 2008;60(2):361-371

[5] 5. Coleman TF, Van Loan C. Handbook for matrix computations. Society for Industrial and Applied Mathematics, 1988

[6] 6. Nino-Ruiz ED, Sandu A, Deng X. A parallel implementation of the ensemble Kalman filter based on modified Cholesky decomposition. Journal of Computational Science. 2017

[7] 7. Nino-Ruiz ED, Mancilla A, Calabria JC. A posterior ensemble Kalman filter based on a modified Cholesky decomposition. Procedia Computer Science. 2017;108:2049-2058

[8] 8. Nino-Ruiz ED. A matrix-free posterior ensemble Kalman filter implementation based on a modified Cholesky decomposition. Atmosphere. 2017;8(7):125

[9] 9. Attia A, Sandu A. A hybrid Monte Carlo sampling filter for non-gaussian data assimilation. AIMS Geosciences. 2015;1:41-78

[10] 10. David H, Shane Reese C, David Moulton J, Vrugt JA, Colin F. Posterior exploration for computationally intensive. In: Steve B, Andrew G, Jones GL, Meng X-L, editors. Handbook of Markov Chain Monte Carlo. Chapman & Hall; 2011. p. 401-418

[11] 11. Cotter SL, Roberts GO, Stuart AM, White D, et al. MCMC methods for functions: Modifying old algorithms to make them faster. Statistical Science. 2013;28(3):424-446

[12] 12. Hu Z, Yao Z, Li J. On an adaptive preconditioned Crank--Nicolson MCMC algorithm for infinite dimensional Bayesian inference. Journal of Computational Physics. 2017;332:492-503

[13] 13. Fatkullin I, Vanden-Eijnden E. A computational strategy for multiscale systems with applications to Lorenz 96 model. Journal of Computational Physics. 2004;200(2):605

Efficient Matrix-Free Ensemble Kalman Filter Implementations: Accounting for Localization

Kalman Filters - Theory for Advanced Applications

Abstract

Keywords

Author Information

Elias David Niño Ruiz*

Rolando Beltrán Arrieta

Alfonso Manuel Mancilla Herrera

1. Introduction

2. Preliminaries

2.1. The ensemble Kalman filter

2.2. Cholesky and modified Cholesky decomposition

3. Filters based on modified Cholesky decomposition

3.1. Ensemble Kalman filter based on modified Cholesky

3.1.1. Posterior ensemble Kalman filter stochastic (PEnKF-S)

3.1.2. Posterior ensemble Kalman filter deterministic (PEnKF-D)

3.2. Markov Chain Monte Carlo-based filters

4. Experimental results

4.1. Stochastic filter PEnKF-S

Table 1.

Figure 1.

4.2. Deterministic filter PEnKF-D

Figure 3.

Figure 2.

4.3. MCMC filter based on modified Cholesky decomposition

Figure 4.

5. Conclusions

Acknowledgments

References

Kalman Filters for Reference Current Generation in Shunt Active Power Filter (APF)

Efficient Matrix-Free Ensemble Kalman Filter Implementations: Accounting for Localization

Kalman Filters - Theory for Advanced Applications

Abstract

Keywords

Author Information

Elias David Niño Ruiz*

Rolando Beltrán Arrieta

Alfonso Manuel Mancilla Herrera

1. Introduction

2. Preliminaries

2.1. The ensemble Kalman filter

2.2. Cholesky and modified Cholesky decomposition

3. Filters based on modified Cholesky decomposition

3.1. Ensemble Kalman filter based on modified Cholesky

3.1.1. Posterior ensemble Kalman filter stochastic (PEnKF-S)

3.1.2. Posterior ensemble Kalman filter deterministic (PEnKF-D)

3.2. Markov Chain Monte Carlo-based filters

4. Experimental results

4.1. Stochastic filter PEnKF-S

Table 1.

Figure 1.

4.2. Deterministic filter PEnKF-D

Figure 3.

Figure 2.

4.3. MCMC filter based on modified Cholesky decomposition

Figure 4.

5. Conclusions

Acknowledgments

References

Continue reading from the same book

Kalman Filters