Home page  
Home   Your Room   Login   Contact   Feedback   Site Map   Search:  
Discover this product  
About Us
Overview
Getting here
Committees
Products
Forecasts
Order Data
Order Software
Services
Computing
Archive
PrepIFS
Research
Modelling
Reanalysis
Seasonal
Publications
Newsletters
Manuals
Library
News&Events
Calendar
Employment
Open Tenders
   
Home > Newsevents > Training > Rcourse_notes > DATA_ASSIMILATION > ASSIM_CONCEPTS >  
   

Data assimilation concepts and methods
March 1999

By F. Bouttier and P. Courtier


1. Basic concepts in data assimilation
2. The state vector, control space and observations
3. The modelling of errors
4. Statistical interpolation with least-squares estimation
5. A simple scalar illustration of least-squares estimation
6. Models of error covariance
7. Optimal interpolation (OI) analysis
8. Three-dimensional variational analysis (3D-Var)
9. 1D-Var and other variational analysis systems
10. Four-dimensional variational assimilation (4D-Var)
11. Estimating the quality of the analysis
12. Implementation techniques
13. Dual formulation of 3D/4D-Var (PSAS)
14. The extended Kalman filter (EKF)
15. Conclusion
Appendix A. A primer on linear matrix algebra
Appendix B. Practical adjoint coding
Appendix C. Exercises
Appendix D. Main symbols
References
 
  Training Course Notes Front Page >>
Table of contents >>
Next Section >>
Previous Section >>





APPENDIX C. Exercises

The number of stars indicate roughly the degree of difficulty.
(i)   Prove equation (A4) giving if is optimal.
(ii)   Prove directly the equations given in the section on the scalar case.
(iii)   Prove the theorem on preconditioning, including the case where the square root of is used. Does the condition number depend on the choice of square root matrix?
(iv)   Compare the BLUE equations with the linear regression equations between the model and observation values.
(v)   Write and comment on the BLUE analysis in a one-dimensional model, with one and then with 2 observations.
(vi)   rewrite the KF equations in the scalar case and examine its convergence in time if the model is the identity and if and are constant.
(vii)   Calculate the product of a vector with the Hessian using the simulator operator only.
(viii)   The SWM (Sherley-Woodbury-Morrisson) approximation of a positive definite matrix is ( are positive scalars, are vectors). Prove that it is positive definite, and derive its inverse and a symmetric square root.
(ix)   * Calculate the normalization factors to define properly the Gaussian pdfs for the background and the analysis states.
(x)   Write the algorithm to implement a Cressman analysis. What happens if the observing network is very dense?
(xi)   a primitive analysis technique is to fit a set of polynomials to the observations. Derive the algorithm in a one-dimensional framework.
(xii)   * Generalize the polynomial fit technique to give different weights to different observations.
(xiii)   Prove that . Is it a sufficient condition for the covariance matrix to be positive definite?
(xiv)   Prove that a covariance matrix can be factorized in the form and describe some numerical methods to do it.
(xv)   * Give examples in which the adjoint is not the inverse, and examples in which it is.
(xvi)   * Derive in the scalar case what is the analysis error if the weight is calculated using an assumed that is not the genuine background standard error.
(xvii)   * Prove that the background error covariance matrix can be factorized as where is a diagonal matrix and is the correlation matrix. What is the physical meaning of ?
(xviii)   * Rewrite the 4D-Var algorithm using the inverse of the model (assuming it exists), putting the analysis time at the end of the time interval.
(xix)   * (physics regularization) In the scalar case, considering the observation operator , design a continuously differentiable observation operator with a tunable "regularization" parameter so that can be as small as required and outside a small interval around zero.
(xx)   ** Design a scalar example using the previous observation operator, in which the cost-function has one or two minima, depending on the value of the regularization parameter.
(xxi)   * Prove that the scalar KF, with the model equal to the identity and constant error statistics, is equivalent to a running average that is defined, in the limit of a continuous time variable, by an exponential weighting function. How does the e-folding time depend on the error statistics?
(xxii)   * (adaptive filter) Rewrite the KF equation as an adaptive statistical adaptation scheme: , where the model state is the two scalars and is the scalar observation, is an externally defined function of time. The forecast model is assumed to be the identity.
(xxiii)   ** Generalize the Cressman algorithm in order to retain some background information at the analysis points, as in the least-squares analysis.
(xxiv)   ** (retrieval and super-obing) Modify the BLUE equation for when the observations are replaced by a linear combination of them through a retrieval algorithm , i.e. .
(xxv)   ** Precondition the PSAS cost function with the symmetric square root of and prove that the condition number is then the same as 3D-Var preconditioned by the symmetric square root of .
(xxvi)   *** (control variable remapping) In a continuous one-dimensional model, derive the adjoint of the "remapping" operator where is the space coordinate and is an invertible, continuously differentiable function. Does this make sense in a discrete model?
(xxvii)   *** Derive the 4D-Var equations by expressing the minimization problem constrained by the model equations with its Lagrangian, and comment on the physical meaning of the Lagrange multiplier at the analysis point.
(xxviii)   ** (flow-dependency in 4D-Var) Derive the Hessian of a 4D-Var in which there is one single observation at the end of the analysis interval. How does the analysis increment compare with the singular vectors of the model? (see the training course on predictability)
(xxix)   *** The NMC method assumes that the covariances of forecast error differences (differences between two forecasts starting from 2 consecutive analyses and valid at the same time) are similar to the forecast error covariances. Formulate this using the KF notation and discuss the validity of the assumption.
(xxx)   *** (lagged innovation covariances) Assuming that the observing network is always the same in the KF, prove that if the analysis weight is optimal, then the innovation departures are not correlated in time.
(xxxi)   *** (fixed-lag Kalman smoother) Derive the equations for the 1-lag Kalman smoother, i.e. a generalization of the KF equations in which the observations at both times of the current analysis and of the next one are used at each analysis step. Tip: extend the KF control variable to include the model state at both analysis times.


Training Course Notes Front Page >>
Table of contents >>
Next Section >>
Previous Section >>







 

Top of page 03.12.2001
 
   Page Details         © ECMWF
shim shim shim