Home page  
Home   Your Room   Login   Contact   Feedback   Site Map   Search:  
Discover this product  
About Us
Overview
Getting here
Committees
Products
Forecasts
Order Data
Order Software
Services
Computing
Archive
PrepIFS
Research
Modelling
Reanalysis
Seasonal
Publications
Newsletters
Manuals
Library
News&Events
Calendar
Employment
Open Tenders
   
User Guide to ECMWF Forecast Products > 
 Use of forecast products and applications Appendix B: The mathematics of forecast errors  
   

Appendix A: 4DVAR - an elementary introduction

Browse
Preface
Presentation of ECMWF
The ECMWF global atmospheric model
The data assimilation system
Why do forecasts go wrong?
The Ensemble Prediction System (EPS)
The forecast products
The verification of ECMWF forecasts
The deterministic use of ECMWF forecasts
The use of the Ensemble Prediction forecasts
Comments on the use of statistical interpretation
Forecasts beyond ten days
Epilogue
References and further literature
Appendix A: 4DVAR - an elementary introduction
Appendix B: The mathematics of forecast errors
Appendix C: The Brier Score
 
 

Assume that we have two reports, +10o and +15o C, of the temperature for a certain location. The first has an assumed error of 2 K, the second 3 K. The true value is therefore probably closer to +10o than to +15o. A traditional statistical least-square technique weights together the two observations, with weights proportional to the "precision" or accuracy of the measurements defined as the inverse of the variances of the assumed errors. In our case these variances are 4 and 9, the weights become 9/(4+9) 0.7 and 4/(4+9) 0.3,which yields an "analysed" value of

10 * 0.7 + 15 * 0.3 = 11.5 C.

Let us now assume that of the two values one is an observation (O), the other a background field value (F) with accuracies σ O and σ B respectively. In accordance with what stated above the weighting formula may then be written

guide2007-77.gif  

 

Most meteorological objective analysis techniques are further developments of this simple least-square approach. They have, however, certain weaknesses. So for example, the method is local: only observations within a limited area could be considered at each time to influence a given grid point. They could not cope well with non-conventional data. That is why a variational technique was introduced.

This technique starts by introducing a so called "cost function" J(S) which measures the sum of the squares of the distances (or misfit) of different atmospheric states S to the observation O and the background F.

guide2007-78.gif  

 

The observation O is known, and we are looking for a value of A that will make the cost function J(S) as small as possible. We do this by differentiating J with respect to S. This will result in the same value of A as for the least square approach above, which shows the internal consistency of the two approaches.

One may therefore ask the motivation for defining this cost function? The reason is that it opens up a door to a mathematical formalism which will provide a powerful tool, in particular when the scalars in the right hand equation are replaced by vector and matrices.

Let us start by trivially re-arranging the equation above

guide2007-79.gif

We now introduce an operator H which can be anything from a simple interpolation of the analysis to the exact position of an observation, to a complicated equation that converts temperature and moisture information into radiance values. Instead of using the difference between analysis and observation, we use the difference between the converted value H(S) and the observation Y:

guide2007-80.gif

The next step is to consider not one, not many, but all observations made simultaneously over the globe. The scalars A, F and Y now become vectors of enormous dimensions since all the global observations enter as elements. So for example when there is "only" one million observations globally available at a certain time interval the vector Y=(obs1, obs2, obs3,................obs999999, obs1000000).

The error σ O2 and σ B 2 variances turn from scalars into the error covariance matrices R for the observations. To match the vectors they are formally of the size 1000000 x 1000000. The matrice.B for the background or first guess is approximately of the same dimension.

guide2007-81.gif

This is the basic equation for the 3DVAR assimilation system. In spite of the enormity of the numerical calculations, it still preserves the structure of our first formula above.

What characterizes 4DVAR is that time enters as an additional element. The aim is to find the A0 in the beginning (t=0) of a 12-hour time interval, which minimizes J. The minimization will not only be dependent on the conditions at t=0, but to the whole dynamic evolution during the 12-hour interval.

So while the first term on the right hand side is kept alone, the second term will be evaluated for every single time step (t=1 -> N). The operator H is as before the operator which converts model states into observation states. But we now also introduce operator Mn, which is the forecast model at time step n.

By integrating the atmosphere forward by Mn and "backward" by the so called adjoint in the 12-hour window, 4DVAR process ensures that the initial conditions at 09 and 21 UTC are worked out in a way that provides optimum fit to the observations throughout the 12 hour window.

So mathematically we are looking for an atmospheric state S0=A at time n=0 which minimizes the cost function J(S).

 

guide2007-82.gif

The first term: the difference between the first guess and the initial state determines only partly the size of J.

The second term: the Σ− term, sums up all the differences between the evolving forecast and the n number of observations of varying kind.




Top of page 31-03-2007
 
   Compare Pages Page Details         © ECMWF
 Use of forecast products and applications Appendix B: The mathematics of forecast errors  
shim shim shim