Newsletter-banner-No-153

New point-rainfall forecasts for flash flood prediction

Fatima Pillosu (ECMWF and University of Reading), Tim Hewson

 

ECMWF has developed a probabilistic point-rainfall product which could support the prediction of flash floods across the globe. The product, which is based on an innovative post-processing method, aims to bridge the gap between the relatively coarse resolution of today’s global forecasting models and the higher-resolution limited-area models needed to describe localised heavy rainfall. The methodology is based on physically relevant statistical relationships between the larger-scale weather features well represented by ECMWF forecasts and local realisations represented by point observations. These relationships make it possible to compute statistically based (rather than raw-ensemble-based) probabilities for point rainfall. This includes extremes, which can be used to infer the likelihood of flash floods for use on platforms such as the European and Global Flood Awareness Systems.

The post-processing method blends together information from different locations whenever they experience similar rainfall generation mechanisms, assuming that these physical mechanisms are universal and dependent on key atmospheric and geographic properties. This means that:

  • one year of global rainfall observations is adequate because it can equate to hundreds of years used in locally-calibrated techniques, and
  • extremes can be successfully predicted even when they do not exist in a local record.

Moreover, the reliance on physics means that forecasts can be confidently produced for anywhere in the world, even places without observations. The post-processing system has been fully automated and requires minimal computing resources to run compared to high-resolution numerical models.

Sub-grid variability
%3Cstrong%3EExample%20of%20large%20sub-grid%20variability%20due%20to%20slow-moving%20convection.%3C/strong%3E%20The%20chart%20shows%20radar-derived%20totals%20(%3Cem%3Esource:%20netweather.tv%3C/em%3E).%20The%20black%20square%20is%20an%20ENS%20gridbox,%20colours%20denote%200%E2%80%9380%20mm.
Example of large sub-grid variability due to slow-moving convection. The chart shows radar-derived totals (source: netweather.tv). The black square is an ENS gridbox, colours denote 0–80 mm.

Methodology

Radar-derived totals show that very different geometries of rainfall total variability can arise within regions of a global model grid box of typical size. Global models include standard output parameters such as convective precipitation fraction, mid-tropospheric wind speed, convective available potential energy (CAPE), total daily clear-sky solar radiation and total precipitation that, on the basis of physical reasoning, make it possible to anticipate sub-grid variability and account for biases in grid-scale rainfall. Those parameters are used to define subsets, hereafter called ‘weather types’, which share the same meteorological/geographical conditions.

Forecast Error Ratio (FER) frequency distributions
%3Cstrong%3EForecast%20Error%20Ratio%20(FER)%20frequency%20distributions.%3C/strong%3E%20The%20distributions%20are%20for%2012-hourly%20post-processed%20short-range%20rainfall%20forecasts%20(FC)%20based%20on%20a%20nine-month%20training%20period,%20representing%201.6%20million%20global%20rainfall%20SYNOP%20observations%20(OBS).%20The%20panels%20on%20the%20right%20show%20how%20frequency%20distributions%20change%20for%20two%20weather%20type%20examples:%20winter%20insolation%20(top)%20and%20summer%20insolation%20(bottom),%20both%20with%20mainly%20convective%20rainfall,%20moderate%20steering%20winds%20and%20medium%20CAPE.
Forecast Error Ratio (FER) frequency distributions. The distributions are for 12-hourly post-processed short-range rainfall forecasts (FC) based on a nine-month training period, representing 1.6 million global rainfall SYNOP observations (OBS). The panels on the right show how frequency distributions change for two weather type examples: winter insolation (top) and summer insolation (bottom), both with mainly convective rainfall, moderate steering winds and medium CAPE.

The amount by which observed rainfall totals at a point differ from short-range forecasts for the equivalent model gridbox can be expressed as a ratio, hereafter called Forecast Error Ratio (FER). By compositing many cases a general frequency distribution can be derived, which can be used to transform ensemble gridbox rainfall forecasts into probabilistic point-rainfall forecasts. The true utility of the new approach, however, lies in creating and using separate frequency distributions for weather types that differ from one another in significant and physically realistic ways, as illustrated by the small panels in the FER frequency distribution figure.

Outputs and example forecasts

Point-rainfall forecast
%3Cstrong%3EPoint-rainfall%20forecast.%3C/strong%3E%2098th%20percentile%20in%20mm/12h%20from%2000%20UTC%20on%208%20September%202017%20for%20the%20period%20t+42%20to%20t+54.%20The%20top%20five%20substantive%20maxima%20are%20labelled.%20The%20Florida/Atlantic%20peaks%20relate%20to%20hurricanes%20Irma%20and%20Jose.%20L%20points%20to%20Livorno.
Point-rainfall forecast. 98th percentile in mm/12h from 00 UTC on 8 September 2017 for the period t+42 to t+54. The top five substantive maxima are labelled. The Florida/Atlantic peaks relate to hurricanes Irma and Jose. L points to Livorno.

The current pre-operational version produces probabilistic rainfall forecasts for points, in the form of percentiles from 1 to 99, once a day (00 UTC run), up to day 5, for 06–18 and 18–06 UTC validity times. The operational version (available soon via ecCharts) will produce forecasts twice a day (00 and 12 UTC), up to day 10, for 6-, 12- and 24-hour accumulations.

The output can be used by forecasters in different ways. For example, a user may be interested in identifying the wettest places in the world in two days’ time in order to anticipate which areas are vulnerable to flash floods. They could plot point rainfall (e.g. in mm/12h) for high percentiles, e.g. the 95th or 98th percentile, and could then say that there is 1/20 or 1/50 chance for the amount displayed to occur at a given location in a given grid box, albeit without being able to pinpoint exact locations within a grid box where extremes are most likely.

For instance, between 2 and 4 a.m. on 10 September 2017 more than 200 mm of rain fell in the vicinity of Livorno in Italy, causing widespread flooding and six fatalities. On the 98th percentile point-rainfall map for that night from two days earlier, parts of northern Italy, including Livorno, stand out as being especially vulnerable to very high totals. On the equivalent raw ensemble plot (not shown) this region does not stand out. On the basis of the point-rainfall forecast, one could have put parts of northern Italy on alert, albeit at a low risk level. Other considerations related to vulnerability (e.g. population density, topographic slopes, land use, soil moisture, flood defences) could have facilitated refinement of warning areas.

Where there is local knowledge regarding flash flood trigger levels, users may alternatively be interested in the probability of exceeding a certain threshold, as shown in the UK case study charts. This example also shows how the post-processing does not simply amount to amplifying signals of more extreme events.

UK point-rainfall case study
%3Cstrong%3EUK%20point-rainfall%20case%20study%3C/strong%3E.%20The%20charts%20show%20the%20raw%20ENS%20probabilities%20(left)%20and%20point-rainfall%20probabilities%20(middle)%20of%20exceeding%2020%20mm/12h%20(30%20to%2042-hour%20forecasts%20valid%2006%E2%80%9318%20UTC%206%20July%202017).%20The%20right-hand%20chart%20shows%20UK%20radar-derived%20totals%20(%3Cem%3Esource:%20netweather.tv%3C/em%3E).
UK point-rainfall case study. The charts show the raw ENS probabilities (left) and point-rainfall probabilities (middle) of exceeding 20 mm/12h (30 to 42-hour forecasts valid 06–18 UTC 6 July 2017). The right-hand chart shows UK radar-derived totals (source: netweather.tv).

Verification

A retrospective global verification for the current pre-operational system covers the period April 2016 to March 2017, for several sets of lead times (days 1, 3, 5), focusing on the reliability and resolution of the forecast (‘resolution’ here refers to how well the system is able to distinguish between occasions when events are more/less likely to occur).

Evaluation of the raw ensemble (ENS) shows an under-representation of small and large values. A much flatter rank histogram for the point-rainfall product is indicative of a much more reliable forecast. Finally, using the ROC area metric, the point-rainfall product exhibits greatly improved resolution. This is notably the case for higher thresholds, where the point-rainfall product at day 5 is about as good as the raw ensemble at day 1, illustrating clear added value for users.

One year of point-rainfall forecast verification (global)
%3Cstrong%3EOne%20year%20of%20point-rainfall%20forecast%20verification%20(global).%3C/strong%3E%20Rank%20histograms%20(t+102%20to%20t+114%20and%20t+114%20to%20t+126)%20for%20the%20raw%20ensemble%20where%20each%20rank%20is%20an%20ENS%20member%20(1%20to%2051%20+%20outliers)%20(top%20left)%20and%20for%20point%20rainfall%20where%20each%20rank%20is%20a%20percentile%20(1%20to%2099%20+%20outliers)%20(bottom%20left).%20A%20flatter%20histogram%20indicates%20better%20reliability.%20The%20other%20charts%20show%20the%20area%20under%20the%20ROC%20curve%20for%20exceeding%20different%20thresholds.%20Higher%20values%20indicate%20better%20resolution.%20The%20verification%20period%20is%20April%202016%20to%20March%202017.
One year of point-rainfall forecast verification (global). Rank histograms (t+102 to t+114 and t+114 to t+126) for the raw ensemble where each rank is an ENS member (1 to 51 + outliers) (top left) and for point rainfall where each rank is a percentile (1 to 99 + outliers) (bottom left). A flatter histogram indicates better reliability. The other charts show the area under the ROC curve for exceeding different thresholds. Higher values indicate better resolution. The verification period is April 2016 to March 2017.