Jorge Baño-Medina

and 4 more

Neural Weather Models (NWM) are novel data-driven weather forecasting tools based on neural networks that have recently achieved comparable deterministic forecast skill to current operational approaches using significantly less real-time computational resources. The short inference times required by NWMs allow the generation of a large ensemble potentially providing benefits in quantifying the forecast uncertainty, particularly for extreme events, which is of critical importance for various socio-economic sectors. Here we propose a novel ensemble design for NWMs spanning two main sources of uncertainty: epistemic —or model uncertainty,— and aleatoric —or initial condition uncertainty. For the epistemic uncertainty, we propose an effective strategy for creating a diverse ensemble of NWMs that captures uncertainty in key model parameters. For the aleatoric, we explore the “breeding of growing modes” for the first time on NWMs, a technique traditionally used for operational numerical weather predictions as an estimate of the initial condition uncertainty. The combination of these two types of uncertainty produces an ensemble of NWM-based forecasts that is shown to improve upon benchmark probabilistic NWM and is competitive with the 51-member ensemble of the European Centre for Medium-Range Weather Forecasts based on the Integrated Forecasting System (IFS) —a gold standard in weather forecasting,— in terms of both error and calibration. In addition, we report better probabilistic skill than the IFS over land for two key variables: surface wind and air surface temperature.

Yuan Yang

and 9 more

Accurate global river discharge estimation is crucial for advancing our scientific understanding of the global water cycle and supporting various downstream applications. In recent years, data-driven machine learning models, particularly the Long Short-Term Memory (LSTM) model, have shown significant promise in estimating discharge. Despite this, the applicability of LSTM models for global river discharge estimation remains largely unexplored. In this study, we diverge from the conventional basin-lumped LSTM modeling in limited basins. For the first time, we apply an LSTM on a global 0.25° grid, coupling it with a river routing model to estimate river discharge for every river reach worldwide. We rigorously evaluate the performance over 5332 evaluation gauges globally for the period 2000-2020, separate from the training basins and period. The grid-scale LSTM model effectively captures the rainfall-runoff behavior, reproducing global river discharge with high accuracy and achieving a median Kling-Gupta Efficiency (KGE) of 0.563. It outperforms an extensively bias-corrected and calibrated benchmark simulation based on the Variable Infiltration Capacity (VIC) model, which achieved a median KGE of 0.466. Using the global grid-scale LSTM model, we develop an improved global reach-level daily discharge dataset spanning 1980 to 2020, named GRADES-hydroDL. This dataset is anticipated to be useful for a myriad of applications, including providing prior information for the Surface Water and Ocean Topography (SWOT) satellite mission. The dataset is openly available via Globus.