Empowering weather and climate forecast

Weather & climate datasets and tools

Datasets and weather & climate machine learning applications will be made accessible via Git; deliverables can be downloaded and papers are linked from this website. Results will be the topic of talks and workshops.

Dataset for energy production forecast

Due 2021-08-31: Delivered

MAELSTROM presents a dataset for forecasting energy production in the near and mid-term future using machine learning. Past weather forecast data is combined with local energy production data to train a tool that predicts power production from weather forecasts.

Get data set

Dataset for 2m temperature downscaling

Due 2021-08-31: Delivered

MAELSTROM presents a novel dataset that enables users to explore deep learning methods for 2m temperature downscaling. The dataset includes 2m temperature and surface elevation.

Get data set

Dataset for ensemble predictions

Due 2021-08-31: Delivered

MAELSTROM offers a benchmark machine learning dataset for ensemble forecasts of temperature at 850 hPa and geopotential at 500 hPa. The dataset consists of the input variables T (temperature), Z (geopotential), U (U component of wind), V (V component of wind), D (divergence), W (vertical velocity) and Q (specific humidity) with 11 ensemble members at 11 pressure levels, and is based on hindcast simulations of the European Centre for Medium-Range Weather Forecasts. This dataset enables users to learn how to use deep learning for post-processing of ensemble weather forecasts.

Get data set
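
As a starting point for post-processing experiments, the ensemble mean and spread are usually the first statistics to compute. The sketch below uses synthetic random data with the dimensions quoted above (11 members, 7 variables, 11 pressure levels). The array layout with the member dimension first, the tiny 4x8 grid, and the function name are assumptions made for illustration and are not taken from the dataset itself.

```python
import numpy as np

# Dimensions quoted in the dataset description.
N_MEMBERS = 11
N_LEVELS = 11
VARIABLES = ["T", "Z", "U", "V", "D", "W", "Q"]


def ensemble_mean_and_spread(fields):
    """Return ensemble mean and spread (sample standard deviation).

    `fields` is assumed to carry the ensemble member dimension on axis 0.
    """
    mean = fields.mean(axis=0)
    spread = fields.std(axis=0, ddof=1)
    return mean, spread


# Synthetic stand-in data on a hypothetical 4x8 grid (not the real dataset).
rng = np.random.default_rng(0)
fields = rng.normal(size=(N_MEMBERS, len(VARIABLES), N_LEVELS, 4, 8))

mean, spread = ensemble_mean_and_spread(fields)
print(mean.shape, spread.shape)
```

A learned post-processing model would typically take statistics like these, or the full set of members, as input and be scored against a baseline that uses the raw ensemble.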

Datasets for 2m temp. and precipitation short-range forecasts

Due 2021-08-31: Delivered

MAELSTROM offers new datasets for short-range forecasts of 2m temperature and hourly precipitation over the Nordics/Northern Europe. The datasets consist of several terabytes of real-time observations and forecast outputs, provided on a 1796x2321 grid with 47 input variables and 60 forecast lead times. They allow users to explore the use of deep learning for 2m temperature and precipitation predictions.

Get data set
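
To get a feeling for why this data reaches several terabytes, the following back-of-the-envelope sketch estimates the uncompressed size of a single forecast from the grid size, variable count and lead-time count quoted above. The 4 bytes per value (float32) and the assumption that every variable is stored at every lead time are illustrative guesses, since the storage format is not described here.

```python
# Grid and sample dimensions quoted in the dataset description.
GRID_Y, GRID_X = 1796, 2321
N_VARIABLES = 47
N_LEAD_TIMES = 60

# Assumption: 4 bytes per value (float32); the storage format is not stated.
BYTES_PER_VALUE = 4


def bytes_per_forecast(grid_y, grid_x, n_variables, n_lead_times, bytes_per_value):
    """Uncompressed size of one forecast with every variable at every lead time."""
    return grid_y * grid_x * n_variables * n_lead_times * bytes_per_value


size_bytes = bytes_per_forecast(
    GRID_Y, GRID_X, N_VARIABLES, N_LEAD_TIMES, BYTES_PER_VALUE
)
print(f"{size_bytes / 1e9:.1f} GB per forecast")
```

Under these assumptions a single forecast is already tens of gigabytes, so data loading and I/O are likely to matter as much as model design when training on this data.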

Dataset to emulate radiation

Due 2021-08-31: Delivered

Now available for public use: This dataset enables the use of machine learning to learn the process of radiative heating -- one of the key processes in weather and climate models. The dataset will be available in several tiers of different sizes, up to several terabytes. It enables users to accelerate the representation of the interactions between radiation from the sun and the Earth and the vertical structure of the atmosphere, including clouds. The dataset has a very high vertical resolution of 137 levels.

Get data set

Report on machine learning solutions and tools

Due 2021-09-30: Delivered

To learn more about the first versions of the machine learning tools and solutions, including architectures and loss functions, that will be studied for the six machine learning benchmark datasets, see this report. It presents the results of a survey of the customized machine learning solutions and tools that MAELSTROM applications will adopt.

Download report

Version 1.0 of custom ML solutions

Due 2022-09-30: 252 days left

This report will present the first set of customized machine learning solutions developed for the six MAELSTROM applications. Please have a look at the document, download the data, train your own machine learning solution that beats our benchmark, and let us know!

Version 2.0 of custom ML solutions

Due 2023-09-30: 617 days left

How much better did we get, and how efficient are our solutions on high performance computing architectures? If you are interested, please have a look at this report.

Report on tests with a tangent linear and adjoint version of ML emulators with 4DVar

Due 2024-03-31: 800 days left

A high-risk, high-gain task! This task will develop tangent linear and adjoint versions of the ML solutions that were developed to emulate the radiation and cloud parameterization schemes of the European Centre for Medium-Range Weather Forecasts. The tangent linear and adjoint versions of the neural network emulators will be tested in analysis experiments within the 4DVar data assimilation framework, and the results will be reported here.
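
To illustrate what tangent linear and adjoint versions of a neural network look like, here is a minimal sketch for a generic one-hidden-layer tanh network with random weights. It is not the MAELSTROM or ECMWF emulator; the network shape, sizes and function names are assumptions chosen for illustration. The final lines run the standard dot-product test, which checks that the adjoint is the exact transpose of the tangent linear operator.

```python
import numpy as np


def forward(params, x):
    """One-hidden-layer network: y = W2 tanh(W1 x + b1) + b2."""
    W1, b1, W2, b2 = params
    h = np.tanh(W1 @ x + b1)
    return W2 @ h + b2, h


def tangent_linear(params, x, dx):
    """Propagate an input perturbation dx through the linearised network."""
    W1, _, W2, _ = params
    _, h = forward(params, x)
    # d tanh(z)/dz = 1 - tanh(z)^2, evaluated at the linearisation point x.
    return W2 @ ((1.0 - h**2) * (W1 @ dx))


def adjoint(params, x, y_ad):
    """Propagate an output sensitivity y_ad back to input space (transpose of TL)."""
    W1, _, W2, _ = params
    _, h = forward(params, x)
    return W1.T @ ((1.0 - h**2) * (W2.T @ y_ad))


# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
n_in, n_hidden, n_out = 5, 8, 3
params = (
    rng.normal(size=(n_hidden, n_in)),
    rng.normal(size=n_hidden),
    rng.normal(size=(n_out, n_hidden)),
    rng.normal(size=n_out),
)
x = rng.normal(size=n_in)
dx = rng.normal(size=n_in)
y_ad = rng.normal(size=n_out)

# Dot-product test: <TL dx, y_ad> must equal <dx, AD y_ad> up to round-off.
lhs = np.dot(tangent_linear(params, x, dx), y_ad)
rhs = np.dot(dx, adjoint(params, x, y_ad))
print(abs(lhs - rhs))
```

In practice, automatic differentiation frameworks can generate both operators from the forward code, but writing them out by hand shows what a data assimilation system needs from an emulator: a linearisation around the current state and its exact transpose.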

Report on the application of ML solutions within the W&C workflow

Due 2024-03-31: 800 days left

Are we ready to use machine learning solutions within operational weather and climate predictions? This report summarizes tests that run the six MAELSTROM machine learning applications as if they were used for operational predictions. The task will use the latest machine learning software framework and aims to develop production-ready products for weather and climate predictions.