-
Notifications
You must be signed in to change notification settings - Fork 60
Open
Description
There are currently a few subset's of the ERA5 dataset on cloud storage (example but none are complete or updated regularly. It wont be a trivial recipe to implement with Pangeo-Forge but it would be a good stretch goal to support such a dataset.
Source Dataset
- Link to the website / online documentation for the data: https://www.ecmwf.int/en/forecasts/datasets/reanalysis-datasets/era5, https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-pressure-levels?tab=overview
- The file format: NetCDF/GRIB
- How are the source files organized? One file per api request.
- How are the source files accessed: Copernicus Climate Data Store, API Request (https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-pressure-levels?tab=form)
- Any special steps required to access the data: access through the
cdsapiorcdstoolboxapis
Transformation / Alignment / Merging
Most likely, the best way to access and arrange the data is in 1-day chunks, concatenating along the time dimension. Given the large user pool for this dataset, I would suggest this recipe does as little data processing as possible.
Output Dataset
One (or more?) Zarr stores. Hourly data for all available variables, all pressure levels, etc.
Reactions are currently unavailable