Skip to content

metno/ESGF_download

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

75 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ESGF_download

shell scripts for downloading CMIP6 climate model data from ESGF used at the Norwegian Meteorological Institute in conjunction with the KeyClim project.

Description

These tools query the ESGF API for bash scripts to download CMIP6 climate model data. This script generation is a function of the ESGF API called wget script. This query is done specific enough (one script for each experiment, variable combination) so that these scripts can be run in parallel to achieve a decent over all download speed. On an Norwegian infrastructure (NIRD) we get ~20Mb/sec per file, that does not go down on the file level if the scripts are run in parallel with 8 threads.

The output of the main script is a text file with commands to run that can be used to run e.g. on a cluster or using parallel on a normal multiprocessor machine. These commands include logging the output of the download scripts into a text file (one per download script; only stdout at this point)

At this time the scripts are rather simple, are not very user friendly and need you to think before you type. They are also rather specific to be used for the KeyClim project at the Norwegian national infrastructure called NIRD. Nevertheless the scripts should be simple enough for an experienced shell user to useful somewhere else as well.

Note that these scripts will download several TB of data per day.

Prerquests

  • Linux box
  • bash
  • wget
  • Java 9 (for handling of the certificate based downloading; see wiki for details)
  • enough space to store your data
  • an ESGF OpenID (publically available)

Limitations

As a standard the download scripts created by th ESGF API use certificates for authentication. These need to be renewed after a short periond of time (3 days?). While the download script can renew the certificate automatically, it needs at least Java 9 to do so (and the user ro privide the password of her / his open ID). Unfortunately the latest JDK available on NIRD is Java 8 which causes the certificate renewal to fail. The workaround is to run one of the download scripts at your local machine with Java 9, perform the certificate renewal and then kill the download script. Then rsync the folder ~/.esg to NIRD.

About

tools for downloading CMIP6 climate model data from ESGF

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages