In the present repository, you can find the materials for the paper
Ayzel G., Heistermann M. The effect of calibration data length on the performance of conceptual versus data-driven hydrological models.
which was submitted to Computers & Geosciences.
In this study, we want to investigate the effect of calibration data length on the validation performance of different rainfall-runoff models. To this aim, we consequently increase the calibration data length from one to twenty calendar years and investigate how that affects the model skill on a hold-out (validation) period.
We use five models for runoff prediction at hourly temporal resolution:
one conceptual
- GR4H -- a conceptual hydrological model. It is a derivative from the GR4J model -- the version for runoff prediction at daily temporal resolution.
and four data-driven models which differ by the type of the computational layer used
- Multi-Layer Perceptron (MLP)
- Recurrent Neural Network (RNN)
- Long Short-Term Memory Network (LSTM)
- Gated Recurrent Units Network (GRU)
The code is written in Python programming language (v3.6) using open-source software libraries, such as numpy, pandas, scipy, numba, tensorflow, and keras. The analysis of obtained results was done also using jupyter notebooks and matplotlib plotting library.
You can install all the required dependencies using conda -- an open-source package management system. First, install conda itself, then use the provided environment.yml
file to create the isolated environment:
conda env create -f environment.yml
There are three files in the code
directory:
- describes the workflow for the main calibration/validation experiment.
- calculates and aggregates the evaluation metrics based on obtained results of streamflow simulation .
- represents the analysis of the effect of calibration data length on the performance of hydrological models.
There are two files in the models
directory:
gr4h.py
- holds the code for the GR4H hydrological model.
anns.py
- holds the code for generating data-driven hydrological models based on different architectures of artificial neural networks: MLP, RNN, LSTM, and GRU.
These files are used as modules in experiment.py
.
Two files aggregate evaluation metrics for the calibration and validation periods: summary_calibration.npy
and summary_validation.npy
, respectively. The figures
subfolder consists of figures that were generated using the analysis.ipynb
jupyter notebook.
Unfortunately, we cannot directly provide the compiled dataset we use for the presented study due to license restrictions from data suppliers. Thus, the discharge data for Rimbaud River as well as rain gauge records for the area Real Collobrier should request it from the RECOVER research laboratory at INRAE (https://www6.paca.inrae.fr/recover). The data for the SAFRAN reanalysis is also available upon request from Meteo France.
Please, contact us in case you need additional details or support for retrieving the data.