retrieve databundle depends on build cutout settings #853

martacki · 2023-09-05T15:31:31Z

Checklist

I am using the current main branch or the latest release. Please indicate.
I am running on an up-to-date pypsa-earth environment. Update via conda env update -f envs/environment.yaml.

Describe the Bug

When rule retrieve_databundle_light is executed, while build_cutout is set to False, it tries to download the file cutouts/cutout-2013-era5 which eventually fails.
I'm not sure if this is intentional, but it is very annoying and hard to spot. Build_cutout at this stage is not even executed, and the cutout is not needed.

Maybe I'm misinterpreting some intentional behavior here, but I'm sure there is a bug somewhere because retrieve_databundle_light should execute regardless of the build_cutout settings, in my opinion.

Error Message

MissingOutputException in rule retrieve_databundle_light in file */pypsa-earth/Snakefile, line 147:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
cutouts/cutout-2013-era5.nc

The text was updated successfully, but these errors were encountered:

martacki · 2023-09-05T15:37:59Z

Not sure if this links to #812

davide-f · 2023-09-05T15:45:25Z

Hello! Thanks for posting!
What country were you testing?
May you have the complete log of the log?
Sometimes for regions outside Africa, Google drive, the only source of those files, limits the number of downloads and may cause that issue

Emre-Yorat89 · 2023-09-11T14:34:38Z

I suppose I have the same issue for Türkiye. The retrieve databundle fetches sandbox links. However, it does not even download cutout bundles which has google drive links. It directly gives below error. I can download cutout bundles manually therefore number of download limit should not be the reason for the error I believe. I am also not sure if it is connected to the build cutout setting in the config file.

ekatef · 2023-09-11T18:29:26Z

Thanks a lot for reporting, @martacki and @Emre-Yorat89!

I can reproduce the issue for Türkiye (@Emre-Yorat89 thank you so much for providing the detailed analysis of the issue!). The problem is in fact linked with loading from google drive and caused by the fact that gdd.download_file_from_google_drive() returns an empty zip file which leads to further troubles when trying to unzip it.

Not sure if it is connected with a daily quota, as in this case we should have 403 error, according to google documentation. Can it be probably the case that google has changed the behaviour but not updated the docs? 🤔

As for the effect of build_cutout, setting build_cutout: true by-passes loading the cutout, which is currently the only data type loaded from google drive instead of zenodo.

As a temporal fix it can be suggested to load the cutout manually using urls specified in configs/bundle_config.yaml

Emre-Yorat89 · 2023-09-11T20:54:58Z

Hello,
I have made a couple of simple experiments with the googledrivedownloader package with the below code. When I first tried it the downloaded was a corrupt zip file. After changing the sharing option from "Restricted" to "Anyone with the link" on google drive solved the issue. Hopefully this is also the case for our problem.

ekatef · 2023-09-11T21:04:58Z

Hello, I have made a couple of simple experiments with the googledrivedownloader package with the below code. When I first tried it the downloaded was a corrupt zip file. After changing the sharing option from "Restricted" to "Anyone with the link" on google drive solved the issue. Hopefully this is also the case for our problem.

Thanks for testing @Emre-Yorat89! Have checked "General access" options for bundle_cutouts_northamerica and bundle_cutouts_asia, and it looks like sharing by link is on: Anyone with link corresponds to Viewer rights. Which should also allow to download file... Although, I feel that your idea leads to a right direction.

ekatef · 2023-09-13T20:47:24Z

Update after some additional testing: the reason of the troubles seems to be in fact a number of downloads. While an initial request to gdisk returns status 200 (== everything is fine), an authorised request

https://github.com/ndrplz/google-drive-downloader/blob/be1aba9e2e43b2375475f19d8214ca50a8621bd6/google_drive_downloader/google_drive_downloader.py#L58-L61

returns 429 which means exactly too many requests.

At the time being, a quick fix is to load a cutout file manually by the links provides in /configs/bundle_config.yaml

Would be probably nice to add a check of server status response and add a meaningful warning or error.

ekatef · 2024-01-06T12:57:27Z

Hello @martacki! Thank you for reporting this issue. It has been investigated in more details by #866 and fixed by #911. So, it data retrival should work properly now. Do you have any additionally comments or can we count this issue as completed? 🙂

martacki added the bug Something isn't working label Sep 5, 2023

ekatef mentioned this issue Sep 13, 2023

Add Western asia databundle #837

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

retrieve databundle depends on build cutout settings #853

retrieve databundle depends on build cutout settings #853

martacki commented Sep 5, 2023

martacki commented Sep 5, 2023

davide-f commented Sep 5, 2023

Emre-Yorat89 commented Sep 11, 2023

ekatef commented Sep 11, 2023

Emre-Yorat89 commented Sep 11, 2023

ekatef commented Sep 11, 2023

ekatef commented Sep 13, 2023 •

edited

Loading

ekatef commented Jan 6, 2024

retrieve databundle depends on build cutout settings #853

retrieve databundle depends on build cutout settings #853

Comments

martacki commented Sep 5, 2023

Checklist

Describe the Bug

Error Message

martacki commented Sep 5, 2023

davide-f commented Sep 5, 2023

Emre-Yorat89 commented Sep 11, 2023

ekatef commented Sep 11, 2023

Emre-Yorat89 commented Sep 11, 2023

ekatef commented Sep 11, 2023

ekatef commented Sep 13, 2023 • edited Loading

ekatef commented Jan 6, 2024

ekatef commented Sep 13, 2023 •

edited

Loading