Skip to content
This repository has been archived by the owner on Nov 29, 2023. It is now read-only.

Pretrain on data from Weather4cast #100

Open
jacobbieker opened this issue Oct 5, 2021 · 4 comments
Open

Pretrain on data from Weather4cast #100

jacobbieker opened this issue Oct 5, 2021 · 4 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@jacobbieker
Copy link
Member

Detailed Description

https://www.iarai.ac.at/weather4cast/ (on GitHub here: https://github.com/iarai/weather4cast) has some nice weather data from around the world, including cloud masks, all at 4km resolution, so similar to what EUMETSAT gives. This could be useful for pretraining any of the models before finetuning more on our specific data.

Context

Pretraining has been proven to help quite a bit in large models, so this might help there.

Interestingly, they use MSE as their loss function for it all, even though its a video prediction task, and they want the next 8 hours of data predicted. So maybe MSE isn't the worst?

Possible Implementation

Download the data, and run some models on it.

@jacobbieker jacobbieker added enhancement New feature or request good first issue Good for newcomers labels Oct 5, 2021
@jacobbieker
Copy link
Member Author

Actually, this is EUMETSAT data! Including some of the optimum cloud masks, and I am assuming the 15 minute full disk images. So for transfer learning this would actually probably be quite useful?

@jacobbieker
Copy link
Member Author

jacobbieker commented Mar 11, 2022 via email

@codeastra2
Copy link

codeastra2 commented Mar 11, 2022

So to confirm https://github.com/iarai/weather4cast/blob/master/utils/1.%20Onboarding.ipynb these are the steps that need to be done and the h5 files need to be uploaded? If so where should they be uploaded?

@jacobbieker
Copy link
Member Author

Yeah, that would be great! Ideally, they could be uploaded to HuggingFace Datasets so they are easily widely available for anyone to use.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants