Required training time #41

Closed
adamjstewart opened this issue Mar 12, 2024 · 3 comments

Comments

@adamjstewart

In your paper, you mention that the model is pre-trained on 80 V100s. How long was the model trained for? I'm working on a review paper and would like an estimate of training time.

@tung-nd
Collaborator

tung-nd commented Mar 15, 2024

Hi,

Thank you for your interest in ClimaX. I do not remember the training time exactly, but it was definitely less than 1 day (24 hours).

@adamjstewart
Author

Thanks, I'll use 24 hrs as an estimate for now.

@rejuvyesh
Collaborator

rejuvyesh commented Mar 15, 2024

Hi @adamjstewart! Small correction: I would put the training time at 3 days or more. The initial CMIP6 pretraining phase was about 200k steps, and finetuning on ERA5 then ran for about 100k steps, which is a fair amount of compute time. There is also some time overhead from computing various validation metrics at regular intervals to make sure training is progressing as expected.

There have been quite a few performance fixes since the initial training and overall it should be much faster to train now.
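For a review paper, the figures in this thread can be turned into a rough compute estimate. The sketch below is only back-of-the-envelope arithmetic: it assumes all 80 V100s were in use for the full 3 days and that the 3-day figure covers both the 200k pretraining steps and the 100k finetuning steps, neither of which the thread confirms. The function and variable names are illustrative.

```python
# Rough compute estimate from the numbers quoted in this thread.
# ASSUMPTIONS (not confirmed by the maintainers):
#   - all 80 V100s were busy for the whole run
#   - the "3 days" figure covers pretraining and finetuning together


def gpu_hours(num_gpus: int, wall_clock_days: float) -> float:
    """Total GPU-hours for a run of the given wall-clock length."""
    return num_gpus * wall_clock_days * 24.0


NUM_GPUS = 80                      # from the paper, per the original question
MIN_DAYS = 3.0                     # lower bound given in the thread
TOTAL_STEPS = 200_000 + 100_000    # CMIP6 pretraining + ERA5 finetuning

# Because 3 days is a lower bound on time, this is a lower bound on compute.
lower_bound_gpu_hours = gpu_hours(NUM_GPUS, MIN_DAYS)

# Likewise, the implied throughput is an upper bound on the average rate.
implied_steps_per_sec = TOTAL_STEPS / (MIN_DAYS * 24 * 3600)

print(f"Lower bound on compute: {lower_bound_gpu_hours:.0f} V100 GPU-hours")
print(f"Implied average throughput: at most {implied_steps_per_sec:.2f} steps/s")
```

Under these assumptions the run comes to no less than 5,760 V100 GPU-hours. The validation overhead and later performance fixes mentioned above mean a retraining today could differ substantially from this figure.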
