You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I ran my model for 6-8 weeks on my 1080 Ti. I created just about the
largest model that I could fit in GPU memory and still run decent batch
size and sequence lengths, so it took a long time to train. Entirely
possible that you could get comparable results with a smaller model and
less training time, or maybe the same model with better hyperparameter
tuning.
How long is needed to train from scratch on fresh data? Given the ti 1080 GPU setup
The text was updated successfully, but these errors were encountered: