We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
just creating a todo. large batch sizes work now having fixed the size_t bug:
size_t
./train_gpt2cu -b 36 -v 200 -s 200 -i data/TinyStories
works, but 48 should fit but doesn't work
./train_gpt2cu -b 48 -v 200 -s 200 -i data/TinyStories
val loss is -nan and train loss stays at inf.
todo track down why and how to prevent
The text was updated successfully, but these errors were encountered:
No branches or pull requests
just creating a todo. large batch sizes work now having fixed the
size_t
bug:works, but 48 should fit but doesn't work
val loss is -nan and train loss stays at inf.
todo track down why and how to prevent
The text was updated successfully, but these errors were encountered: