
bfloat16 grads are not supported #25

Open
kurumuz opened this issue May 13, 2022 · 0 comments
kurumuz commented May 13, 2022

Are there any plans to support models/grads with the bfloat16 type? bfloat16 has gained quite a bit of popularity lately, since every Ampere GPU supports it and it eliminates the need for loss scaling compared to float16.
This is what I get when I try to initialize bnb.AdamW with a bfloat16-cast model:
ValueError: Gradient+optimizer bit data type combination not supported: grad torch.bfloat16, optimizer torch.uint8
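For context, a minimal repro sketch (not from the original report): it assumes the 8-bit AdamW variant `bnb.optim.AdamW8bit`, a CUDA device, and that the error is raised on the first optimizer step, when the uint8 optimizer state meets bfloat16 gradients.

```python
# Hypothetical repro sketch; module/class names are assumptions, not taken from the report.
import torch
import torch.nn as nn
import bitsandbytes as bnb

# Cast the whole model to bfloat16, so its gradients will also be bfloat16.
model = nn.Linear(64, 64).cuda().to(torch.bfloat16)

# 8-bit optimizer: its internal state tensors are torch.uint8.
optimizer = bnb.optim.AdamW8bit(model.parameters(), lr=1e-3)

x = torch.randn(8, 64, device="cuda", dtype=torch.bfloat16)
loss = model(x).float().sum()
loss.backward()    # parameter grads are torch.bfloat16

# Expected to raise:
# ValueError: Gradient+optimizer bit data type combination not supported:
#   grad torch.bfloat16, optimizer torch.uint8
optimizer.step()
```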
