Support for bfloat16 #7527

Open
philippwitte opened this issue Apr 21, 2023 · 8 comments

@philippwitte

Description

Are there plans to support the bfloat16 data type in the near future? This data type is becoming increasingly popular in LLM training. It looks like it is currently not supported: calling y = cp.asarray(x), where x is a torch tensor of type torch.bfloat16, raises "TypeError: Got unsupported ScalarType BFloat16". Are there any recommended workarounds in the meantime?
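
For reference, a minimal reproduction of the reported error (assuming a CUDA build of PyTorch and CuPy; the exact message may differ by version):

import cupy as cp
import torch

x = torch.arange(10, dtype=torch.bfloat16, device="cuda")
y = cp.asarray(x)   # raises TypeError: Got unsupported ScalarType BFloat16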


@philippwitte philippwitte added the cat:feature New features/APIs label Apr 21, 2023
@leofang
Member

leofang commented Apr 21, 2023

Curious where/how you would use bf16 if CuPy were to support it? Any pointers or references? Thanks! 🙂

@jglaser
Contributor

jglaser commented Sep 12, 2023

It would be good if NumPy data type extensions à la https://github.com/jax-ml/ml_dtypes/tree/main were supported, which include bfloat16, fp8, etc.
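
For context, a rough sketch of what ml_dtypes already provides on the NumPy side (illustrative only; the exact set of dtypes depends on the ml_dtypes version):

import numpy as np
import ml_dtypes

# ml_dtypes registers bfloat16, float8_e4m3fn, etc. as NumPy extension dtypes
x = np.arange(10, dtype=np.float32).astype(ml_dtypes.bfloat16)
print(x.dtype)                # bfloat16
print(x.astype(np.float32))   # values 0..9 survive the round trip exactly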

@guberti

guberti commented Sep 18, 2023

Seconding this! bfloat16 and fp8 support are important for my use case. I'd love to see these.

@wuxibin89

Any progress on this? We really need it for LLM training and inference.

@borisfom

bfloat16 support is sorely missed in CuPy. We would really appreciate it getting fixed!
We are currently forced to work around it like this (thankfully we have torch.view):

import cupy
import torch

x = torch.arange(10, dtype=torch.bfloat16, device="cuda")
print(x)
# tensor([0., 1., 2., 3., 4., 5., 6., 7., 8., 9.], device='cuda:0',
#        dtype=torch.bfloat16)

# view as uint8
y = x.view(dtype=torch.uint8)

array_size_in_bytes = y.nelement() * y.element_size()

# wrap the uint8 view's device pointer in a zero-copy CuPy array
mem = cupy.cuda.UnownedMemory(y.data_ptr(), array_size_in_bytes, owner=None)
memptr = cupy.cuda.MemoryPointer(mem, offset=0)
arr = cupy.ndarray(y.size(), dtype=cupy.uint8, memptr=memptr)
out = torch.as_tensor(arr, device=x.device, dtype=torch.uint8)
print(out)
# tensor([  0,   0, 128,  63,   0,  64,  64,  64, 128,  64, 160,  64, 192,  64,
#         224,  64,   0,  65,  16,  65], device='cuda:0', dtype=torch.uint8)

# view as bfloat16 again
out = out.view(x.dtype)
print(out)
# tensor([0., 1., 2., 3., 4., 5., 6., 7., 8., 9.], device='cuda:0',
#        dtype=torch.bfloat16)
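
A possibly simpler variant of the same bit-reinterpretation workaround, assuming a PyTorch/CuPy pair recent enough to exchange buffers via DLPack (an illustrative sketch, not an official bfloat16 API):

import cupy
import torch

x = torch.arange(10, dtype=torch.bfloat16, device="cuda")

# reinterpret the bfloat16 buffer as int16 (same width, a dtype CuPy understands)
bits = x.view(torch.int16)
arr = cupy.from_dlpack(bits)   # zero-copy handoff to CuPy

# ... run integer/bit-level CuPy code on arr here ...

back = torch.from_dlpack(arr).view(torch.bfloat16)
print(torch.equal(back, x))    # True: the bits round-trip unchanged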

@yuanlin2004

I see (in #8269) that the bfloat16 feature is planned for the v14 release. @asi1024, is there a WIP branch that others can play with or help with if needed?

stephanie-wang added a commit to ray-project/ray that referenced this issue May 15, 2024
[bfloat16](https://en.wikipedia.org/wiki/Bfloat16_floating-point_format)
is widely used in LLM training and inference since it can achieve higher
throughput and is less prone to weight growth. ray.util.collective uses
cupy.cuda.nccl for GPU communication, but cupy doesn't currently support
bfloat16 (cupy/cupy#7527). So for allgather/reducescatter operations, we
should bypass cupy.array and use torch.tensor directly.

Signed-off-by: wuxibin <wuxibin89@163.com>
Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
@dakofler
Contributor

dakofler commented Aug 6, 2024

Would also love this!

@CloudyDory

We are using a spiking neural network training library that implements custom CuPy functions for forward and backward propagation. The fact that CuPy lacks bfloat16 support is a real pain for us. I would highly appreciate any progress on this issue.
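
Until native support lands, one possible sketch for custom kernels is to pass the bfloat16 buffer as uint16 and reinterpret it as __nv_bfloat16 inside a RawKernel. This assumes cuda_bf16.h is visible to NVRTC; the kernel name (scale_bf16) and the launch shapes are illustrative only:

import cupy

# illustrative kernel: scale a bfloat16 buffer (passed as uint16) by a float factor
scale_bf16 = cupy.RawKernel(r'''
#include <cuda_bf16.h>
extern "C" __global__
void scale_bf16(unsigned short* data, float factor, int n) {
    int i = blockDim.x * blockIdx.x + threadIdx.x;
    if (i < n) {
        __nv_bfloat16* p = reinterpret_cast<__nv_bfloat16*>(data);
        p[i] = __float2bfloat16(__bfloat162float(p[i]) * factor);
    }
}
''', 'scale_bf16')

n = 1024
x = cupy.full(n, 0x3F80, dtype=cupy.uint16)   # 0x3F80 is 1.0 in bfloat16
scale_bf16((n // 256,), (256,), (x, cupy.float32(2.0), cupy.int32(n)))
print(hex(int(x[0])))                         # 0x4000, i.e. 2.0 in bfloat16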
