
BFLOAT16 support #3148

@daurnimator

Description

BFLOAT16 is a new 16-bit floating-point format designed for deep learning. It has an 8-bit exponent and a 7-bit mantissa (vs. the 5-bit exponent and 10-bit mantissa of an IEEE half-precision float, which Zig already exposes as f16). In effect, a bfloat16 value is an IEEE binary32 value with the low 16 mantissa bits dropped.
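
To make the layout concrete: a bfloat16 bit pattern can be produced from a binary32 value by keeping only the high 16 bits. The snippet below is a minimal sketch in present-day Zig; the helper names are made up for the example, and it truncates (rounds toward zero) rather than rounding to nearest as real hardware conversions typically do.

```zig
const std = @import("std");

// Sketch only: bfloat16 is the top 16 bits of an IEEE binary32 value
// (1 sign bit, 8 exponent bits, 7 mantissa bits). These helpers are
// hypothetical names, not part of any proposed API.
fn f32ToBf16Bits(x: f32) u16 {
    const bits: u32 = @bitCast(x);
    const hi: u16 = @truncate(bits >> 16); // drop the low 16 mantissa bits
    return hi;
}

fn bf16BitsToF32(bits: u16) f32 {
    const wide: u32 = @as(u32, bits) << 16; // low mantissa bits become zero
    return @bitCast(wide);
}

test "round trip is exact when the value fits in 7 mantissa bits" {
    const x: f32 = 3.140625; // 1.1001001 * 2^1 -- exactly 7 mantissa bits
    try std.testing.expectEqual(x, bf16BitsToF32(f32ToBf16Bits(x)));
}
```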

The bfloat16 format is used in upcoming Intel AI processors (such as the Nervana NNP-L1000), Xeon processors, and Intel FPGAs, as well as in Google Cloud TPUs and TensorFlow. Arm Neon and SVE also support the bfloat16 format.

Selected excerpts:

  • The Rust proposal is to call the type f16b.
  • The type should always have size 2 and alignment 2 on all platforms (see the sketch after this list).
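
The layout bullet can be expressed with `@sizeOf`/`@alignOf`. Since the proposed type does not exist yet, the sketch below uses the existing `f16` as a stand-in; the proposal asks that a bfloat16 builtin give the same answers on every target.

```zig
const std = @import("std");

// Illustration of the "size 2, alignment 2 on all platforms" requirement,
// checked here against the existing f16 type as a stand-in. A bfloat16
// builtin would be expected to satisfy the same two checks on every target.
test "16-bit float layout" {
    try std.testing.expectEqual(@as(usize, 2), @sizeOf(f16));
    try std.testing.expectEqual(@as(usize, 2), @alignOf(f16));
}
```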

As a more general issue: how should we add new numeric types going forward? e.g. Unum. Since Zig does not support operator overloading, such types would have to be provided by the core for ergonomic use.
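
To make the ergonomics point concrete: a userland bfloat16 has to spell every operation as a function call, whereas a core type would get `+` and friends for free. The `Bf16` wrapper below is entirely hypothetical, just to show the contrast.

```zig
const std = @import("std");

// Hypothetical userland bfloat16: without operator overloading, every
// operation becomes a method call instead of `a + b`.
const Bf16 = struct {
    bits: u16,

    fn fromF32(x: f32) Bf16 {
        const u: u32 = @bitCast(x);
        const hi: u16 = @truncate(u >> 16);
        return .{ .bits = hi };
    }

    fn toF32(self: Bf16) f32 {
        const wide: u32 = @as(u32, self.bits) << 16;
        return @bitCast(wide);
    }

    // Compute in f32, then truncate back to bfloat16.
    fn add(a: Bf16, b: Bf16) Bf16 {
        return fromF32(a.toF32() + b.toF32());
    }
};

test "userland arithmetic is call-heavy" {
    const a = Bf16.fromF32(1.5);
    const b = Bf16.fromF32(2.25);
    // With a core type this would simply be `a + b`.
    try std.testing.expectEqual(@as(f32, 3.75), Bf16.add(a, b).toF32());
}
```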
