A library that encodes 3 to 16 bits wide floating-point numbers.
-
Updated
Feb 14, 2022 - Go
A library that encodes 3 to 16 bits wide floating-point numbers.
Half-precision 16-bit floating point numbers
An implementation of the Subleq OISC using only linear operations on half-precision (16 bit) IEEE-754 floats (and a loop).
Fast Half precision Floating point operations for C++
Implement arithmetic operations to handle half-precision numbers in MIPS instructions.
Half-precision assembly interface for C
Emulating binary, half-precision IEEE-754 (2008) floats
FP16 pseudo random number generator on GPU
Cube root of half-precision floating-point epsilon.
The DYM Math Library for Graphics and Game Programming
Square root of half-precision floating-point epsilon.
Size (in bytes) of a half-precision floating-point number.
Half-precision floating-point mathematical constants.
Basic linear algebra routines implemented using the chop rounding function
Fast SGEMM emulation on Tensor Cores
Half-Precision Floating-Point for Delphi
Swift Half-Precision Floating Point
C++ template library for floating point operations
Optimised Caffe with OpenCL supporting for less powerful devices such as mobile phones
Add a description, image, and links to the half-precision topic page so that developers can more easily learn about it.
To associate your repository with the half-precision topic, visit your repo's landing page and select "manage topics."