half_float

16 bit floating-point data type for C++

Provides a HalfFloat class that implements all the common arithmetic operations for a 16 bit floating-point type (10 bits mantissa, 5 bits exponent and one sign bit) and can thus be used (almost) interchangeably with regular floats. Not all operations have efficient implementations (some just convert to float, compute the result and convert back again); if in doubt, check the source code.
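To make the 1/5/10 bit layout concrete, here is a minimal sketch that decodes a raw 16 bit pattern into a float. `decode_half_bits` is a hypothetical helper written for this README, not a function from umHalf.h, and it assumes the usual IEEE 754 binary16 conventions (exponent bias of 15, subnormals, implicit leading 1).

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper, NOT part of half_float: decode a raw 16 bit pattern
// laid out as 1 sign bit, 5 exponent bits and 10 mantissa bits.
// Assumes the IEEE 754 binary16 exponent bias of 15.
inline float decode_half_bits(std::uint16_t bits) {
    const int sign     = (bits >> 15) & 0x1;   // 1 sign bit
    const int exponent = (bits >> 10) & 0x1F;  // 5 exponent bits
    const int mantissa = bits & 0x3FF;         // 10 mantissa bits

    float magnitude;
    if (exponent == 0) {
        // Zero or subnormal: no implicit leading 1, value = mantissa * 2^-24.
        magnitude = std::ldexp(static_cast<float>(mantissa), -24);
    } else if (exponent == 0x1F) {
        // All exponent bits set: infinity (mantissa == 0) or NaN (mantissa != 0).
        magnitude = (mantissa == 0) ? std::numeric_limits<float>::infinity()
                                    : std::numeric_limits<float>::quiet_NaN();
    } else {
        // Normal number: (1024 + mantissa) * 2^(exponent - 15 - 10).
        magnitude = std::ldexp(static_cast<float>(1024 + mantissa), exponent - 25);
    }
    return sign ? -magnitude : magnitude;
}
```

With these assumptions, `decode_half_bits(0x3C00)` yields 1.0f and `decode_half_bits(0x7BFF)` yields 65504.0f, the largest finite value of the format.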

The implementation tries to adhere to IEEE 754 in that it supports NaN and Infinity, but deviates from the standard in other respects:

no difference between quiet NaN (qNaN) and signaling NaN (sNaN)
no traps
no well-defined rounding mode
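As a rough illustration of what NaN and Infinity look like at the bit level, the sketch below classifies raw 16 bit patterns. The function names are hypothetical and not taken from umHalf.h. The quiet-bit test follows the IEEE 754 recommendation (top mantissa bit set means quiet); since this library does not distinguish qNaN from sNaN, that last helper only shows what the distinction would be.

```cpp
#include <cstdint>

// Hypothetical helpers, NOT part of half_float. They operate on raw
// binary16 bit patterns (1 sign bit, 5 exponent bits, 10 mantissa bits).

// Infinity: exponent bits all ones, mantissa zero (sign ignored).
inline bool half_bits_is_inf(std::uint16_t bits) {
    return (bits & 0x7FFF) == 0x7C00;
}

// NaN: exponent bits all ones, mantissa non-zero (sign ignored).
inline bool half_bits_is_nan(std::uint16_t bits) {
    return (bits & 0x7FFF) > 0x7C00;
}

// Assumption: the most significant mantissa bit marks a quiet NaN, as
// recommended by IEEE 754. half_float itself does not make this distinction.
inline bool half_bits_is_quiet_nan(std::uint16_t bits) {
    return half_bits_is_nan(bits) && (bits & 0x0200) != 0;
}
```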

We also supply a specialization for std::numeric_limits<half> so that half can be used in template code that depends on type traits.
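The kind of template code this is meant for might look like the sketch below. `within_epsilon` is a hypothetical function, not part of the library; it only relies on comparison, subtraction and `std::numeric_limits<T>::epsilon()`. It is shown here with the built-in types. Instantiating it with `half` would additionally require including umHalf.h and assumes the specialization reports a meaningful `epsilon()`.

```cpp
#include <limits>

// Hypothetical generic helper, NOT part of half_float: returns true if a and b
// differ by at most the machine epsilon reported by std::numeric_limits<T>.
template <typename T>
bool within_epsilon(T a, T b) {
    const T diff = (a > b) ? (a - b) : (b - a);  // absolute difference
    return diff <= std::numeric_limits<T>::epsilon();
}
```

Because the function never names a concrete type, the same code works for `float`, `double` and, under the assumptions above, `half`.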

Usage

 // get some halfs (half is a typedef for HalfFloat)
 half a = 1.0f;
 half b = 0.5f;
 
 // and have some FUN
 half c = (a+b) / (a-b);
 ++c;
 
 // now that we have a result in lossy precision,
 // convert it back to double precision.
 // if anybody asks, it's for the lulz.
 double result = c;

Credits to Chris Maiwald for the conversion code to double and extensive testing.

License

3-clause BSD license: use it for anything, but give credit, don't blame us if your rocket crashes and don't advertise with it (who would).
