Fastest implementations of 32-bit and 64-bit integer square roots for x86-64

komrad36/FastIntegerSqrt

Fastest implementations, by a significant margin, of 32-bit and 64-bit integer square roots and perfect-square queries for x86-64.

The square roots truncate, i.e. round down (IntegerSqrt(8) -> 2).
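As a point of comparison for the truncating behaviour, here is a minimal portable sketch that finds the largest r with r * r <= n one bit at a time, and uses it to test for perfect squares. The names `ReferenceSqrt` and `ReferenceIsPerfectSquare` are hypothetical and this is not the repository's implementation; it is only a slow reference that the fast versions should agree with.

```cpp
#include <cstdint>

// Portable reference: largest r with r * r <= n, built bit by bit.
// Hypothetical helper, not the repository's code.
std::uint32_t ReferenceSqrt(std::uint64_t n) {
    std::uint64_t r = 0;
    for (int bit = 31; bit >= 0; --bit) {
        const std::uint64_t candidate = r | (std::uint64_t{1} << bit);
        // candidate < 2^32, so candidate * candidate fits in 64 bits.
        if (candidate * candidate <= n) r = candidate;
    }
    return static_cast<std::uint32_t>(r);
}

// n is a perfect square exactly when the truncated root squares back to n.
bool ReferenceIsPerfectSquare(std::uint64_t n) {
    const std::uint64_t r = ReferenceSqrt(n);
    return r * r == n;
}
```

For example, `ReferenceSqrt(8)` yields 2 and `ReferenceIsPerfectSquare(8)` is false, matching the round-down rule stated above.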

AVX is recommended; SSE3 is required. To drop the requirement down to SSE2, haddpd can be replaced with a shuffle followed by addsd (e.g. unpckhpd/addsd; permilpd exists only as the AVX instruction vpermilpd).
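The SSE2 fallback could look roughly like the sketch below, which sums the two double lanes of a vector with a shuffle and a scalar add instead of a horizontal add. It assumes an x86-64 target, where SSE2 is always available. The function name `HorizontalSumSse2` is hypothetical, and how the repository actually uses haddpd is not shown here.

```cpp
#include <emmintrin.h>  // SSE2 intrinsics

// Sum of the two double lanes of v using only SSE2 instructions.
// With SSE3, the same value is the low lane of _mm_hadd_pd(v, v).
double HorizontalSumSse2(__m128d v) {
    const __m128d high = _mm_unpackhi_pd(v, v);  // unpckhpd: both lanes hold v[1]
    return _mm_cvtsd_f64(_mm_add_sd(v, high));   // addsd: v[0] + v[1] in the low lane
}
```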

The I32/U32 versions work regardless of the FPU rounding mode. The I64/U64 versions require the FPU to be in round-to-nearest mode (which is the default).
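To illustrate why the two widths behave differently, here is a hedged sketch of a floating-point approach; it is not the repository's code and all names are hypothetical. Every 32-bit value converts to a double exactly, and the root of a non-square stays far more than one ulp away from the next integer, so truncating is safe in any rounding mode. A 64-bit value may be rounded during conversion, so this sketch adds an integer correction step; the repository instead relies on round-to-nearest, which `RoundingModeIsNearest` can check.

```cpp
#include <cfenv>
#include <cmath>
#include <cstdint>

// True when the floating-point environment is in round-to-nearest mode.
bool RoundingModeIsNearest() { return std::fegetround() == FE_TONEAREST; }

// 32-bit: the conversion is exact, so truncating the double root is enough.
std::uint32_t SketchSqrtU32(std::uint32_t x) {
    return static_cast<std::uint32_t>(std::sqrt(static_cast<double>(x)));
}

// 64-bit: the conversion to double may round x, so fix up the estimate.
std::uint64_t SketchSqrtU64(std::uint64_t x) {
    if (x == 0) return 0;
    auto r = static_cast<std::uint64_t>(std::sqrt(static_cast<double>(x)));
    // Division-based comparisons avoid overflow of r * r near 2^64.
    while (r > x / r) --r;             // r * r > x: estimate too high
    while (r + 1 <= x / (r + 1)) ++r;  // (r + 1)^2 <= x: estimate too low
    return r;
}
```

The correction loops trade a few integer divisions for independence from the rounding mode, which is a different design point from the one the repository describes.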
