This repository has been archived by the owner on Apr 2, 2021. It is now read-only.

In depth SIMD level

Jordan Peck edited this page Mar 16, 2017 · 11 revisions

CPU instruction support

SSE2

  • Intel Pentium 4 - 2001
  • AMD Opteron/Athlon - 2003

SSE4.1

  • Intel Penryn - 2007
  • AMD Bulldozer - Q4 2011

AVX

  • Intel Sandy Bridge - Q1 2011
  • AMD Bulldozer - Q4 2011

AVX2

  • Intel Haswell - Q2 2013
  • AMD Carrizo - Q2 2015

FMA3

  • Intel Haswell - Q2 2013
  • AMD Piledriver - 2012

AVX512

  • Not available to consumers yet
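
The support lists above suggest a runtime check that picks the highest SIMD level the host CPU offers. The following is a minimal sketch of that idea, not the library's own detection code. The names `SimdLevel`, `CpuFeatures`, `detectFeatures` and `selectLevel` are hypothetical. It assumes a GCC or Clang x86 build for the `__builtin_cpu_supports` calls, and it assumes the AVX2 path also relies on FMA3, so both must be present before AVX2 is chosen.

```cpp
// Hypothetical names for illustration; this is not the library's API.
enum class SimdLevel { None, SSE2, SSE41, AVX2, AVX512 };

struct CpuFeatures {
    bool sse2 = false;
    bool sse41 = false;
    bool avx2 = false;
    bool fma = false;
    bool avx512f = false;
};

// Query the host CPU. On compilers or targets without the GCC/Clang x86
// builtins, every flag stays false and the caller falls back to SimdLevel::None.
CpuFeatures detectFeatures() {
    CpuFeatures f;
#if (defined(__GNUC__) || defined(__clang__)) && (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();
    f.sse2    = __builtin_cpu_supports("sse2") != 0;
    f.sse41   = __builtin_cpu_supports("sse4.1") != 0;
    f.avx2    = __builtin_cpu_supports("avx2") != 0;
    f.fma     = __builtin_cpu_supports("fma") != 0;
    f.avx512f = __builtin_cpu_supports("avx512f") != 0;
#endif
    return f;
}

// Walk up the levels and stop at the first missing prerequisite.
// Assumption of this sketch: AVX2 is only selected when FMA3 is also present.
SimdLevel selectLevel(const CpuFeatures& f) {
    if (!f.sse2)           return SimdLevel::None;
    if (!f.sse41)          return SimdLevel::SSE2;
    if (!f.avx2 || !f.fma) return SimdLevel::SSE41;
    if (!f.avx512f)        return SimdLevel::AVX2;
    return SimdLevel::AVX512;
}
```

Calling `selectLevel(detectFeatures())` once at startup would be enough to choose a code path. Keeping the selection logic separate from the detection makes it easy to test with hand-filled feature flags.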

Performance Comparisons

Timings below are in microseconds (x1000 ns) and measure how long it takes to generate a 32x32x32 block of noise points on a single thread.

  • CPU: Intel Xeon Skylake @ 2.0 GHz
  • Compiler: Intel 17.0 x64
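
A measurement of this kind can be sketched as follows. This is only an illustration of the method (fill a 32x32x32 block on one thread and report the elapsed time in microseconds), not the benchmark harness that produced the published figures. `placeholderNoise` is a hypothetical stand-in for whichever noise function is being measured, and `timeBlockMicroseconds` is likewise a name invented for this sketch.

```cpp
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the noise function under test:
// a cheap integer hash of the coordinates, mapped to [-1, 1].
inline float placeholderNoise(int x, int y, int z) {
    std::uint32_t h = static_cast<std::uint32_t>(x) * 73856093u
                    ^ static_cast<std::uint32_t>(y) * 19349663u
                    ^ static_cast<std::uint32_t>(z) * 83492791u;
    h ^= h >> 13;
    h *= 1274126177u;
    h ^= h >> 16;
    return static_cast<float>(h) * (2.0f / 4294967296.0f) - 1.0f;
}

constexpr int kSize = 32;  // 32x32x32 points per block

// Fill one block on the calling thread and return the elapsed
// wall-clock time in microseconds (x1000 ns).
double timeBlockMicroseconds(std::vector<float>& out) {
    out.resize(static_cast<std::size_t>(kSize) * kSize * kSize);
    const auto start = std::chrono::steady_clock::now();
    std::size_t i = 0;
    for (int x = 0; x < kSize; ++x)
        for (int y = 0; y < kSize; ++y)
            for (int z = 0; z < kSize; ++z)
                out[i++] = placeholderNoise(x, y, z);
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::micro>(stop - start).count();
}
```

A single run is noisy, so in practice one would probably repeat the call many times and report an average or a minimum.
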
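
| Noise Type      | AVX512 | AVX2 | SSE4.1 | SSE2  | FastNoise | LibNoise |
|-----------------|--------|------|--------|-------|-----------|----------|
| White Noise     | 9      | 13   | 22     | 69    | 142       |          |
| Value           | 102    | 159  | 338    | 799   | 614       |          |
| Value Fractal   | 325    | 511  | 1077   | 2456  | 1961      |          |
| Perlin          | 153    | 354  | 639    | 1037  | 964       | 1406     |
| Perlin Fractal  | 470    | 1060 | 1969   | 3153  | 3125      | 3832     |
| Simplex         | 158    | 350  | 601    | 729   | 1189      |          |
| Simplex Fractal | 482    | 1042 | 1835   | 2241  | 2774      |          |
| Cellular        | 991    | 1475 | 3206   | 4113  | 2949      | 57813    |
| Cubic           | 908    | 1378 | 2663   | 5957  | 2949      |          |
| Cubic Fractal   | 2799   | 4188 | 8042   | 17950 | 9271      |          |
| CF Sampled 1    | 492    | 720  | 1323   | 2805  |           |          |
| CF Sampled 2    | 95     | 174  | 282    | 505   |           |          |
| CF Sampled 3    | 40     | 94   | 132    | 172   |           |          |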

CF (Cubic Fractal) Sampled {sampling level}: shows how much performance improves at each sampling level compared with unsampled Cubic Fractal.
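
To make the effect of sampling concrete, the speedup at each level can be computed by dividing the unsampled Cubic Fractal timing by the sampled timing. The sketch below does this for the AVX512 column of the table above (2799 for Cubic Fractal, then 492, 95 and 40 for sampling levels 1 to 3). The names `SampledTiming`, `speedup` and `printSpeedups` are hypothetical and exist only for this example.

```cpp
#include <array>
#include <cstdio>

// One sampled measurement: sampling level and time in microseconds.
struct SampledTiming {
    int level;
    double microseconds;
};

// AVX512 timings copied from the table (x1000 ns per 32x32x32 block).
constexpr double kCubicFractalAvx512 = 2799.0;
constexpr std::array<SampledTiming, 3> kSampledAvx512 = {{
    {1, 492.0},
    {2, 95.0},
    {3, 40.0},
}};

// Speedup factor of a sampled run relative to the unsampled baseline.
constexpr double speedup(double baseline, double sampled) {
    return baseline / sampled;
}

// Print one line per sampling level with its speedup factor.
void printSpeedups() {
    for (const SampledTiming& s : kSampledAvx512) {
        std::printf("CF Sampled %d: %.1fx faster than unsampled Cubic Fractal\n",
                    s.level, speedup(kCubicFractalAvx512, s.microseconds));
    }
}
```

For the AVX512 figures this works out to roughly 5.7x, 29.5x and 70.0x for levels 1, 2 and 3 respectively.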
