13th Gen Intel(R) Core(TM) i7-13700.txt
Date: 20240118 212150
ARCH: x64 (x86_64)
FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C
Name: 13th Gen Intel(R) Core(TM) i7-13700
CPU Thread: 24
CPU Core : 16
CPU Group : 2
Group 0: Thread=16 Clock=5.100000 GHz (mask:ffff)
Group 1: Thread= 8 Clock=4.100000 GHz (mask:ff0000)
SSE : yes
AVX : yes
FMA : yes
F16C : yes
AVX512: no
Total:
SingleThread HP max: -
SingleThread SP max: 165.987 GFLOPS
SingleThread DP max: 82.994 GFLOPS
MultiThread HP max: -
MultiThread SP max: 1927.613 GFLOPS
MultiThread DP max: 881.662 GFLOPS
Group 0: Thread=16 Clock=5.100000 GHz (mask:ffff)
SingleThread HP max: -
SingleThread SP max: 165.987 GFLOPS
SingleThread DP max: 82.994 GFLOPS
MultiThread HP max: -
MultiThread SP max: 1404.151 GFLOPS
MultiThread DP max: 619.937 GFLOPS
Group 1: Thread=8 Clock=4.100000 GHz (mask:ff0000)
SingleThread HP max: -
SingleThread SP max: 65.432 GFLOPS
SingleThread DP max: 32.716 GFLOPS
MultiThread HP max: -
MultiThread SP max: 523.463 GFLOPS
MultiThread DP max: 261.725 GFLOPS
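Note: the "Total" multi-thread figures appear to be the sum of the two per-group multi-thread peaks. This is inferred from the numbers in this log, not from any documentation of the benchmark tool. A minimal Python sketch of that check, with the values copied from the summary above (the names `group_peaks`, `reported_total`, and `summed_peak` are made up for this illustration):

```python
# Multi-thread peak values in GFLOPS, copied from the summary above.
group_peaks = {
    "Group 0": {"SP": 1404.151, "DP": 619.937},
    "Group 1": {"SP": 523.463, "DP": 261.725},
}
reported_total = {"SP": 1927.613, "DP": 881.662}


def summed_peak(precision):
    """Add up the per-group multi-thread peak for one precision."""
    return sum(group[precision] for group in group_peaks.values())


for precision, reported in reported_total.items():
    total = summed_peak(precision)
    print(f"{precision}: sum of groups = {total:.3f}, reported = {reported:.3f}")
```

The sums agree with the reported totals to within the last printed digit.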
* Group 0: Thread=1 Clock=5.100000 GHz (mask:ffff)
* SSE/AVX (SP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.298 10282.7 10282.7 ( 1.0 2.0)
SSE addss (32bit x1) n8 : 0.296 10335.8 10335.8 ( 1.0 2.0)
FMA vfmaddss (32bit x1) n8 : 0.296 20675.8 10337.9 ( 2.0 2.0)
FMA vfmaddss (32bit x1) n12 : 0.442 20750.2 10375.1 ( 2.0 2.0)
FMA vfma+mlss (32bit x1) n12 : 0.442 15560.3 10373.6 ( 1.5 2.0)
FMA vfma+adss (32bit x1) n12 : 0.335 20573.7 13715.8 ( 1.5 2.7)
SSE mulps (32bit x4) n8 : 0.296 41348.4 10337.1 ( 4.0 2.0)
SSE addps (32bit x4) n8 : 0.295 41499.1 10374.8 ( 4.0 2.0)
SSE mul+addps (32bit x4) n8 : 0.295 41497.4 10374.4 ( 4.0 2.0)
FMA vfmaddps (32bit x4) n8 : 0.296 82684.0 10335.5 ( 8.0 2.0)
FMA vfmaddps (32bit x4) n12 : 0.442 82996.2 10374.5 ( 8.0 2.0)
FMA vfma+mlps (32bit x4) n12 : 0.442 62248.3 10374.7 ( 6.0 2.0)
FMA vfma+adps (32bit x4) n12 : 0.336 82007.2 13667.9 ( 6.0 2.7)
SSE ml+ad+adps (32bit x4) n9 : 0.295 46681.1 11670.3 ( 4.0 2.3)
SSE mulss (32bit x1) ns4 : 0.590 5187.7 5187.7 ( 1.0 1.0)
SSE addss (32bit x1) ns4 : 0.296 10344.5 10344.5 ( 1.0 2.0)
SSE mulps (32bit x4) ns4 : 0.590 20749.5 5187.4 ( 4.0 1.0)
SSE addps (32bit x4) ns4 : 0.296 41377.4 10344.3 ( 4.0 2.0)
AVX vmulps (32bit x8) n8 : 0.295 82978.3 10372.3 ( 8.0 2.0)
AVX vaddps (32bit x8) n8 : 0.295 83006.4 10375.8 ( 8.0 2.0)
AVX vmul+addps (32bit x8) n8 : 0.197 124487.7 15561.0 ( 8.0 3.1)
FMA vfmaddps (32bit x8) n8 : 0.371 132011.8 8250.7 ( 16.0 1.6)
FMA vfmaddps (32bit x8) n12 : 0.442 165987.5 10374.2 ( 16.0 2.0)
FMA vfma+mlps (32bit x8) n12 : 0.442 124495.1 10374.6 ( 12.0 2.0)
FMA vfma+adps (32bit x8) n12 : 0.381 144625.0 12052.1 ( 12.0 2.4)
AVX vml+ad+adps (32bit x8) n9 : 0.315 87363.9 10920.5 ( 8.0 2.1)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.358 61606.0 10472.1 ( 5.8 2.1)
Highest : 0.197 165987.5 15561.0 ( 16.0 3.1)
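Note: the columns in these tables appear to be related by MFLOPS = MOPS x FOP and IPC = MOPS / clock (in MHz). Both relations are inferred from the rows in this log and are an assumption, not a documented definition from the tool. A small Python sketch that checks them on three rows of the table above (`CLOCK_MHZ`, `rows`, and `derived` are names invented for this example):

```python
CLOCK_MHZ = 5100.0  # Group 0 clock from the section header (5.1 GHz)

# (label, MFLOPS, MOPS, FOP, IPC) copied from the table above.
rows = [
    ("FMA vfmaddps (32bit x8) n12", 165987.5, 10374.2, 16.0, 2.0),
    ("AVX vmul+addps (32bit x8) n8", 124487.7, 15561.0, 8.0, 3.1),
    ("FMA vfma+adss (32bit x1) n12", 20573.7, 13715.8, 1.5, 2.7),
]


def derived(mops, fop, clock_mhz=CLOCK_MHZ):
    """Return (MFLOPS, IPC) recomputed from MOPS and FOP under the assumed relations."""
    return mops * fop, mops / clock_mhz


for label, mflops, mops, fop, ipc in rows:
    calc_mflops, calc_ipc = derived(mops, fop)
    print(f"{label}: MFLOPS {calc_mflops:.1f} (reported {mflops}), "
          f"IPC {calc_ipc:.2f} (reported {ipc})")
```

For these rows the recomputed values match the reported ones to the printed precision.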
* Group 0: Thread=1 Clock=5.100000 GHz (mask:ffff)
* SSE/AVX (DP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.296 10337.6 10337.6 ( 1.0 2.0)
SSE2 addsd (64bit x1) n8 : 0.295 10375.4 10375.4 ( 1.0 2.0)
FMA vfmaddsd (64bit x1) n8 : 0.296 20645.2 10322.6 ( 2.0 2.0)
FMA vfmaddsd (64bit x1) n12 : 0.442 20747.7 10373.9 ( 2.0 2.0)
FMA vfma+mlsd (64bit x1) n12 : 0.442 15562.3 10374.9 ( 1.5 2.0)
FMA vfma+adsd (64bit x1) n12 : 0.333 20655.5 13770.3 ( 1.5 2.7)
SSE2 mulpd (64bit x2) n8 : 0.296 20665.0 10332.5 ( 2.0 2.0)
SSE2 addpd (64bit x2) n8 : 0.295 20749.1 10374.6 ( 2.0 2.0)
SSE2 mul+addpd (64bit x2) n8 : 0.295 20749.8 10374.9 ( 2.0 2.0)
FMA vfmaddpd (64bit x2) n8 : 0.296 41331.9 10333.0 ( 4.0 2.0)
FMA vfmaddpd (64bit x2) n12 : 0.442 41498.2 10374.5 ( 4.0 2.0)
FMA vfma+mlpd (64bit x2) n12 : 0.442 31122.0 10374.0 ( 3.0 2.0)
FMA vfma+adpd (64bit x2) n12 : 0.334 41181.9 13727.3 ( 3.0 2.7)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.296 23298.9 11649.5 ( 2.0 2.3)
SSE2 mulsd (64bit x1) ns4 : 0.590 5183.7 5183.7 ( 1.0 1.0)
SSE2 addsd (64bit x1) ns4 : 0.296 10353.9 10353.9 ( 1.0 2.0)
SSE2 mulpd (64bit x2) ns4 : 0.590 10367.0 5183.5 ( 2.0 1.0)
SSE2 addpd (64bit x2) ns4 : 0.296 20672.3 10336.2 ( 2.0 2.0)
AVX vmulpd (64bit x4) n8 : 0.295 41490.8 10372.7 ( 4.0 2.0)
AVX vaddpd (64bit x4) n8 : 0.295 41494.5 10373.6 ( 4.0 2.0)
AVX vmul+addpd (64bit x4) n8 : 0.197 62241.9 15560.5 ( 4.0 3.1)
FMA vfmaddpd (64bit x4) n8 : 0.371 65960.4 8245.0 ( 8.0 1.6)
FMA vfmaddpd (64bit x4) n12 : 0.442 82994.3 10374.3 ( 8.0 2.0)
FMA vfma+mlpd (64bit x4) n12 : 0.442 62249.8 10375.0 ( 6.0 2.0)
FMA vfma+adpd (64bit x4) n12 : 0.383 71862.3 11977.0 ( 6.0 2.3)
AVX vml_ad_adpd (64bit x4) n9 : 0.221 62248.3 15562.1 ( 4.0 3.1)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.355 33693.8 10653.6 ( 3.1 2.1)
Highest : 0.197 82994.3 15562.1 ( 8.0 3.1)
* Group 0: Thread=16 Clock=5.100000 GHz (mask:ffff)
* SSE/AVX (SP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.621 78893.6 4930.8 ( 16.0 1.0)
SSE addss (32bit x1) n8 : 0.601 81463.8 5091.5 ( 16.0 1.0)
FMA vfmaddss (32bit x1) n8 : 0.620 157904.2 4934.5 ( 32.0 1.0)
FMA vfmaddss (32bit x1) n12 : 0.941 156022.1 4875.7 ( 32.0 1.0)
FMA vfma+mlss (32bit x1) n12 : 0.942 116998.9 7312.4 ( 16.0 1.4)
FMA vfma+adss (32bit x1) n12 : 0.602 182866.7 11429.2 ( 16.0 2.2)
SSE mulps (32bit x4) n8 : 0.620 316108.6 4939.2 ( 64.0 1.0)
SSE addps (32bit x4) n8 : 0.614 318962.8 4983.8 ( 64.0 1.0)
SSE mul+addps (32bit x4) n8 : 0.411 476343.4 7442.9 ( 64.0 1.5)
FMA vfmaddps (32bit x4) n8 : 0.620 631672.7 4934.9 (128.0 1.0)
FMA vfmaddps (32bit x4) n12 : 0.937 626942.8 4898.0 (128.0 1.0)
FMA vfma+mlps (32bit x4) n12 : 0.937 470429.0 4900.3 ( 96.0 1.0)
FMA vfma+adps (32bit x4) n12 : 0.602 731721.9 7622.1 ( 96.0 1.5)
SSE ml+ad+adps (32bit x4) n9 : 0.451 488045.8 7625.7 ( 64.0 1.5)
SSE mulss (32bit x1) ns4 : 0.607 80713.5 5044.6 ( 16.0 1.0)
SSE addss (32bit x1) ns4 : 0.600 81534.6 5095.9 ( 16.0 1.0)
SSE mulps (32bit x4) ns4 : 0.603 325023.5 5078.5 ( 64.0 1.0)
SSE addps (32bit x4) ns4 : 0.600 326384.8 5099.8 ( 64.0 1.0)
AVX vmulps (32bit x8) n8 : 0.649 603390.4 4714.0 (128.0 0.9)
AVX vaddps (32bit x8) n8 : 0.625 626254.1 4892.6 (128.0 1.0)
AVX vmul+addps (32bit x8) n8 : 0.418 936101.9 7313.3 (128.0 1.4)
FMA vfmaddps (32bit x8) n8 : 0.647 1211397.7 4732.0 (256.0 0.9)
FMA vfmaddps (32bit x8) n12 : 0.978 1201455.2 4693.2 (256.0 0.9)
FMA vfma+mlps (32bit x8) n12 : 0.977 901585.5 4695.8 (192.0 0.9)
FMA vfma+adps (32bit x8) n12 : 0.628 1404150.6 7313.3 (192.0 1.4)
AVX vml+ad+adps (32bit x8) n9 : 0.502 878484.7 6863.2 (128.0 1.3)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.667 515802.0 5825.3 ( 92.3 1.1)
Highest : 0.411 1404150.6 11429.2 (256.0 2.2)
* Group 0: Thread=16 Clock=5.100000 GHz (mask:ffff)
* SSE/AVX (DP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.625 78314.6 4894.7 ( 16.0 1.0)
SSE2 addsd (64bit x1) n8 : 0.601 81481.9 5092.6 ( 16.0 1.0)
FMA vfmaddsd (64bit x1) n8 : 0.621 157569.0 4924.0 ( 32.0 1.0)
FMA vfmaddsd (64bit x1) n12 : 0.938 156584.3 4893.3 ( 32.0 1.0)
FMA vfma+mlsd (64bit x1) n12 : 0.938 117495.7 7343.5 ( 16.0 1.4)
FMA vfma+adsd (64bit x1) n12 : 0.602 182894.6 11430.9 ( 16.0 2.2)
SSE2 mulpd (64bit x2) n8 : 0.621 157786.1 4930.8 ( 32.0 1.0)
SSE2 addpd (64bit x2) n8 : 0.600 163140.7 5098.1 ( 32.0 1.0)
SSE2 mul+addpd (64bit x2) n8 : 0.410 238595.3 7456.1 ( 32.0 1.5)
FMA vfmaddpd (64bit x2) n8 : 0.620 316016.2 4937.8 ( 64.0 1.0)
FMA vfmaddpd (64bit x2) n12 : 0.939 312791.5 4887.4 ( 64.0 1.0)
FMA vfma+mlpd (64bit x2) n12 : 1.002 219939.1 4582.1 ( 48.0 0.9)
FMA vfma+adpd (64bit x2) n12 : 0.687 320637.5 6679.9 ( 48.0 1.3)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.519 212331.2 6635.4 ( 32.0 1.3)
SSE2 mulsd (64bit x1) ns4 : 0.685 71457.0 4466.1 ( 16.0 0.9)
SSE2 addsd (64bit x1) ns4 : 0.667 73371.6 4585.7 ( 16.0 0.9)
SSE2 mulpd (64bit x2) ns4 : 0.672 145722.7 4553.8 ( 32.0 0.9)
SSE2 addpd (64bit x2) ns4 : 0.663 147641.8 4613.8 ( 32.0 0.9)
AVX vmulpd (64bit x4) n8 : 0.711 275368.3 4302.6 ( 64.0 0.8)
AVX vaddpd (64bit x4) n8 : 0.680 287905.2 4498.5 ( 64.0 0.9)
AVX vmul+addpd (64bit x4) n8 : 0.467 419354.0 6552.4 ( 64.0 1.3)
FMA vfmaddpd (64bit x4) n8 : 0.717 546216.7 4267.3 (128.0 0.8)
FMA vfmaddpd (64bit x4) n12 : 1.068 550093.8 4297.6 (128.0 0.8)
FMA vfma+mlpd (64bit x4) n12 : 1.078 408855.9 4258.9 ( 96.0 0.8)
FMA vfma+adpd (64bit x4) n12 : 0.711 619936.9 6457.7 ( 96.0 1.3)
AVX vml_ad_adpd (64bit x4) n9 : 0.528 417486.2 6523.2 ( 64.0 1.3)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.707 256884.2 5506.3 ( 49.2 1.1)
Highest : 0.410 619936.9 11430.9 (128.0 2.2)
* Group 1: Thread=1 Clock=4.100000 GHz (mask:ff0000)
* SSE/AVX (SP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.358 6875.8 6875.8 ( 1.0 1.7)
SSE addss (32bit x1) n8 : 0.305 8053.3 8053.3 ( 1.0 2.0)
FMA vfmaddss (32bit x1) n8 : 0.507 9705.2 4852.6 ( 2.0 1.2)
FMA vfmaddss (32bit x1) n12 : 0.525 14067.6 7033.8 ( 2.0 1.7)
FMA vfma+mlss (32bit x1) n12 : 0.525 10549.9 7033.3 ( 1.5 1.7)
FMA vfma+adss (32bit x1) n12 : 0.523 10590.5 7060.3 ( 1.5 1.7)
SSE mulps (32bit x4) n8 : 0.352 27932.5 6983.1 ( 4.0 1.7)
SSE addps (32bit x4) n8 : 0.305 32258.5 8064.6 ( 4.0 2.0)
SSE mul+addps (32bit x4) n8 : 0.337 29165.8 7291.5 ( 4.0 1.8)
FMA vfmaddps (32bit x4) n8 : 0.508 38742.6 4842.8 ( 8.0 1.2)
FMA vfmaddps (32bit x4) n12 : 0.521 56609.3 7076.2 ( 8.0 1.7)
FMA vfma+mlps (32bit x4) n12 : 0.524 42229.2 7038.2 ( 6.0 1.7)
FMA vfma+adps (32bit x4) n12 : 0.523 42335.9 7056.0 ( 6.0 1.7)
SSE ml+ad+adps (32bit x4) n9 : 0.344 32144.5 8036.1 ( 4.0 2.0)
SSE mulss (32bit x1) ns4 : 0.602 4089.5 4089.5 ( 1.0 1.0)
SSE addss (32bit x1) ns4 : 0.572 4301.6 4301.6 ( 1.0 1.0)
SSE mulps (32bit x4) ns4 : 0.602 16358.2 4089.5 ( 4.0 1.0)
SSE addps (32bit x4) ns4 : 0.573 17166.0 4291.5 ( 4.0 1.0)
AVX vmulps (32bit x8) n8 : 0.602 32716.0 4089.5 ( 8.0 1.0)
AVX vaddps (32bit x8) n8 : 0.602 32716.1 4089.5 ( 8.0 1.0)
AVX vmul+addps (32bit x8) n8 : 0.602 32716.4 4089.6 ( 8.0 1.0)
FMA vfmaddps (32bit x8) n8 : 0.602 65431.3 4089.5 ( 16.0 1.0)
FMA vfmaddps (32bit x8) n12 : 0.902 65432.2 4089.5 ( 16.0 1.0)
FMA vfma+mlps (32bit x8) n12 : 0.914 48433.6 4036.1 ( 12.0 1.0)
FMA vfma+adps (32bit x8) n12 : 0.914 48434.4 4036.2 ( 12.0 1.0)
AVX vml+ad+adps (32bit x8) n9 : 0.827 26767.9 3346.0 ( 8.0 0.8)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.557 29070.1 5612.9 ( 5.8 1.4)
Highest : 0.305 65432.2 8064.6 ( 16.0 2.0)
* Group 1: Thread=1 Clock=4.100000 GHz (mask:ff0000)
* SSE/AVX (DP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.353 6970.2 6970.2 ( 1.0 1.7)
SSE2 addsd (64bit x1) n8 : 0.305 8060.1 8060.1 ( 1.0 2.0)
FMA vfmaddsd (64bit x1) n8 : 0.507 9711.8 4855.9 ( 2.0 1.2)
FMA vfmaddsd (64bit x1) n12 : 0.525 14056.8 7028.4 ( 2.0 1.7)
FMA vfma+mlsd (64bit x1) n12 : 0.525 10541.1 7027.4 ( 1.5 1.7)
FMA vfma+adsd (64bit x1) n12 : 0.523 10588.0 7058.7 ( 1.5 1.7)
SSE2 mulpd (64bit x2) n8 : 0.352 13957.4 6978.7 ( 2.0 1.7)
SSE2 addpd (64bit x2) n8 : 0.305 16119.0 8059.5 ( 2.0 2.0)
SSE2 mul+addpd (64bit x2) n8 : 0.334 14724.6 7362.3 ( 2.0 1.8)
FMA vfmaddpd (64bit x2) n8 : 0.505 19478.2 4869.6 ( 4.0 1.2)
FMA vfmaddpd (64bit x2) n12 : 0.522 28289.2 7072.3 ( 4.0 1.7)
FMA vfma+mlpd (64bit x2) n12 : 0.524 21134.5 7044.8 ( 3.0 1.7)
FMA vfma+adpd (64bit x2) n12 : 0.523 21172.7 7057.6 ( 3.0 1.7)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.344 16071.8 8035.9 ( 2.0 2.0)
SSE2 mulsd (64bit x1) ns4 : 0.602 4089.4 4089.4 ( 1.0 1.0)
SSE2 addsd (64bit x1) ns4 : 0.572 4302.2 4302.2 ( 1.0 1.0)
SSE2 mulpd (64bit x2) ns4 : 0.602 8179.0 4089.5 ( 2.0 1.0)
SSE2 addpd (64bit x2) ns4 : 0.571 8622.8 4311.4 ( 2.0 1.1)
AVX vmulpd (64bit x4) n8 : 0.602 16358.2 4089.6 ( 4.0 1.0)
AVX vaddpd (64bit x4) n8 : 0.602 16358.1 4089.5 ( 4.0 1.0)
AVX vmul+addpd (64bit x4) n8 : 0.602 16357.6 4089.4 ( 4.0 1.0)
FMA vfmaddpd (64bit x4) n8 : 0.602 32714.3 4089.3 ( 8.0 1.0)
FMA vfmaddpd (64bit x4) n12 : 0.902 32715.6 4089.5 ( 8.0 1.0)
FMA vfma+mlpd (64bit x4) n12 : 0.914 24215.6 4035.9 ( 6.0 1.0)
FMA vfma+adpd (64bit x4) n12 : 0.914 24216.9 4036.2 ( 6.0 1.0)
AVX vml_ad_adpd (64bit x4) n9 : 0.677 16356.9 4089.2 ( 4.0 1.0)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.550 15975.5 5649.3 ( 3.1 1.4)
Highest : 0.305 32715.6 8060.1 ( 8.0 2.0)
* Group 1: Thread=8 Clock=4.100000 GHz (mask:ff0000)
* SSE/AVX (SP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.353 55738.2 6967.3 ( 8.0 1.7)
SSE addss (32bit x1) n8 : 0.305 64446.8 8055.9 ( 8.0 2.0)
FMA vfmaddss (32bit x1) n8 : 0.507 77604.2 4850.3 ( 16.0 1.2)
FMA vfmaddss (32bit x1) n12 : 0.524 112665.5 7041.6 ( 16.0 1.7)
FMA vfma+mlss (32bit x1) n12 : 0.524 84424.7 10553.1 ( 8.0 2.6)
FMA vfma+adss (32bit x1) n12 : 0.523 84719.5 10589.9 ( 8.0 2.6)
SSE mulps (32bit x4) n8 : 0.353 223109.6 6972.2 ( 32.0 1.7)
SSE addps (32bit x4) n8 : 0.306 257656.5 8051.8 ( 32.0 2.0)
SSE mul+addps (32bit x4) n8 : 0.338 232918.7 7278.7 ( 32.0 1.8)
FMA vfmaddps (32bit x4) n8 : 0.507 310515.4 4851.8 ( 64.0 1.2)
FMA vfmaddps (32bit x4) n12 : 0.521 453316.1 7083.1 ( 64.0 1.7)
FMA vfma+mlps (32bit x4) n12 : 0.524 337897.3 7039.5 ( 48.0 1.7)
FMA vfma+adps (32bit x4) n12 : 0.523 338712.7 7056.5 ( 48.0 1.7)
SSE ml+ad+adps (32bit x4) n9 : 0.344 257141.4 8035.7 ( 32.0 2.0)
SSE mulss (32bit x1) ns4 : 0.602 32716.2 4089.5 ( 8.0 1.0)
SSE addss (32bit x1) ns4 : 0.572 34391.7 4299.0 ( 8.0 1.0)
SSE mulps (32bit x4) ns4 : 0.602 130865.4 4089.5 ( 32.0 1.0)
SSE addps (32bit x4) ns4 : 0.573 137462.3 4295.7 ( 32.0 1.0)
AVX vmulps (32bit x8) n8 : 0.602 261730.8 4089.5 ( 64.0 1.0)
AVX vaddps (32bit x8) n8 : 0.602 261729.1 4089.5 ( 64.0 1.0)
AVX vmul+addps (32bit x8) n8 : 0.602 261728.7 4089.5 ( 64.0 1.0)
FMA vfmaddps (32bit x8) n8 : 0.602 523462.5 4089.6 (128.0 1.0)
FMA vfmaddps (32bit x8) n12 : 0.902 523456.5 4089.5 (128.0 1.0)
FMA vfma+mlps (32bit x8) n12 : 0.914 387482.9 4036.3 ( 96.0 1.0)
FMA vfma+adps (32bit x8) n12 : 0.914 387462.2 4036.1 ( 96.0 1.0)
AVX vml+ad+adps (32bit x8) n9 : 0.827 214140.1 3345.9 ( 64.0 0.8)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.556 232596.0 5887.2 ( 46.2 1.4)
Highest : 0.305 523462.5 10589.9 (128.0 2.6)
* Group 1: Thread=8 Clock=4.100000 GHz (mask:ff0000)
* SSE/AVX (DP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.357 55107.8 6888.5 ( 8.0 1.7)
SSE2 addsd (64bit x1) n8 : 0.305 64497.5 8062.2 ( 8.0 2.0)
FMA vfmaddsd (64bit x1) n8 : 0.505 77881.2 4867.6 ( 16.0 1.2)
FMA vfmaddsd (64bit x1) n12 : 0.524 112667.5 7041.7 ( 16.0 1.7)
FMA vfma+mlsd (64bit x1) n12 : 0.525 84417.0 10552.1 ( 8.0 2.6)
FMA vfma+adsd (64bit x1) n12 : 0.523 84662.8 10582.9 ( 8.0 2.6)
SSE2 mulpd (64bit x2) n8 : 0.352 111753.4 6984.6 ( 16.0 1.7)
SSE2 addpd (64bit x2) n8 : 0.305 128946.4 8059.2 ( 16.0 2.0)
SSE2 mul+addpd (64bit x2) n8 : 0.335 117367.5 7335.5 ( 16.0 1.8)
FMA vfmaddpd (64bit x2) n8 : 0.506 155635.9 4863.6 ( 32.0 1.2)
FMA vfmaddpd (64bit x2) n12 : 0.523 225926.9 7060.2 ( 32.0 1.7)
FMA vfma+mlpd (64bit x2) n12 : 0.524 169028.6 7042.9 ( 24.0 1.7)
FMA vfma+adpd (64bit x2) n12 : 0.523 169421.2 7059.2 ( 24.0 1.7)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.344 128577.8 8036.1 ( 16.0 2.0)
SSE2 mulsd (64bit x1) ns4 : 0.602 32715.6 4089.5 ( 8.0 1.0)
SSE2 addsd (64bit x1) ns4 : 0.570 34497.2 4312.2 ( 8.0 1.1)
SSE2 mulpd (64bit x2) ns4 : 0.602 65431.7 4089.5 ( 16.0 1.0)
SSE2 addpd (64bit x2) ns4 : 0.571 68946.8 4309.2 ( 16.0 1.1)
AVX vmulpd (64bit x4) n8 : 0.602 130862.2 4089.4 ( 32.0 1.0)
AVX vaddpd (64bit x4) n8 : 0.602 130862.8 4089.5 ( 32.0 1.0)
AVX vmul+addpd (64bit x4) n8 : 0.602 130863.2 4089.5 ( 32.0 1.0)
FMA vfmaddpd (64bit x4) n8 : 0.602 261724.7 4089.4 ( 64.0 1.0)
FMA vfmaddpd (64bit x4) n12 : 0.902 261725.0 4089.5 ( 64.0 1.0)
FMA vfma+mlpd (64bit x4) n12 : 0.914 193728.3 4036.0 ( 48.0 1.0)
FMA vfma+adpd (64bit x4) n12 : 0.914 193738.5 4036.2 ( 48.0 1.0)
AVX vml_ad_adpd (64bit x4) n9 : 0.677 130863.4 4089.5 ( 32.0 1.0)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.550 127763.5 5917.1 ( 24.6 1.4)
Highest : 0.305 261725.0 10582.9 ( 64.0 2.6)