/
AMD Custom APU 0405.txt
189 lines (177 loc) · 12.1 KB
/
AMD Custom APU 0405.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
Date: 20230618 104218
ARCH: x64 (x86_64)
FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C
Name: AMD Custom APU 0405
CPU Thread: 8
CPU Core : 4
CPU Group : 1
Group 0: Thread= 8 Clock=2.800000 GHz (mask:ff)
SSE : yes
AVX : yes
FMA : yes
F16C : yes
AVX512: no
Total:
SingleThread HP max: -
SingleThread SP max: 107.328 GFLOPS
SingleThread DP max: 51.903 GFLOPS
MultiThread HP max: -
MultiThread SP max: 448.238 GFLOPS
MultiThread DP max: 203.651 GFLOPS
Group 0: Thread=8 Clock=2.800000 GHz (mask:ff)
SingleThread HP max: -
SingleThread SP max: 107.328 GFLOPS
SingleThread DP max: 51.903 GFLOPS
MultiThread HP max: -
MultiThread SP max: 448.238 GFLOPS
MultiThread DP max: 203.651 GFLOPS
* Group 0: Thread=1 Clock=2.800000 GHz (mask:ff)
* SSE/AVX (SP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.256 6561.3 6561.3 ( 1.0 2.3)
SSE addss (32bit x1) n8 : 0.253 6645.9 6645.9 ( 1.0 2.4)
FMA vfmaddss (32bit x1) n8 : 0.318 10571.7 5285.8 ( 2.0 1.9)
FMA vfmaddss (32bit x1) n12 : 0.369 13671.1 6835.5 ( 2.0 2.4)
FMA vfma+mlss (32bit x1) n12 : 0.376 10063.0 6708.7 ( 1.5 2.4)
FMA vfma+adss (32bit x1) n12 : 0.305 12387.4 8258.3 ( 1.5 2.9)
SSE mulps (32bit x4) n8 : 0.242 27722.9 6930.7 ( 4.0 2.5)
SSE addps (32bit x4) n8 : 0.246 27283.8 6821.0 ( 4.0 2.4)
SSE mul+addps (32bit x4) n8 : 0.190 35356.0 8839.0 ( 4.0 3.2)
FMA vfmaddps (32bit x4) n8 : 0.305 44095.4 5511.9 ( 8.0 2.0)
FMA vfmaddps (32bit x4) n12 : 0.365 55262.1 6907.8 ( 8.0 2.5)
FMA vfma+mlps (32bit x4) n12 : 0.383 39498.5 6583.1 ( 6.0 2.4)
FMA vfma+adps (32bit x4) n12 : 0.315 47958.1 7993.0 ( 6.0 2.9)
SSE ml+ad+adps (32bit x4) n9 : 0.217 34786.9 8696.7 ( 4.0 3.1)
SSE mulss (32bit x1) ns4 : 0.376 4462.2 4462.2 ( 1.0 1.6)
SSE addss (32bit x1) ns4 : 0.382 4396.9 4396.9 ( 1.0 1.6)
SSE mulps (32bit x4) ns4 : 0.373 18022.5 4505.6 ( 4.0 1.6)
SSE addps (32bit x4) ns4 : 0.371 18096.5 4524.1 ( 4.0 1.6)
AVX vmulps (32bit x8) n8 : 0.250 53852.6 6731.6 ( 8.0 2.4)
AVX vaddps (32bit x8) n8 : 0.244 55183.1 6897.9 ( 8.0 2.5)
AVX vmul+addps (32bit x8) n8 : 0.131 102522.6 12815.3 ( 8.0 4.6)
FMA vfmaddps (32bit x8) n8 : 0.310 86591.2 5412.0 ( 16.0 1.9)
FMA vfmaddps (32bit x8) n12 : 0.376 107328.2 6708.0 ( 16.0 2.4)
FMA vfma+mlps (32bit x8) n12 : 0.391 77278.7 6439.9 ( 12.0 2.3)
FMA vfma+adps (32bit x8) n12 : 0.334 90441.7 7536.8 ( 12.0 2.7)
AVX vml+ad+adps (32bit x8) n9 : 0.321 47144.5 5893.1 ( 8.0 2.1)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.308 39891.7 6727.0 ( 5.8 2.4)
Highest : 0.131 107328.2 12815.3 ( 16.0 4.6)
* Group 0: Thread=1 Clock=2.800000 GHz (mask:ff)
* SSE/AVX (DP fp)
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.242 6953.6 6953.6 ( 1.0 2.5)
SSE2 addsd (64bit x1) n8 : 0.244 6891.1 6891.1 ( 1.0 2.5)
FMA vfmaddsd (64bit x1) n8 : 0.313 10751.0 5375.5 ( 2.0 1.9)
FMA vfmaddsd (64bit x1) n12 : 0.363 13874.5 6937.3 ( 2.0 2.5)
FMA vfma+mlsd (64bit x1) n12 : 0.384 9831.6 6554.4 ( 1.5 2.3)
FMA vfma+adsd (64bit x1) n12 : 0.304 12449.2 8299.4 ( 1.5 3.0)
SSE2 mulpd (64bit x2) n8 : 0.241 13919.5 6959.7 ( 2.0 2.5)
SSE2 addpd (64bit x2) n8 : 0.242 13871.1 6935.6 ( 2.0 2.5)
SSE2 mul+addpd (64bit x2) n8 : 0.183 18344.3 9172.2 ( 2.0 3.3)
FMA vfmaddpd (64bit x2) n8 : 0.303 22148.8 5537.2 ( 4.0 2.0)
FMA vfmaddpd (64bit x2) n12 : 0.366 27547.5 6886.9 ( 4.0 2.5)
FMA vfma+mlpd (64bit x2) n12 : 0.391 19342.0 6447.3 ( 3.0 2.3)
FMA vfma+adpd (64bit x2) n12 : 0.305 24746.6 8248.9 ( 3.0 2.9)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.217 17426.2 8713.1 ( 2.0 3.1)
SSE2 mulsd (64bit x1) ns4 : 0.370 4538.0 4538.0 ( 1.0 1.6)
SSE2 addsd (64bit x1) ns4 : 0.367 4573.2 4573.2 ( 1.0 1.6)
SSE2 mulpd (64bit x2) ns4 : 0.384 8758.2 4379.1 ( 2.0 1.6)
SSE2 addpd (64bit x2) ns4 : 0.387 8680.3 4340.2 ( 2.0 1.6)
AVX vmulpd (64bit x4) n8 : 0.255 26348.9 6587.2 ( 4.0 2.4)
AVX vaddpd (64bit x4) n8 : 0.260 25882.8 6470.7 ( 4.0 2.3)
AVX vmul+addpd (64bit x4) n8 : 0.171 39266.8 9816.7 ( 4.0 3.5)
FMA vfmaddpd (64bit x4) n8 : 0.367 36594.7 4574.3 ( 8.0 1.6)
FMA vfmaddpd (64bit x4) n12 : 0.388 51902.8 6487.9 ( 8.0 2.3)
FMA vfma+mlpd (64bit x4) n12 : 0.403 37552.6 6258.8 ( 6.0 2.2)
FMA vfma+adpd (64bit x4) n12 : 0.346 43758.4 7293.1 ( 6.0 2.6)
AVX vml_ad_adpd (64bit x4) n9 : 0.200 37709.7 9427.4 ( 4.0 3.4)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.308 20910.1 6717.6 ( 3.1 2.4)
Highest : 0.171 51902.8 9816.7 ( 8.0 3.5)
* Group 0: Thread=8 Clock=2.800000 GHz (mask:ff)
* SSE/AVX (SP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE mulss (32bit x1) n8 : 0.500 26883.5 3360.4 ( 8.0 1.2)
SSE addss (32bit x1) n8 : 0.517 25984.2 3248.0 ( 8.0 1.2)
FMA vfmaddss (32bit x1) n8 : 0.556 48363.2 3022.7 ( 16.0 1.1)
FMA vfmaddss (32bit x1) n12 : 0.755 53423.5 3339.0 ( 16.0 1.2)
FMA vfma+mlss (32bit x1) n12 : 0.790 38299.1 4787.4 ( 8.0 1.7)
FMA vfma+adss (32bit x1) n12 : 0.741 40794.6 5099.3 ( 8.0 1.8)
SSE mulps (32bit x4) n8 : 0.477 112738.4 3523.1 ( 32.0 1.3)
SSE addps (32bit x4) n8 : 0.467 115129.9 3597.8 ( 32.0 1.3)
SSE mul+addps (32bit x4) n8 : 0.302 178230.4 5569.7 ( 32.0 2.0)
FMA vfmaddps (32bit x4) n8 : 0.534 201471.7 3148.0 ( 64.0 1.1)
FMA vfmaddps (32bit x4) n12 : 0.790 204034.9 3188.0 ( 64.0 1.1)
FMA vfma+mlps (32bit x4) n12 : 0.783 154499.5 3218.7 ( 48.0 1.1)
FMA vfma+adps (32bit x4) n12 : 0.726 166550.5 3469.8 ( 48.0 1.2)
SSE ml+ad+adps (32bit x4) n9 : 0.379 159645.2 4988.9 ( 32.0 1.8)
SSE mulss (32bit x1) ns4 : 0.494 27221.8 3402.7 ( 8.0 1.2)
SSE addss (32bit x1) ns4 : 0.478 28099.5 3512.4 ( 8.0 1.3)
SSE mulps (32bit x4) ns4 : 0.511 105256.2 3289.3 ( 32.0 1.2)
SSE addps (32bit x4) ns4 : 0.507 106103.1 3315.7 ( 32.0 1.2)
AVX vmulps (32bit x8) n8 : 0.454 236978.5 3702.8 ( 64.0 1.3)
AVX vaddps (32bit x8) n8 : 0.463 232475.2 3632.4 ( 64.0 1.3)
AVX vmul+addps (32bit x8) n8 : 0.380 283223.9 4425.4 ( 64.0 1.6)
FMA vfmaddps (32bit x8) n8 : 0.534 402439.6 3144.1 (128.0 1.1)
FMA vfmaddps (32bit x8) n12 : 0.803 401825.8 3139.3 (128.0 1.1)
FMA vfma+mlps (32bit x8) n12 : 0.795 304483.4 3171.7 ( 96.0 1.1)
FMA vfma+adps (32bit x8) n12 : 0.540 448238.2 4669.1 ( 96.0 1.7)
AVX vml+ad+adps (32bit x8) n9 : 0.410 295300.2 4614.1 ( 64.0 1.6)
AVX512 vmulps (32bit x16) n12 : - - - - -
AVX512 vaddps (32bit x16) n12 : - - - - -
AVX512 vfmaddps (32bit x16) n12 : - - - - -
AVX512 vfma+mps (32bit x16) n12 : - - - - -
AVX512 vfma+aps (32bit x16) n12 : - - - - -
AVX512 vmulps (32bit x8) n12 : - - - - -
AVX512 vaddps (32bit x8) n12 : - - - - -
AVX512 vfmaddps (32bit x8) n12 : - - - - -
Average : 0.565 169142.1 3753.1 ( 46.2 1.3)
Highest : 0.302 448238.2 5569.7 (128.0 2.0)
* Group 0: Thread=8 Clock=2.800000 GHz (mask:ff)
* SSE/AVX (DP fp) multi-thread
TIME(s) MFLOPS MOPS FOP IPC
SSE2 mulsd (64bit x1) n8 : 0.490 27451.7 3431.5 ( 8.0 1.2)
SSE2 addsd (64bit x1) n8 : 0.491 27400.5 3425.1 ( 8.0 1.2)
FMA vfmaddsd (64bit x1) n8 : 0.499 53846.0 3365.4 ( 16.0 1.2)
FMA vfmaddsd (64bit x1) n12 : 0.720 56023.8 3501.5 ( 16.0 1.3)
FMA vfma+mlsd (64bit x1) n12 : 0.742 40738.1 5092.3 ( 8.0 1.8)
FMA vfma+adsd (64bit x1) n12 : 0.716 42240.7 5280.1 ( 8.0 1.9)
SSE2 mulpd (64bit x2) n8 : 0.433 62039.1 3877.4 ( 16.0 1.4)
SSE2 addpd (64bit x2) n8 : 0.432 62184.7 3886.5 ( 16.0 1.4)
SSE2 mul+addpd (64bit x2) n8 : 0.333 80772.9 5048.3 ( 16.0 1.8)
FMA vfmaddpd (64bit x2) n8 : 0.521 103219.9 3225.6 ( 32.0 1.2)
FMA vfmaddpd (64bit x2) n12 : 0.771 104642.6 3270.1 ( 32.0 1.2)
FMA vfma+mlpd (64bit x2) n12 : 0.791 76464.2 3186.0 ( 24.0 1.1)
FMA vfma+adpd (64bit x2) n12 : 0.734 82448.9 3435.4 ( 24.0 1.2)
SSE2 ml+ad+dpd (64bit x2) n9 : 0.415 72869.1 4554.3 ( 16.0 1.6)
SSE2 mulsd (64bit x1) ns4 : 0.524 25672.8 3209.1 ( 8.0 1.1)
SSE2 addsd (64bit x1) ns4 : 0.497 27033.2 3379.1 ( 8.0 1.2)
SSE2 mulpd (64bit x2) ns4 : 0.499 53867.8 3366.7 ( 16.0 1.2)
SSE2 addpd (64bit x2) ns4 : 0.516 52050.8 3253.2 ( 16.0 1.2)
AVX vmulpd (64bit x4) n8 : 0.432 124584.0 3893.3 ( 32.0 1.4)
AVX vaddpd (64bit x4) n8 : 0.427 125811.5 3931.6 ( 32.0 1.4)
AVX vmul+addpd (64bit x4) n8 : 0.355 151281.9 4727.6 ( 32.0 1.7)
FMA vfmaddpd (64bit x4) n8 : 0.554 194135.1 3033.4 ( 64.0 1.1)
FMA vfmaddpd (64bit x4) n12 : 0.858 188076.8 2938.7 ( 64.0 1.0)
FMA vfma+mlpd (64bit x4) n12 : 0.826 146466.0 3051.4 ( 48.0 1.1)
FMA vfma+adpd (64bit x4) n12 : 0.594 203650.8 4242.7 ( 48.0 1.5)
AVX vml_ad_adpd (64bit x4) n9 : 0.317 190862.7 5964.5 ( 32.0 2.1)
AVX512 vmulpd (64bit x8) n12 : - - - - -
AVX512 vaddpd (64bit x8) n12 : - - - - -
AVX512 vfmaddpd (64bit x8) n12 : - - - - -
AVX512 vfma+mpd (64bit x8) n12 : - - - - -
AVX512 vfma+apd (64bit x8) n12 : - - - - -
Average : 0.557 91378.3 3829.6 ( 24.6 1.4)
Highest : 0.317 203650.8 5964.5 ( 64.0 2.1)