chore: benchmark gpu ci #6107
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
Merging this PR will degrade performance by 15.76%
| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| 🆕 | WallTime | u64_FoR[1K] | N/A | 7 µs | N/A |
| 🆕 | WallTime | u8_FoR[10M] | N/A | 5.9 µs | N/A |
| 🆕 | WallTime | u32_FoR[1M] | N/A | 11.8 µs | N/A |
| 🆕 | WallTime | u32_FoR[10K] | N/A | 6.3 µs | N/A |
| 🆕 | WallTime | u64_FoR[100K] | N/A | 12.3 µs | N/A |
| 🆕 | WallTime | u32_FoR[1K] | N/A | 6.3 µs | N/A |
| 🆕 | WallTime | u16_FoR[1K] | N/A | 5.9 µs | N/A |
| 🆕 | WallTime | u8_FoR[1K] | N/A | 8.9 µs | N/A |
| 🆕 | WallTime | u16_FoR[10M] | N/A | 6.8 µs | N/A |
| 🆕 | WallTime | u32_FoR[10M] | N/A | 174.3 µs | N/A |
| 🆕 | WallTime | u64_FoR[10M] | N/A | 341.7 µs | N/A |
| 🆕 | WallTime | u8_FoR[100K] | N/A | 5.9 µs | N/A |
| 🆕 | WallTime | u64_FoR[1M] | N/A | 34.8 µs | N/A |
| 🆕 | WallTime | u32_FoR[100K] | N/A | 7.6 µs | N/A |
| 🆕 | WallTime | u16_FoR[1M] | N/A | 6.2 µs | N/A |
| 🆕 | WallTime | u16_FoR[10K] | N/A | 7.4 µs | N/A |
| 🆕 | WallTime | u8_FoR[1M] | N/A | 5.9 µs | N/A |
| 🆕 | WallTime | u16_FoR[100K] | N/A | 10.1 µs | N/A |
| 🆕 | WallTime | u8_FoR[10K] | N/A | 5.9 µs | N/A |
| 🆕 | WallTime | u64_FoR[10K] | N/A | 13.8 µs | N/A |
| ... | ... | ... | ... | ... | ... |
ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
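To compare rows of different sizes, a reported wall time can be converted into an implied throughput. The sketch below is illustrative only and is not part of the PR or of CodSpeed. It assumes the bracketed suffix (`[1K]`, `[10M]`) is an element count in decimal units and that the prefix (`u8`, `u64`) is the element type. The helper name `implied_gb_per_s` is hypothetical.

```python
# Hypothetical helper, not taken from the PR: turns a benchmark name and its
# reported wall time into an implied throughput in decimal GB/s.
# Assumption: "[10M]" means 10,000,000 elements of the prefixed integer type.

ELEMENT_BYTES = {"u8": 1, "u16": 2, "u32": 4, "u64": 8}
SIZE_SUFFIX = {"K": 1_000, "M": 1_000_000}


def implied_gb_per_s(benchmark: str, wall_time_us: float) -> float:
    """Return bytes processed per second, in units of 1e9 bytes."""
    dtype, rest = benchmark.split("_", 1)            # "u64", "FoR[10M]"
    size = rest[rest.index("[") + 1 : rest.index("]")]  # "10M"
    count = int(size[:-1]) * SIZE_SUFFIX[size[-1]]   # 10_000_000
    total_bytes = count * ELEMENT_BYTES[dtype]
    seconds = wall_time_us * 1e-6
    return total_bytes / seconds / 1e9


if __name__ == "__main__":
    # Two rows copied from the table above.
    for name, micros in [("u64_FoR[10M]", 341.7), ("u32_FoR[10M]", 174.3)]:
        print(f"{name}: {implied_gb_per_s(name, micros):.1f} GB/s")
```

Under these assumptions the figure only counts the input elements once; it says nothing about what the measured interval actually covers, so it is best used for relative comparisons between rows.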
Comparing ji/cuda-ci-benchmark (b4d59b0) with develop (9d18652)
Footnotes
1. 1254 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them to remove them from the performance reports.
Add codspeed runs for GPU kernels