Conversation
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
CodSpeed Performance ReportMerging this PR will degrade performance by 15.94%Comparing
|
| Mode | Benchmark | BASE |
HEAD |
Efficiency | |
|---|---|---|---|---|---|
| 🆕 | WallTime | u16_FoR[1K] |
N/A | 5.6 µs | N/A |
| 🆕 | WallTime | u8_FoR[10M] |
N/A | 5.6 µs | N/A |
| 🆕 | WallTime | u32_FoR[100K] |
N/A | 7.3 µs | N/A |
| 🆕 | WallTime | u16_FoR[1M] |
N/A | 6.1 µs | N/A |
| 🆕 | WallTime | u8_FoR[1K] |
N/A | 7.5 µs | N/A |
| 🆕 | WallTime | u32_FoR[10K] |
N/A | 6.3 µs | N/A |
| 🆕 | WallTime | u8_FoR[1M] |
N/A | 5.6 µs | N/A |
| 🆕 | WallTime | u64_FoR[100K] |
N/A | 12.4 µs | N/A |
| 🆕 | WallTime | u16_FoR[10M] |
N/A | 9.7 µs | N/A |
| 🆕 | WallTime | u32_FoR[1K] |
N/A | 6 µs | N/A |
| 🆕 | WallTime | u64_FoR[10K] |
N/A | 14.3 µs | N/A |
| 🆕 | WallTime | u64_FoR[1K] |
N/A | 6.9 µs | N/A |
| 🆕 | WallTime | u64_FoR[10M] |
N/A | 342.5 µs | N/A |
| 🆕 | WallTime | u16_FoR[100K] |
N/A | 8.1 µs | N/A |
| 🆕 | WallTime | u32_FoR[10M] |
N/A | 174.2 µs | N/A |
| 🆕 | WallTime | u32_FoR[1M] |
N/A | 11.6 µs | N/A |
| 🆕 | WallTime | u8_FoR[100K] |
N/A | 5.6 µs | N/A |
| 🆕 | WallTime | u8_FoR[10K] |
N/A | 5.6 µs | N/A |
| 🆕 | WallTime | u16_FoR[10K] |
N/A | 6.3 µs | N/A |
| 🆕 | WallTime | u64_FoR[1M] |
N/A | 32.7 µs | N/A |
| ... | ... | ... | ... | ... | ... |
ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
Footnotes
-
1274 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
Always include CUDA code in tests and benchmarks if the environment has a CUDA installation. This does not require the user to pass a feature flag but is determined at build.rs time.