
[cuda] Switch cuda2 on and cuda1 off by default #16107

Merged
merged 3 commits into iree-org:main from cuda2-try-switch
Jan 23, 2024

Conversation


@antiagainst antiagainst commented Jan 12, 2024

This commit switches the cuda2 HAL driver on and the cuda HAL driver (renamed to cuda1) off by default in CMake. To make the transition simple, it also switches cuda2 to use stream-based command buffers by default, matching cuda1's behavior.

Fixes #13245

benchmark-extra: cuda-large
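For illustration, the default flip described above might look like the following at CMake configure time. The option names below are assumptions for the sketch, not verified against the IREE build; check the project's actual CMake options before using them.

```shell
# Hypothetical configure invocations; option names are illustrative only.

# Before this PR: the original cuda HAL driver is on, the rewrite (cuda2) is opt-in.
cmake -B build -GNinja \
  -DIREE_HAL_DRIVER_CUDA=ON \
  -DIREE_HAL_DRIVER_CUDA2=OFF .

# After this PR: cuda2 is the default; the old driver (renamed cuda1) is opt-in.
cmake -B build -GNinja \
  -DIREE_HAL_DRIVER_CUDA2=ON \
  -DIREE_HAL_DRIVER_CUDA1=OFF .
```

With defaults flipped in CMake, users who build without passing either flag transparently pick up the rewritten driver, while anyone depending on the old one can still re-enable it explicitly during the deprecation window.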

@antiagainst antiagainst added the hal/cuda Runtime CUDA HAL backend label Jan 12, 2024
@antiagainst antiagainst force-pushed the cuda2-try-switch branch 2 times, most recently from 8ab514e to 84396e3 Compare January 12, 2024 05:58
@antiagainst antiagainst added benchmarks:cuda Run default CUDA benchmarks and removed benchmarks:cuda Run default CUDA benchmarks labels Jan 12, 2024
@antiagainst antiagainst added the benchmarks:cuda Run default CUDA benchmarks label Jan 17, 2024

github-actions bot commented Jan 17, 2024

Abbreviated Benchmark Summary

@ commit e0e4e48a08cae52b6c7492e1bde31cf9a8eb6dd0 (vs. base 13dad384f9c0645cbc86eb735f486bff99084082)

Regressed Latencies 🚩

| Benchmark Name | Average Latency (ms) | Median Latency (ms) | Latency Standard Deviation (ms) |
| --- | --- | --- | --- |
| MiniLML12H384Uncased(stablehlo) [cuda-sm_80-linux_gnu-cuda][default-flags] cuda(none)[full-inference,default-flags] with default @ a2-highgpu-1g[gpu] | 1.681 (vs. 1.530, 9.87%↑) | 1.679 | 0.011 |
| matmul_128x256x8192_f16t_tile_config_default(linalg) [cuda-sm_80-linux_gnu-cuda][ukernel,matmul,splitk] cuda(none)[full-inference,default-flags] with default @ a2-highgpu-1g[gpu] | 0.026 (vs. 0.025, 5.54%↑) | 0.026 | 0.000 |

No improved or regressed compilation metrics 🏖️

For more information:

Source Workflow Run

@antiagainst antiagainst force-pushed the cuda2-try-switch branch 3 times, most recently from a782a67 to 0d5f442 Compare January 23, 2024 16:36
This commit changes the benchmark capture steps to start the
capture process first so that we can reduce the number of
benchmark repetitions during capture to reduce capture size.
@antiagainst antiagainst changed the title [cuda] Try to switch cuda2 on as the default [cuda] Switch cuda2 on as the default Jan 23, 2024
@antiagainst antiagainst changed the title [cuda] Switch cuda2 on as the default [cuda] Switch cuda2 on and cuda off by default Jan 23, 2024
@antiagainst antiagainst changed the title [cuda] Switch cuda2 on and cuda off by default [cuda] Switch cuda2 on and cuda1 off by default Jan 23, 2024
@antiagainst antiagainst marked this pull request as ready for review January 23, 2024 19:16
@antiagainst
Contributor Author

Okay, this is good to go now. Only two benchmarks regressed, and only slightly; I won't worry about that too much.

@ScottTodd ScottTodd (Collaborator) left a comment


Thanks for staging this work into separable PRs! Next steps are to remove cuda1 and drop the '2' from cuda2 names?

@antiagainst
Copy link
Contributor Author

> Thanks for staging this work into separable PRs! Next steps are to remove cuda1 and drop the '2' from cuda2 names?

Yup, exactly.

@antiagainst antiagainst merged commit 3b3cef9 into iree-org:main Jan 23, 2024
58 checks passed
@antiagainst antiagainst deleted the cuda2-try-switch branch January 23, 2024 22:11
Labels
benchmarks:cuda Run default CUDA benchmarks hal/cuda Runtime CUDA HAL backend
Development

Successfully merging this pull request may close these issues.

[Epic] CUDA HAL driver rewrite for production
3 participants