[NFC][SYCL] Avoid unnecessary std::unique_ptr<HostKernel<...>>
instantiations
#17640
The change improves "host" compilation times for cases with multiple kernels.
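A minimal sketch of how this kind of saving can arise, assuming simplified stand-ins for the runtime classes. `HostKernelBase`, `HostKernel`, `Handler`, and both setter names are hypothetical and do not reproduce the PR's actual diff. The point is that every distinct kernel type forces a fresh `std::unique_ptr<HostKernel<KernelType>>` instantiation in the first setter, while the second only ever touches the single `std::unique_ptr<HostKernelBase>` specialization.

```cpp
#include <memory>
#include <utility>

// Hypothetical stand-ins, not the actual DPC++ runtime classes.
struct HostKernelBase {
  virtual ~HostKernelBase() = default;
  virtual int call() = 0;
};

template <typename KernelType> struct HostKernel : HostKernelBase {
  explicit HostKernel(KernelType K) : MKernel(std::move(K)) {}
  int call() override { return MKernel(); }
  KernelType MKernel;
};

struct Handler {
  std::unique_ptr<HostKernelBase> MHostKernel;

  // Costlier per kernel: instantiates std::make_unique and
  // std::unique_ptr<HostKernel<KernelType>> for every KernelType, plus the
  // converting move into std::unique_ptr<HostKernelBase>.
  template <typename KernelType> void setKernelPerType(KernelType K) {
    MHostKernel = std::make_unique<HostKernel<KernelType>>(std::move(K));
  }

  // Cheaper per kernel: only std::unique_ptr<HostKernelBase> is involved,
  // and that one specialization is shared by all kernels.
  template <typename KernelType> void setKernelShared(KernelType K) {
    MHostKernel.reset(new HostKernel<KernelType>(std::move(K)));
  }
};
```

Both setters leave the handler in the same state, which is consistent with the NFC tag: only the set of templates the host compiler has to instantiate differs.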
Out of curiosity, do you happen to have performance improvement numbers? If so, please share them.
14.4s -> 8.6s for
@aelovikov-intel, are you using a debug build of the compiler? All these numbers seem to be too high.
It's 400 kernels, see compile-time loops.
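The benchmark source is not shown in this thread, so the following is only a sketch of what a compile-time loop producing many distinct kernels could look like. `submit`, `Kernel`, `submitAll`, and `submitN` are hypothetical names, and `submit` stands in for a real kernel submission; each index yields its own kernel type and therefore its own set of template instantiations.

```cpp
#include <cstddef>
#include <utility>

// Hypothetical stand-in for a kernel submission: every distinct KernelType
// produces a separate instantiation of this template.
template <typename KernelType> int submit(KernelType K) { return K(); }

// One distinct kernel type per index.
template <std::size_t I> struct Kernel {
  int operator()() const { return static_cast<int>(I); }
};

template <std::size_t... Is> int submitAll(std::index_sequence<Is...>) {
  int Sum = 0;
  // Pack expansion inside a braced initializer: one submit call per index,
  // evaluated left to right.
  int Expand[] = {0, (Sum += submit(Kernel<Is>{}))...};
  (void)Expand;
  return Sum;
}

// Instantiates N different kernels in a single translation unit.
template <std::size_t N> int submitN() {
  return submitAll(std::make_index_sequence<N>{});
}
```

Calling `submitN<400>()` makes the compiler instantiate 400 kernel types at once, which is the regime where a small per-kernel instantiation cost becomes visible in total compile time.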
@aelovikov-intel, would you mind checking whether this change improves the compile time of SYCL-CTS, please? SYCL-CTS compilation time with the DPC++ compiler on the GitHub runner exceeds the limit. I wonder if this change helps to fix this problem.
Surprisingly, it might. 13m -> 10.5m on SPR+PVC, where there are a few tests that are bottlenecks. On a less powerful system it might be even more impactful. That said, I'm not sure how stable/reproducible the gain is.
Thanks! I expect tests checking math built-ins and vector operations to benefit from this change. These tests auto-generate a lot of kernels.