-
Notifications
You must be signed in to change notification settings - Fork 124
[L0] create abstraction for ur_event_handle_t #1909
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
to allow switching the implementation to v2::ur_event_handle_t. This is similar to how ur_queue_handle_t behaves right now.
|
Compute Benchmarks level_zero run (with params: ): |
|
Compute Benchmarks level_zero run (): Summary
Benchmark Results---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title api_overhead_benchmark_sycl, mean execution time per 10 kernels
todayMarker off
dateFormat X
axisFormat %s
section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)<br>Imm-CmdLists-OFF
This PR (23.04 μs) : crit, 0, 23
baseline (22.705 μs) : 0, 22
- : 0, 0
- : 0, 0
section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)<br>Imm-CmdLists-OFF
This PR (26.196 μs) : crit, 0, 26
baseline (23.606 μs) : 0, 23
- : 0, 0
- : 0, 0
section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)<br>
This PR (23.467 μs) : crit, 0, 23
baseline (23.62 μs) : 0, 23
- : 0, 0
- : 0, 0
section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)<br>
This PR (25.934 μs) : crit, 0, 25
baseline (25.476 μs) : 0, 25
- : 0, 0
- : 0, 0
---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title Velocity-Bench Hashtable
todayMarker off
dateFormat X
axisFormat %s
section hashtable<br>Imm-CmdLists-OFF
This PR (328.870693 M keys/sec) : crit, 0, 328
baseline (306.262877 M keys/sec) : 0, 306
- : 0, 0
- : 0, 0
section hashtable<br>
This PR (331.496686 M keys/sec) : crit, 0, 331
baseline (360.15055 M keys/sec) : 0, 360
- : 0, 0
- : 0, 0
---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title Velocity-Bench Bitcracker
todayMarker off
dateFormat X
axisFormat %s
section bitcracker<br>Imm-CmdLists-OFF
This PR (35.6003 s) : crit, 0, 35
baseline (39.0378 s) : 0, 39
- : 0, 0
- : 0, 0
section bitcracker<br>
This PR (35.6791 s) : crit, 0, 35
baseline (35.6105 s) : 0, 35
- : 0, 0
- : 0, 0
---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title Velocity-Bench Easywave
todayMarker off
dateFormat X
axisFormat %s
section easywave<br>Imm-CmdLists-OFF
This PR (435 ms) : crit, 0, 435
baseline (606.0 ms) : 0, 606
- : 0, 0
- : 0, 0
section easywave<br>
This PR (247 ms) : crit, 0, 247
baseline (241.0 ms) : 0, 241
- : 0, 0
- : 0, 0
---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title Velocity-Bench QuickSilver
todayMarker off
dateFormat X
axisFormat %s
section QuickSilver<br>
This PR (117.45 MMS/CTT) : crit, 0, 117
baseline (110.88 MMS/CTT) : 0, 110
- : 0, 0
- : 0, 0
---
config:
gantt:
rightPadding: 10
leftPadding: 120
sectionFontSize: 10
numberSectionStyles: 2
---
gantt
title Velocity-Bench Sobel Filter
todayMarker off
dateFormat X
axisFormat %s
section sobel_filter<br>Imm-CmdLists-OFF
This PR (609.857 ms) : crit, 0, 609
baseline (609.227 ms) : 0, 609
- : 0, 0
- : 0, 0
section sobel_filter<br>
This PR (604.705 ms) : crit, 0, 604
baseline (548.773 ms) : 0, 548
- : 0, 0
- : 0, 0
DetailsSubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0) Imm-CmdLists-OFFEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/test-user/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0) Imm-CmdLists-OFFEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/test-user/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0)Environment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/test-user/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0)Environment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/test-user/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type hashtable Imm-CmdLists-OFFEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify Output:hashtable - total time for whole calculation: 0.408117 s hashtableEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify Output:hashtable - total time for whole calculation: 0.404884 s bitcracker Imm-CmdLists-OFFEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/test-user/bench_workdir/bitcracker/bitcracker -f /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000 Output:---------> BitCracker: BitLocker password cracking tool <--------- ==================================
|
pbalcer
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not the biggest fan of having to do this, but I don't see a better option...
|
Alternative approach (not fully working yet): #1913 |
|
Replaced by #1913 |
to allow switching the implementation to v2::ur_event_handle_t. This is similar to how ur_queue_handle_t behaves right now.