-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
maxIterGPU
assert failure in alpaka_serial_sync::pixelClustering::FindClus
#44077
Comments
cms-bot internal usage |
A new Issue was created by @missirol. @rappoccio, @makortel, @antoniovilela, @Dr15Jones, @sextonkennedy, @smuzaffar can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign heterogeneous, reconstruction, hlt @cms-sw/trk-dpg-l2 FYI |
New categories assigned: heterogeneous,reconstruction,hlt @Martin-Grunewald,@mmusich,@fwyzard,@jfernan2,@makortel,@mandrenguyen you have been requested to review this Pull request/Issue and eventually sign? Thanks |
The assertion fails because in
that require 17 iterations, while In the CUDA version the block size is 384 (to accommodate for |
+hlt explicitly tested with: cmsrel CMSSW_14_0_0
cd CMSSW_14_0_0/src/
cmsenv
git cms-merge-topic 44082
scram b -j 20 and then following the recipe at #44077 (comment) |
+heterogeneous |
Running a recent HLT menu with
customizeHLTforAlpaka
inCMSSW_14_0_0
as in [1] leads to a runtime error.The full stack trace from running [1] can be found in pixel_findclus_cpu.log. Note that [1] forces the job to run on CPU only.
A similar crash occurs also on GPU (stack track in pixel_findclus_gpu.log), but a GPU is not needed to reproduce the issue.
There is no runtime error if the Alpaka customisation is not used.
Could experts please have a look ?
FYI: @AdrianoDee @borzari @fwyzard @cms-sw/hlt-l2
[1]
PS. Just for my own reference, I encountered this crash while testing a recent HLT menu in 14_0_X on one of the HiLTON nodes as described here.
The text was updated successfully, but these errors were encountered: