New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
reduce size of isOuterHitOfCell in patatrack #35285
Conversation
@cmsbuild , please test |
enable gpu |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-35285/25287
|
A new Pull Request was created by @VinInn (Vincenzo Innocente) for master. It involves the following packages:
@makortel, @jpata, @fwyzard, @slava77 can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
RecoPixelVertexing/PixelTriplets/plugins/CAHitNtupletGeneratorKernels.cc
Outdated
Show resolved
Hide resolved
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-822c8d/18626/summary.html GPU Comparison SummarySummary:
Comparison SummarySummary:
|
will wait for further comments before making a new commit to fix what found so far |
@cmsbuild , please test |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-35285/25443
|
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-822c8d/18816/summary.html GPU Comparison SummarySummary:
Comparison SummarySummary:
|
+reconstruction
|
@cms-sw/heterogeneous-l2 |
+heterogeneous |
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2) |
+1 |
For the record, here is a comparison of the throughput with respect to CMSSW_12_1_0_pre3, running over (uncompressed) TTbar events with pileup:
|
hits in BPIX1 can not be OuterHitOfCell. BPIX1 holds 1/ 4 of the hits.
This PR removes BPIX1 hits from isOuterHitOfCell.
Properly resize the array and the kernel launch boundaries.
Took the opportunity to reduce the number of device2host memcpy as well.
small throughput improvement observed.
technical: no regression expected. no regression observed.