Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pixel Heterogeneous DQM: add module rechit @cpu / @gpu comparisons #37860

Merged
merged 3 commits into from May 20, 2022

Conversation

sroychow
Copy link
Contributor

@sroychow sroychow commented May 9, 2022

PR description:

Code is by @arossi83
Adding a module to compare rechits @gpu/@cpu comparisons. The 2D correlation plots are done per layer/disk for Bpix/Fpix.

PR validation:

wfs with .503 runs successfully.

if this PR is a backport please specify the original PR and why you need to backport that PR:

Backport to 12.3.X will be needed.

@mmusich
Copy link
Contributor

mmusich commented May 9, 2022

test parameters:

  • enable_tests = gpu
  • workflows_gpu = 11634.503, 11634.506, 11634.583, 11634.587

@cmsbuild
Copy link
Contributor

cmsbuild commented May 9, 2022

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-37860/29808

  • This PR adds an extra 16KB to repository

  • There are other open Pull requests which might conflict with changes you have proposed:

    • File DQM/SiPixelPhase1Heterogeneous/python/SiPixelPhase1HeterogenousDQM_FirstStep_cff.py modified in PR(s): Follow up to 37617 #37798

Code check has found code style and quality issues which could be resolved by applying following patch(s)

Copy link
Contributor

@mmusich mmusich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

some preliminary comments

@@ -35,13 +36,25 @@
topFolderName = 'SiPixelHeterogeneous/PixelVertexSoAGPU',
)

siPixelPhase1MonitorRecHitsSoACPU = siPixelPhase1MonitorRecHitsSoA.clone(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mmusich
Copy link
Contributor

mmusich commented May 9, 2022

assign heterogeneous

@cmsbuild
Copy link
Contributor

cmsbuild commented May 9, 2022

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-37860/29810

  • This PR adds an extra 16KB to repository

  • There are other open Pull requests which might conflict with changes you have proposed:

    • File DQM/SiPixelPhase1Heterogeneous/python/SiPixelPhase1HeterogenousDQM_FirstStep_cff.py modified in PR(s): Follow up to 37617 #37798

@mmusich mmusich mentioned this pull request May 9, 2022
@cmsbuild
Copy link
Contributor

cmsbuild commented May 9, 2022

New categories assigned: heterogeneous

@fwyzard,@makortel you have been requested to review this Pull request/Issue and eventually sign? Thanks

@cmsbuild
Copy link
Contributor

cmsbuild commented May 9, 2022

A new Pull Request was created by @sroychow (Suvankar Roy Chowdhury) for master.

It involves the following packages:

  • DQM/SiPixelPhase1Heterogeneous (dqm)

@makortel, @emanueleusai, @ahmad3213, @cmsbuild, @jfernan2, @fwyzard, @pmandrik, @micsucmed, @rvenditti can you please review it and eventually sign? Thanks.
@hdelanno, @fioriNTU, @jandrea, @idebruyn, @mmusich, @threus this is something you requested to watch as well.
@perrotta, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

@mmusich
Copy link
Contributor

mmusich commented May 9, 2022

type trk

@cmsbuild cmsbuild added the trk label May 9, 2022
@emanueleusai
Copy link
Member

  • waiting for Marco's comments to be addressed before re-issuing the test command.

Comment on lines 100 to 101
float dx = soa2dCPU->xLocal(i) - soa2dGPU->xLocal(i);
float dy = soa2dCPU->yLocal(i) - soa2dGPU->yLocal(i);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is't the GPU index j ?
Then these should be

Suggested change
float dx = soa2dCPU->xLocal(i) - soa2dGPU->xLocal(i);
float dy = soa2dCPU->yLocal(i) - soa2dGPU->yLocal(i);
float dx = soa2dCPU->xLocal(i) - soa2dGPU->xLocal(j);
float dy = soa2dCPU->yLocal(i) - soa2dGPU->yLocal(j);

Comment on lines +103 to +132
if (distance < minD) {
minD = distance;
matchedHit = j;
}

This comment was marked as outdated.

Comment on lines +111 to +138
int16_t sizeXCPU = std::ceil(float(std::abs(soa2dCPU->clusterSizeX(i)) / 8.));
int16_t sizeYCPU = std::ceil(float(std::abs(soa2dCPU->clusterSizeY(i)) / 8.));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why the std::abs ?

@AdrianoDee
Copy link
Contributor

AdrianoDee commented May 12, 2022

If a backport to 12_3_X is needed for this PR, we will need backports of #37798 and #37617. I've opened #37933 if that's the case.

@mmusich
Copy link
Contributor

mmusich commented May 17, 2022

please test

  • let's try again

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-46f431/24803/summary.html
COMMIT: 8e54100
CMSSW: CMSSW_12_5_X_2022-05-17-1100/slc7_amd64_gcc10
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/37860/24803/install.sh to create a dev area with all the needed externals and cmssw changes.

GPU Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-46f431/11634.503_TTbar_14TeV+2021_Patatrack_PixelOnlyGPU_Validation+TTbar_14TeV_TuneCP5_GenSim+Digi+RecoNano+HARVESTNano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-46f431/11634.583_TTbar_14TeV+2021_Patatrack_AllGPU_Validation+TTbar_14TeV_TuneCP5_GenSim+Digi+RecoNano+HARVESTNano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-46f431/11634.587_TTbar_14TeV+2021_Patatrack_AllTripletsGPU_Validation+TTbar_14TeV_TuneCP5_GenSim+Digi+RecoNano+HARVESTNano

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 24 differences found in the comparisons
  • DQMHistoTests: Total files compared: 7
  • DQMHistoTests: Total histograms compared: 76180
  • DQMHistoTests: Total failures: 984
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 75196
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 1528.4199999999996 KiB( 6 files compared)
  • DQMHistoSizes: changed ( 11634.506 ): -3488.195 KiB SiPixelHeterogeneous/PixelRecHitsSoA
  • DQMHistoSizes: changed ( 11634.503,... ): 8648.654 KiB SiPixelHeterogeneous/PixelRecHitsCompareGPUvsCPU
  • DQMHistoSizes: changed ( 11634.503,... ): -3488.225 KiB SiPixelHeterogeneous/PixelRecHitsSoACPU
  • DQMHistoSizes: changed ( 11634.503,... ): -3488.225 KiB SiPixelHeterogeneous/PixelRecHitsSoAGPU
  • Checked 24 log files, 18 edm output root files, 7 DQM output files
  • TriggerResults: no differences found

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-46f431/11634.301_TTbar_14TeV+2021_Run3FS+TTbar_14TeV_TuneCP5_GenSim+HARVESTNano

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3688458
  • DQMHistoTests: Total failures: 2
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3688434
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -3488.195 KiB( 50 files compared)
  • DQMHistoSizes: changed ( 11634.501 ): -3488.195 KiB SiPixelHeterogeneous/PixelRecHitsSoA
  • Checked 212 log files, 48 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

@emanueleusai
Copy link
Member

a few comments:

  • There's a large number of failures, is this understood?
  • In 11634.506 baseline GUI in a large number of histos is changed and some are adde. I understand this is most likely caused by this PR, but I just want to make sure this is understood
  • Comparison and Comparison GPU have failed (probably because of the point above?)
  • [cosmetics] perhaps some 2d plots could be displayed in colz?

image

@mmusich
Copy link
Contributor

mmusich commented May 18, 2022

Hi @emanueleusai

There's a large number of failures, is this understood?

if you are referring to the histograms that change (and not the ones that are added or removed), as you might remember it was discussed in several other PRs, there is a small residual intrinsic non-reproducibility in the tracking GPU workflows caused by operations that do not commute when the execution order is changed.

In 11634.506 baseline GUI in a large number of histos is changed and some are adde. I understand this is most likely caused by this PR, but I just want to make sure this is understood.

I think it is.

Comparison and Comparison GPU have failed (probably because of the point above?)

Not sure to get what you mean by "failed" in this context

[cosmetics] perhaps some 2d plots could be displayed in colz?

the particular plot you link in the comment above #37860 (comment) has not been added in this PR, but in general, as you might be aware, that sort of cosmetics is generally dealt with at the level of render plugins in the GUI code and not in cmssw (we actually do plan to provide new render plugins once this is merged).

@emanueleusai
Copy link
Member

+1

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

@qliphy
Copy link
Contributor

qliphy commented May 20, 2022

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants