add hltTracksMerged monitoring (both DQM and Validation) #19594

Merged
merged 1 commit into cms-sw:CMSSW_9_2_X on Jul 20, 2017

mtosi
Contributor

@mtosi mtosi commented Jul 6, 2017

back-port of #19591

add HLTSiPixelMonitoring_cff.py as well

add HLTSiPixelMonitoring_cff.py as well
@mtosi
Contributor Author

mtosi commented Jul 6, 2017

@cmsbuild, please test

@cmsbuild
Contributor

cmsbuild commented Jul 6, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/21227/console Started: 2017/07/06 17:18

@cmsbuild
Contributor

cmsbuild commented Jul 6, 2017

A new Pull Request was created by @mtosi (mia tosi) for CMSSW_9_2_X.

It involves the following packages:

DQM/HLTEvF
DQMOffline/Trigger
Validation/RecoTrack

@cmsbuild, @vazzolini, @kmaeshima, @dmitrijus, @Martin-Grunewald, @silviodonato, @fwyzard, @vanbesien, @davidlange6 can you please review it and eventually sign? Thanks.
@battibass, @makortel, @felicepantaleo, @GiacomoSguazzoni, @jhgoh, @VinInn, @calderona, @HuguesBrun, @rovere, @wmtford, @ebrondol, @trocino, @dgulhan, @rociovilar this is something you requested to watch as well.
@davidlange6 you are the release manager for this.

cms-bot commands are listed here

@cmsbuild
Contributor

cmsbuild commented Jul 6, 2017

@cmsbuild
Contributor

cmsbuild commented Jul 6, 2017

Comparison job queued.

@cmsbuild
Contributor

cmsbuild commented Jul 6, 2017

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21227/summary.html

Comparison Summary:

  • You potentially added 929 lines to the logs
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 22
  • DQMHistoTests: Total histograms compared: 1813696
  • DQMHistoTests: Total failures: 35514
  • DQMHistoTests: Total nulls: 65
  • DQMHistoTests: Total successes: 1777951
  • DQMHistoTests: Total skipped: 166
  • DQMHistoTests: Total Missing objects: 0
  • Checked 90 log files, 14 edm output root files, 22 DQM output files
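A quick consistency check on the summary above: the failures, nulls, successes and skipped counts should add up to the total number of histograms compared. The following is a minimal sketch in Python; the dictionary keys and function names are illustrative and are not fields or functions of the cms-bot report.

```python
# Counts copied from the comparison summary above.
# The key names are illustrative, not cms-bot field names.
summary = {
    "total": 1813696,
    "failures": 35514,
    "nulls": 65,
    "successes": 1777951,
    "skipped": 166,
}


def categories_sum(counts):
    """Return the sum of the per-category histogram counts."""
    return (counts["failures"] + counts["nulls"]
            + counts["successes"] + counts["skipped"])


def failure_fraction(counts):
    """Return the fraction of compared histograms reported as failures."""
    return counts["failures"] / counts["total"]


if __name__ == "__main__":
    print("consistent:", categories_sum(summary) == summary["total"])
    print("failure fraction: {:.2%}".format(failure_fraction(summary)))
```

With these numbers the categories do add up to the total, and roughly 2% of the compared histograms are reported as failures.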

@Martin-Grunewald
Contributor

+1

@dmitrijus
Contributor

+1

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next CMSSW_9_2_X IBs (tests are also fine) and once validation in the development release cycle CMSSW_9_3_X is complete. This pull request requires discussion in the ORP meeting before it's merged. @davidlange6, @smuzaffar

@cmsbuild
Contributor

-1

Tested at: abd1f59

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:
2c69321
You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21600/git-log-recent-commits
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21600/git-merge-result

You can see the results of the tests here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21600/summary.html

I found the following errors while testing this PR

Failed tests: AddOn

  • AddOn:

I found errors in the following addon tests:

@cmsbuild
Contributor

Comparison job queued.

@Martin-Grunewald
Contributor

Martin-Grunewald commented Jul 19, 2017

@smuzaffar
Hi,

Could you please have a look at the reporting of the above addOnTests error? The above message is not very helpful in indicating which of the addOnTests has failed - this used to be indicated in the past.
Also, as before, running all addOnTests in my area I do not find any error.

@smuzaffar
Contributor

@Martin-Grunewald, looking at https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21600/addOnTests.log, I see that the log is incomplete, which means the addOnTests timed out after 1.5 hours. This test ran on a large machine (24 cores and 48 GB of RAM), so I guess there was some slowness in reading or accessing data.
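To illustrate how a test runner could tell a timeout apart from a genuine failure and name the affected test in its report, here is a minimal sketch in Python. It is not how cms-bot or addOnTests.py is implemented; `run_with_timeout`, the test names and the commands are hypothetical, and only the standard-library `subprocess` module is used.

```python
import subprocess
import sys


def run_with_timeout(name, cmd, timeout_s):
    """Run one command and classify the result.

    Returns (name, status), where status is 'ok', 'failed' or 'timeout'.
    The function and its arguments are hypothetical, for illustration only.
    """
    try:
        proc = subprocess.run(cmd, timeout=timeout_s, capture_output=True)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising TimeoutExpired.
        return (name, "timeout")
    status = "ok" if proc.returncode == 0 else "failed"
    return (name, status)


if __name__ == "__main__":
    # Stand-in commands; a real runner would launch the actual test steps.
    quick = [sys.executable, "-c", "pass"]
    slow = [sys.executable, "-c", "import time; time.sleep(5)"]
    for result in (run_with_timeout("quick", quick, 30.0),
                   run_with_timeout("slow", slow, 0.5)):
        print(result)
```

Reporting the status per test name would make it clear whether a failure is a run-time error or a job that was cut off at the time limit.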

@Martin-Grunewald
Contributor

@smuzaffar
Since this seems to happen more often lately, could you please increase the timeout to 2 h?

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@Martin-Grunewald
Contributor

The full machine for one PR - or shared with other PR tests?

@Martin-Grunewald
Contributor

I do not buy that stack trace as a run-time error, as I cannot reproduce it offline. I rather think the stack trace comes from the external signal that kills the job once the run time exceeds 1.5 h.

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@Martin-Grunewald
Contributor

Hmm, OK - but are all CPUs fully used? I.e., how many addOnTests are run in parallel? It can be at most 21 - the number of addOnTests - and some are much shorter than others. The steps within a single addOnTest cannot be parallelised, as they need to run sequentially - the output of one feeds the input of the next.
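The constraint described here, independent tests running side by side while the steps inside each test run strictly in order, can be sketched as follows. This is an illustration in Python with hypothetical names (`run_chain`, `run_all`) and toy steps; it is not the addOnTests.py implementation.

```python
from concurrent.futures import ThreadPoolExecutor


def run_chain(steps, initial=None):
    """Run the steps of one test in order; each step consumes the previous output."""
    value = initial
    for step in steps:
        value = step(value)
    return value


def run_all(chains, max_workers):
    """Run independent chains concurrently.

    Useful parallelism is capped at the number of chains, so a machine with
    more cores than chains cannot be kept fully busy this way.
    """
    workers = max(1, min(max_workers, len(chains)))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {name: pool.submit(run_chain, steps)
                   for name, steps in chains.items()}
        return {name: future.result() for name, future in futures.items()}


if __name__ == "__main__":
    # Toy chains standing in for multi-step tests (hypothetical names).
    chains = {
        "test_a": [lambda _: 1, lambda x: x + 1],
        "test_b": [lambda _: 10, lambda x: x * 2],
    }
    print(run_all(chains, max_workers=24))
```

Because the chains differ in length, the total wall time is bounded from below by the longest chain, however many workers are available.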

@smuzaffar
Contributor

@Martin-Grunewald, when a PR test starts, it occupies the full machine. We run N parallel jobs, where N is the number of cores available on that machine.

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@smuzaffar
Contributor

@davidlange6, if you are referring to https://github.com/cms-sw/cmssw/blob/master/Utilities/ReleaseScripts/scripts/addOnTests.py#L265 then do not worry about that, as we run addOnTests.py -j N, and I do not think each addOn job runs in threaded mode, so we should be fine running N jobs.

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@smuzaffar
Contributor

@davidlange6, I see that now.
I do not think all release cycles use 4 threads for addOn tests, so blindly running N/4 processes would underutilize the resources.
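The trade-off can be made concrete with a small calculation. The sketch below, in Python, assumes the number of threads per job is known for the release cycle in question; `max_parallel_jobs` and its parameters are hypothetical names, and how the thread count would be discovered is not shown.

```python
def max_parallel_jobs(cores, threads_per_job=1):
    """Return how many jobs fit on a machine without oversubscribing its cores.

    Both arguments are assumptions supplied by the caller; at least one job
    is always allowed so that the tests can run at all.
    """
    if cores < 1 or threads_per_job < 1:
        raise ValueError("cores and threads_per_job must be positive")
    return max(1, cores // threads_per_job)


if __name__ == "__main__":
    # 24 cores, as on the test machine mentioned earlier in this thread.
    print(max_parallel_jobs(24, threads_per_job=4))  # jobs using 4 threads each
    print(max_parallel_jobs(24, threads_per_job=1))  # single-threaded jobs
```

On a 24-core machine this gives 6 jobs when each uses 4 threads and 24 jobs when each is single-threaded, which is why a fixed N/4 would leave cores idle in release cycles whose jobs are not threaded.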

@davidlange6
Contributor

davidlange6 commented Jul 19, 2017 via email

@cmsbuild
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19594/21600/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 22
  • DQMHistoTests: Total histograms compared: 1756608
  • DQMHistoTests: Total failures: 17459
  • DQMHistoTests: Total nulls: 33
  • DQMHistoTests: Total successes: 1738950
  • DQMHistoTests: Total skipped: 166
  • DQMHistoTests: Total Missing objects: 0
  • Checked 90 log files, 14 edm output root files, 22 DQM output files

@davidlange6
Contributor

merge

@cmsbuild cmsbuild merged commit 24cd051 into cms-sw:CMSSW_9_2_X Jul 20, 2017
@JanFSchulte
Contributor

Thanks for merging this! One question: When do we plan to have a new 9_2_X release? Without this PR, we are not able to monitor the HLT tracking in the data we are taking right now.
