Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Responding to PR 24183 (Updating gridpacks for GEN related Rel-Val failures) #24268

Merged
merged 3 commits into from Aug 14, 2018

Conversation

Saptaparna
Copy link
Contributor

Updated to CMSSW master and includes my changes.

@cmsbuild
Copy link
Contributor

The code-checks are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @Saptaparna (Saptaparna Bhattacharya) for master.

It involves the following packages:

Configuration/Generator

@alberto-sanchez, @cmsbuild, @efeyazgan, @perrozzi can you please review it and eventually sign? Thanks.
@Martin-Grunewald this is something you requested to watch as well.
@davidlange6, @slava77, @fabiocos you are the release manager for this.

cms-bot commands are listed here

@alberto-sanchez
Copy link
Member

please test workflow 562.0, 25210.0, 1360.0

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 11, 2018

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/29782/console Started: 2018/08/11 18:06

@cmsbuild
Copy link
Contributor

-1

Tested at: 7bd2ba3

You can see the results of the tests here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-24268/29782/summary.html

I found follow errors while testing this PR

Failed tests: RelVals

  • RelVals:

The relvals timed out after 2 hours.

@cmsbuild
Copy link
Contributor

Comparison not run due to runTheMatrix errors (RelVals and Igprof tests were also skipped)

@Saptaparna
Copy link
Contributor Author

@alberto-sanchez Indeed, I also see the problem when testing interactively. The problem is with the ttbar gridpack and hence affects workflows 25210.0 and 1360.0.

Updating the gridpack to an ever more recent one (MG5 260) seems to solve the problem. Running local tests now.

@alberto-sanchez
Copy link
Member

please test workflow 562.0, 25210.0, 1360.0

@cmsbuild
Copy link
Contributor

@cmsbuild
Copy link
Contributor

Comparison job queued.

@cmsbuild
Copy link
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-24268/29798/summary.html

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /build/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/JR-comparison/PR-24268/1360.0_TTbar012Jets_NLO_Mad_py8_Evt_13+TTbar012Jets_NLO_Mad_py8_Evt_13+DIGIUP15+RECOUP15+HARVESTUP15
  • /build/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/JR-comparison/PR-24268/25210.0_TTbar012Jets_NLO_Mad_py8_Evt_13+TTbar012Jets_NLO_Mad_py8_Evt_13+DIGIUP15_PU25+RECOUP15_PU25+HARVESTUP15_PU25
  • /build/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/JR-comparison/PR-24268/562.0_BulkG_ZZ_2L2Q_M1200_narrow_13TeV_pythia8+BulkG_M1200_narrow_2L2Q_LHE_13TeV+Hadronizer_TuneCUETP8M1_Mad_pythia8+HARVESTGEN2

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 28
  • DQMHistoTests: Total histograms compared: 2811739
  • DQMHistoTests: Total failures: 1
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2811557
  • DQMHistoTests: Total skipped: 181
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 27 files compared)
  • Checked 122 log files, 14 edm output root files, 28 DQM output files

@Saptaparna
Copy link
Contributor Author

@alberto-sanchez Does this mean the PR can be merged?

@boudoul
Copy link
Contributor

boudoul commented Aug 13, 2018

Dear @Saptaparna , please make an explicit title an better explanation,- we can't guess from this title if you are talking about HCAL , Tracker, Trigger, Generator, and what you changed .... Thank you

@Saptaparna
Copy link
Contributor Author

@boudoul This PR is a response to some requests in #24183 and refers to workflows 562.0, 25210.0, 1360.0 that failed at the generator step. The problem was due to the presence of old gridpacks. The paths of the gridpacks have been updated and based on the last code check do not lead to crashes during Rel-Val production.

@boudoul
Copy link
Contributor

boudoul commented Aug 13, 2018

I know , but we can't guess from the title 'Responding to PR 24183', if you are talking about HCAL , Tracker, Trigger, Generator .

The title of PRS are very important because intensively used by validators (for example) or to easily keep track of what is entering in a given release: it is what will appear in the release notes and we should know what is the topic of a PR by just reading it .

So please edit the title [you can keep 'Responding to PR 24183' if you want in it] but please add something more explicit in addition- The edit button is next to the title

@Saptaparna Saptaparna changed the title Responding to PR 24183 Responding to PR 24183 (Updating gridpacks for GEN related Rel-Val failures) Aug 13, 2018
@alberto-sanchez
Copy link
Member

+1

@alberto-sanchez
Copy link
Member

errors went away

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

@fabiocos
Copy link
Contributor

@Saptaparna for my understanding, how do these gridpacks solve the problem, without specification of another environment? Are they simply produced with more recent CMSSW versions?

@fabiocos
Copy link
Contributor

@Saptaparna hmm, not really looking at the name: BulkGraviton_ZZ_inclu_narrow_M1200_slc6_amd64_gcc481_CMSSW_7_1_30_tarball

@fabiocos
Copy link
Contributor

@Saptaparna @alberto-sanchez could you please clarify how these new gridpacks are solving the problem? I am pleased of the solution of the issue, but I would like to understand the sgtrategy behind. At some point we will need to discuss a global strategy valid in general

@Saptaparna
Copy link
Contributor Author

@fabiocos Apologies for the slow response. The problematic gridpacks were the ones that did not include some of the new features in the runcmsgrid script (https://github.com/cms-sw/genproductions/blob/master/bin/MadGraph5_aMCatNLO/runcmsgrid_LO.sh#L20ToL42). The updated gridpacks were produced with newer versions of MadGraph (2.2.2 vs 2.4.2) and a new version for script referred to above. One thing to note is that even with MadGraph 2.4.2, the ttbar workflow repeatedly failed (investigating as why this happened, possibly that gridpack was corrupted for some reason). Updating the ttbar workflow to a MG5 2.6.0 gridpack fixed the problem. It maybe worth digging deeper to understand what worked in 260 that didn't work in 242. However, given the recent push to upgrade genproductions to 260 (branch 260 has existed for many months, but now 260 will be upgraded to master) in general may mean that soon 242 gridpacks may have only limited use.

@fabiocos
Copy link
Contributor

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants