Change premixing library format from RAW to DIGI #24184

Merged
merged 8 commits into from Sep 25, 2018

@makortel makortel commented Aug 3, 2018

The premixing for phase2 already uses (sim) DIGI as the premixing library format, because it is simpler (no need to pack and unpack) at only a tiny cost in file size. As outlined in the spring O&C week
https://indico.cern.ch/event/711343/contributions/2964389/attachments/1631761/2601778/slides_premix_20180412.pdf
this PR does the same for the run2 premixing. In addition

  • L1 is removed from the stage1 workflow (the one that produces the premixing library), where it is unnecessary
  • PREMIX event content definitions are distributed to the corresponding packages
    • I hope this is another step toward making others more aware of what happens in premixing

Since the packing+unpacking are (or can be) lossy, there will be small changes in run2 FullSim and FastSim. However, the changes should be towards the classical mixing (as the workflow becomes more similar), and this is what I see e.g. for CSC digis.
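
To illustrate why a pack+unpack round trip can be lossy, here is a toy sketch. It is not the CMSSW packing code: the 8-bit width and the `pack`/`unpack` helpers are assumptions made up for this example. It only shows that values which do not fit the packed format come back changed, while values that fit survive intact.

```python
def pack(adc, n_bits=8):
    """Toy packer (hypothetical): clamp an ADC count to the range that fits in n_bits."""
    return min(max(adc, 0), (1 << n_bits) - 1)


def unpack(word):
    """Toy unpacker (hypothetical): the packed word is read back unchanged."""
    return word


# Made-up ADC counts; the last two exceed the 8-bit range.
digis = [12, 200, 255, 300, 1023]
round_trip = [unpack(pack(adc)) for adc in digis]

# Pairs (original, after round trip) for the values that were altered.
changed = [(a, b) for a, b in zip(digis, round_trip) if a != b]
print(changed)
```

Keeping the DIGI collections directly in the library skips this round trip entirely, which is the simplification the PR describes.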

Here are links (to my private DQM GUI; black is the release and blue is this PR) to a 100-event comparison of 250202.181 (2018 ttbar+PU FullSim)
http://127.0.0.1:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

10-event for 250202.171 (2017)
http://127.0.0.1:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_2017_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_2017_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

250202.1 (2016)
http://127.0.0.1:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_2016_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_2016_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

and 250402.1 (2016 FastSim, added in #24149)
http://127.0.0.1:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_FS2016_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_FS2016_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

with the following SSH tunnel recipe

ssh -L8081:mkdev.cern.ch:8081 lxplus.cern.ch
http://127.0.0.1:8081/dqm/relval/
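
The comparison links above all share one shape, so they can be rebuilt for either host (the tunnelled `127.0.0.1:8081` or `mkdev.cern.ch:8081`). The helper below is a sketch only: `dqm_comparison_url` is a hypothetical name, and the parameter list is simply copied from the links in this thread rather than taken from any DQM GUI documentation.

```python
from urllib.parse import quote


def dqm_comparison_url(host, dataset, reference):
    """Build an overlay URL with the same parameters as the links in this thread."""
    # The reference is written as "other::<dataset>:" with the colons percent-encoded.
    reference_obj = quote(f"other::{reference}:", safe="/")
    params = [
        ("runnr", "1"),
        ("dataset", dataset),
        ("sampletype", "offline_relval"),
        ("filter", "all"),
        ("referencepos", "ratiooverlay"),
        ("referenceshow", "all"),
        ("referencenorm", "True"),
        ("referenceobj1", reference_obj),
        ("referenceobj2", "none"),
        ("referenceobj3", "none"),
        ("referenceobj4", "none"),
        ("search", ""),
        ("striptype", "object"),
        ("stripruns", ""),
        ("stripaxis", "run"),
        ("stripomit", "none"),
        ("workspace", "Everything"),
        ("size", "M"),
        ("root", ""),
        ("focus", ""),
        ("zoom", "no"),
    ]
    # The GUI links separate parameters with ";" and end with a trailing ";".
    query = "".join(f"{key}={value};" for key, value in params)
    return f"http://{host}/dqm/relval/start?{query}"


url = dqm_comparison_url(
    "127.0.0.1:8081",
    "/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_orig_v1-v1/DQMIO",
    "/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_dev_v1-v1/DQMIO",
)
print(url)
```

Passing `"mkdev.cern.ch:8081"` as the host gives the variant that works only from inside the CERN network.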

Tested in 10_2_0 (rebased on top of CMSSW_10_3_X_2018-08-29-2300, then CMSSW_10_3_X_2018-09-04-1100), expecting small changes in run2 FullSim and FastSim premixing, but no changes in phase2 (FullSim) premixing. Because the premixing library format changes, this PR breaks all stage2-only premixing workflows.

@kpedro88 @mdhildreth

cmsbuild commented Aug 3, 2018

The code-checks are being triggered in jenkins.

cmsbuild commented Aug 3, 2018

A new Pull Request was created by @makortel (Matti Kortelainen) for master.

It involves the following packages:

Configuration/EventContent
Configuration/PyReleaseValidation
Configuration/StandardSequences
EventFilter/ESDigiToRaw
EventFilter/SiStripRawToDigi
L1Trigger/Configuration
SimCalorimetry/Configuration
SimGeneral/Configuration
SimGeneral/PreMixingModule
SimMuon/Configuration
SimTracker/Configuration

@perrotta, @pgunnell, @prebello, @cmsbuild, @thomreis, @zhenhu, @emeschi, @civanch, @mdhildreth, @fabozzi, @nsmith-, @rekovic, @franzoni, @kpedro88, @mommsen, @slava77, @GurpreetSinghChahal, @fabiocos, @davidlange6 can you please review it and eventually sign? Thanks.
@echabert, @trocino, @battibass, @felicepantaleo, @abbiendi, @GiacomoSguazzoni, @gbenelli, @jhgoh, @VinInn, @Martin-Grunewald, @calderona, @prolay, @HuguesBrun, @ebrondol, @drkovalskyi, @threus, @dildick, @mmusich, @rovere, @dgulhan, @folguera this is something you requested to watch as well.
@davidlange6, @slava77, @fabiocos you are the release manager for this.

cms-bot commands are listed here

@makortel makortel changed the title from "Replace the primixing library format from RAW to DIGI" to "Change primixing library format from RAW to DIGI" on Aug 3, 2018

makortel commented Aug 3, 2018

@cmsbuild, please test workflow 250202.1,250202.171,250402.1 with #24149

cmsbuild commented Aug 3, 2018

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/29635/console Started: 2018/08/03 10:30

makortel commented Aug 3, 2018

The premixing library file size stays essentially the same. In 2018 ttbar+PU, the FEDRawDataCollection currently takes 3.3 MB/event (compressed, in a 100-event test), and with this PR the corresponding DIGI collections take 3.0 MB/event (compressed, same events).
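
For reference, the relative size change implied by these two numbers can be worked out as follows. This is plain arithmetic on the figures quoted above; the variable names are chosen only for this example.

```python
def relative_change(before, after):
    """Fractional change going from `before` to `after`."""
    return (after - before) / before


raw_mb_per_event = 3.3   # FEDRawDataCollection, compressed (from the comment above)
digi_mb_per_event = 3.0  # corresponding DIGI collections, compressed

change = relative_change(raw_mb_per_event, digi_mb_per_event)
print(f"{change:+.1%}")  # -9.1%
```

So on this 100-event test the DIGI format is about 9% smaller per event than RAW.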

prebello commented Aug 3, 2018

hi @makortel I cannot access your links above (private DQM GUI)

makortel commented Aug 3, 2018

@prebello See the SSH tunnel recipe

ssh -L8081:mkdev.cern.ch:8081 lxplus.cern.ch
http://127.0.0.1:8081/dqm/relval/

or use the links below, which work only from within the CERN network

250202.181 (2018 ttbar+PU FullSim)
http://mkdev.cern.ch:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU50_2018_premixing_100ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

10-event for 250202.171 (2017)
http://mkdev.cern.ch:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_2017_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_2017_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

250202.1 (2016)
http://mkdev.cern.ch:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_2016_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_2016_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

and 250402.1 (2016 FastSim, added in #24149)
http://mkdev.cern.ch:8081/dqm/relval/start?runnr=1;dataset=/RelValTTbar_13/CMSSW_10_2_0-PU35_FS2016_premixing_10ev_orig_v1-v1/DQMIO;sampletype=offline_relval;filter=all;referencepos=ratiooverlay;referenceshow=all;referencenorm=True;referenceobj1=other%3A%3A/RelValTTbar_13/CMSSW_10_2_0-PU35_FS2016_premixing_10ev_dev_v1-v1/DQMIO%3A;referenceobj2=none;referenceobj3=none;referenceobj4=none;search=;striptype=object;stripruns=;stripaxis=run;stripomit=none;workspace=Everything;size=M;root=;focus=;zoom=no;

cmsbuild commented Aug 3, 2018

Comparison job queued.

@kpedro88

@mommsen @emeschi @rekovic @thomreis @nsmith- please sign

@fabiocos

@mommsen @emeschi @rekovic @thomreis @nsmith- could you please check and sign it in case?

@fabiocos

@rekovic @thomreis @nsmith- we need to converge with this PR for 10_3_X, could you please express possible concerns that are keeping your signature on hold? Or sign the PR?

@fabiocos

+operations

the event content is modified for step2 of the test workflow, i.e. premix stage1, without evidence of loss of functionality

rekovic commented Sep 25, 2018

+1

@fabiocos

+1

@fabiocos

merge

@emeschi @mommsen changes are affecting only simulation, please check and sign it anyway in case

@cmsbuild cmsbuild merged commit 84f0523 into cms-sw:master Sep 25, 2018
@fabiocos

@makortel as you have warned, a number of premixing test workflows are broken because of this PR, starting from CMSSW_10_3_X_2018-09-25-1100. In my understanding the fix passes through a regeneration of the input files with the new event content, so it is almost unavoidable to have a period when some IBs are broken, please correct me if I am wrong. On one hand I think that it would be useful to have this feature integrated in 10_3_X for easier tests, although in order to fix the situation and have a complete premixing validation we would need one extra pre-release cycle.

@franzoni @prebello @pgunnell @zhenhu could you please clarify whether it is reasonable to quickly produce new input files and have premixing validation in a possible fast 10_3_0_pre6 done just for this purpose?

davidlange6 commented Sep 26, 2018 via email

@prebello

@fabiocos, as @davidlange6 said above, as soon as this PR is merged for the next pre release then the RelValPREMIX... libraries will be regenerated with the new input format.

@fabiocos

@prebello @davidlange6 that new input files are produced with pre5 is what I expect, but in principle then PyReleaseValidation should be updated accordingly in places like

https://cmssdt.cern.ch/lxr/source/Configuration/PyReleaseValidation/python/relval_steps.py#0612

I understand that this is going to be done manually, providing a PR that can be quickly merged to stop the failures, but without the need for an explicit build to start the production of premixed samples for the PMX campaign. Is this what you mean?

davidlange6 commented Sep 26, 2018 via email

@prebello

@fabiocos the strings # 8 - fastSim premixed MinBias and #15 - fullSim PU 25ns UP18 premix will be updated by hand as usual

@makortel

@fabiocos

a number of premixing test workflows are broken because of this PR, starting from CMSSW_10_3_X_2018-09-25-1100. In my understanding the fix passes through a regeneration of the input files with the new event content, so it is almost unavoidable to have a period when some IBs are broken, please correct me if I am wrong.

This is correct (and unfortunate, I agree, but I don't know a way around it).

@prebello

@fabiocos

a number of premixing test workflows are broken because of this PR, starting from CMSSW_10_3_X_2018-09-25-1100. In my understanding the fix passes through a regeneration of the input files with the new event content, so it is almost unavoidable to have a period when some IBs are broken, please correct me if I am wrong.

@makortel @davidlange6 and I have already replied about it. Did you see our comments above?

This is correct (and unfortunate, I agree, but I don't know a way around it).

@makortel

@prebello Yes I did.
