-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix isData when neither --data nor --mc is used #34302
Conversation
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-34302/23631
|
A new Pull Request was created by @srimanob (Phat Srimanobhas) for master. It involves the following packages: Configuration/Applications @cmsbuild, @silviodonato, @qliphy, @fabiocos, @davidlange6 can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
Please test |
assign xpog |
New categories assigned: xpog @fgolf,@mariadalfonso,@gouskos you have been requested to review this Pull request/Issue and eventually sign? Thanks |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d3a480/16389/summary.html Comparison SummarySummary:
|
type bug-fix |
+operations |
Hi @cms-sw/xpog-l2 @qliphy |
+1 @cms-sw/xpog-l2 Let us know if you have any comment. |
merge |
in miniAOD when the default is not set, pick the MC configuration, and now you set a different logic in nano. so having something different Is not good at all. This flag should not be there probably . |
Hi, If you want to fix Nano sequence is OK too. I avoid touching it in the beginning. It was there long ago somehow. |
@srimanob Please note there are some unit failures after merging this PR. |
Thanks @qliphy |
However, you can revert it for now. I can make another PR when the situation is clear. |
@srimanob Ok, thanks. Will revert it for now. |
Answer why IB failed in The validation step seems to run a strange sequence, using Data as input but look for MC content (i.e. mixing). When we set isData, the issue pop up. This is wrong. However, to avoid IB failing, I will make another PR to fix for Nano sequence for now. But we should follow up with DQM why they need to do that kind of sequence. |
It seems we do not have a straightforward way to fix. Mini(PAT) will face the same issue if it is data workflow but run without "--data". CMSSW will guess it is data, but PAT sequence will pick MC customization because nothing is set (*). I believe this PR is the right thing to do, by assigning isData when the guess result is data workflow. What should be fix is the DQM validation sequence. (*) |
@srimanob Would you please make a new PR or issue page to move discussions there? Thanks! |
Here it is, |
PR description:
When neither
--data
nor--mc
is used, cmsDriver will consider if the job should be data or MC usinghttps://github.com/cms-sw/cmssw/blob/CMSSW_12_0_0_pre3/Configuration/Applications/python/cmsDriverOptions.py#L214-L235
and then print out.
However, when it considers a job as data,
options.isData
is not set. Some config, i.e. NanoAOD,https://github.com/cms-sw/cmssw/blob/CMSSW_12_0_0_pre3/Configuration/Applications/python/ConfigBuilder.py#L1705
will need
options.isData
to be defined. If not MC customization will be picked.This bug was shown in the NanoV9 pilot sample,
https://cms-unified.web.cern.ch/cms-unified/report/pdmvserv_Run2018C_JetHT_UL2018_MiniAODv2_NanoAODv9_pilot_210617_171408_1560
with a config file in
https://cmsweb.cern.ch/couchdb/reqmgr_config_cache/5dabb3c8610cca5485ccc25118fc5c8c/configFile
shows
from PhysicsTools.NanoAOD.nano_cff import nanoAOD_customizeMC
is used, instead of data.
PR validation:
Run cmsDriver without --mc, --data, and see if it picks up the proper NanoAOD:
cmsDriver.py RECO --conditions 106X_dataRun2_v35 --customise Configuration/DataProcessing/Utils.addMonitoring --datatier NANOAOD --era Run2_2018,run2_nanoAOD_106Xv2 --eventcontent NANOEDMAOD --filein placeholder.root --fileout file:ReReco-Run2018C-JetHT-UL2018_MiniAODv2_NanoAODv9_pilot-00001.root --nThreads 2 --no_exec --python_filename ReReco-Run2018C-JetHT-UL2018_MiniAODv2_NanoAODv9_pilot-00001_0_cfg.py --scenario pp --step NANO
will give
This is correct. Before, it picked up MC customization.
if this PR is a backport please specify the original PR and why you need to backport that PR:
Backport to 10_6