-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wrong setting of RunNumber in harvesting output for MC MultiRun #9690
Comments
@srimanob Hi Phat, it also points to the original issue, where we described the feature requirements with Marco Rovere and Federico. Can you please have a look at those tickets and decide whether a feature change is required? Perhaps you also want to touch base with the DQM experts that proposed it back 3 or 4 years ago. |
Hi Alan @amaltaro |
Dear all, |
Hi @amaltaro Force RUN = 1 for MC, and 999999 for data for MRH. |
Sorry for missing it, Phat.
Can you please confirm it? |
Hi @amaltaro if multiRun and isCMSSWSupported(self.getCmsswVersion(), "CMSSW_8_0_0"): (as MC in MRH will have RunNumber as in Data, but we would like to ignore it on MC). Thanks in advance. |
@srimanob I expected it to be a simple fix, but it turns out we need to systematically identify data and run-dep MC files in the agent. I will get back to this issue in the next week. |
AlCa is currently using a naming convention in which global tags for run-dependent MC have the string "_RD" immediately prior to the version number. From our perspective, files from run-dependent MC workflows can be identified by the string "_RD_v" in the name, such as |
I would like to understand if the obstacles to resolving this issue are:
At this point, given the small number of run-dependent MC relval samples, it is feasible to run the harvesting and upload to the DQM GUI manually. So, this issue currently is not particularly urgent. At the same time, I don't want to waste the time of those who would perform these manual actions if a fix on the WMCore side is straightforward. |
Hi @christopheralanwest , sorry for the belated response. Yes, we still have to find out a generic and robust way to distinguish between data and MC. I'm afraid matching the global tag against Nonetheless, this issue hasn't been added to our todo queue for this quarter, so unless we can make it an hour or two of work, we will likely not be able to work on it before Q3. Perhaps you would like to discuss this with PPD/PdmV group, since they have other WMCore issues that we need to consider as well. |
Clearly we could choose an longer string, such as Run-dependent MC does not have run numbers other than 1 in the GEN-SIM step: But in the DIGI-PREMIX step the lumiblock number is mapped to a run number:
I think it's fine to postpone this issue until run-dependent MC relval production becomes more frequent, which would not happen before Q3. |
I know that it may be late, but I don't really see a problem with distinguishing MC and data based on run numbers. At the end run numbers is just a number and someone can choose a range which can be assigned to real data and to MC. For instance, MC run numbers can start from 1M and above, everything below is allocated for real data run numbers. I think it is general issue in CMS and should be discussed separately. From a technical point of view I don't see any issue with such approach. |
Impact of the bug
Output from MC MultiRun harvesting can't upload to GUI due to RunNumber is set to 999999 from production side while it's forced to be 1 in cmsDriver injected with the workflow.
Describe the bug
MC Run-Dependent relvals have been submitted, e.g.
https://dmytro.web.cern.ch/dmytro/cmsprodmon/workflows.php?campaign=CMSSW_11_1_0_pre7__RD_1HS-1588962304
All steps look OK except no histogram is found in GUI. When we check GUI log,
The filename seems to be wrong. The RunNumber should be 1. We force if to be 1 in cmsDriver of harvesting step in the workflow,
https://cmsweb.cern.ch/couchdb/reqmgr_config_cache/53979640a5175ff795ce39b4522dc8fe/configFile
Looking around WMCore, I found
https://github.com/dmwm/WMCore/blob/master/src/python/WMCore/WMRuntime/Scripts/SetupCMSSWPset.py#L546-L547
Is this overwritten the Harvesting cmsDriver we submit with the workflow?
How to reproduce it
I don't know how to reproduce, as it seems problem will happen on production side only and can't reproduce when I run locally. Running locally, I get the proper filename, i.e.
DQM_V0001_R000000001__Global__CMSSW_X_Y_Z__RECO.root
Expected behavior
We expect RunNumber = 1 for MC MultiRun.
The text was updated successfully, but these errors were encountered: