Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixing FTAG MCMC SFs #1686

Merged
merged 4 commits into from
May 16, 2024
Merged

Fixing FTAG MCMC SFs #1686

merged 4 commits into from
May 16, 2024

Conversation

mdhank
Copy link
Contributor

@mdhank mdhank commented Apr 24, 2024

Updating MC-MC SFs to match https://ftag.docs.cern.ch/algorithms/activities/mcmc/#release-22-supported-generators and the CDI files (with the CDI files given precedence). Note this requires a new option in BJetEfficiencyCorrector, isRun3, which is set to False by default.

@mdhank mdhank changed the title Draft: Fixing FTAG MCMC SFs Fixing FTAG MCMC SFs Apr 25, 2024
@mdhank
Copy link
Contributor Author

mdhank commented Apr 25, 2024

Dear @tofitsch ,

Could you take a quick look at this? I've tested that it behaves as I expect, but it's good to have a reviewer separate from the author.

Thanks,
Michael

@tofitsch
Copy link
Contributor

tofitsch commented May 2, 2024

Looks generally good to me. I trust you with all the numbers and will not check them :D only worry I have is regarding backwards compatibility. E.g. before any file sample name containing H7 got assigned Herwig7, now only if it contains PHH7EG it is Herwig7p2 but if it contains H7 but not PHH7EG it will be assigned Unknown. Not sure if that's ok? Are there no possible sample names that contain H7 but not PHH7EG?

@mdhank
Copy link
Contributor Author

mdhank commented May 2, 2024

Looks generally good to me. I trust you with all the numbers and will not check them :D only worry I have is regarding backwards compatibility. E.g. before any file sample name containing H7 got assigned Herwig7, now only if it contains PHH7EG it is Herwig7p2 but if it contains H7 but not PHH7EG it will be assigned Unknown. Not sure if that's ok? Are there no possible sample names that contain H7 but not PHH7EG?

Good idea, I'll check for h7s. There are some such as
mc23_13p6TeV.525891.MGH7EG_LO_lj_tchannel_el_150to200.deriv.DAOD_PHYS.e8544_s4162_r14622_p5855,
mc23_13p6TeV.522027.aMCH7EG_NNPDF30NLO_H723UE_ttmumu_run3.deriv.DAOD_PHYS.e8558_a910_r14932_p6026,
mc20_13TeV.346526.PowhegHerwig7EvtGen_H7UE_NNPDF30ME_ttH125_gamgam.deriv.DAOD_PHYS.e7488_s3681_r13167_p5631, or
mc23_13p6TeV.830148.H7EG_H72NNPDF30NLO_jetjet_Cluster_JZ6.deriv.DAOD_PHYS.e8551_s4159_r14799_p5855.

The amc definitely shouldn't fall under this case as it has it's own MC-MC SFs, and I don't think MG should either as the MC-MC SFs (https://ftag.docs.cern.ch/algorithms/activities/mcmc/#run-2) specify they are for Powheg+Herwig7. I believe the one with PowhegHerwig in the name should fall under Herwig7.1. I'm less clear on the dijet samples.
There are also datasets like
mc23_13p6TeV.902037.QBHPy8EG_QBH_photonjet_n6_Mth7000.deriv.DAOD_PHYS.e8557_s4162_r14622_p6026 and
mc23_13p6TeV.603317.PhPy8EG_ggH700W20_tautaulh.deriv.DAOD_PHYS.e8566_s4162_r14622_p6026
which have h7 but should definitely not be included. So I'd lean towards leaving as is to avoid false postives.

It could be worth adding a warning in general that it's good to double-check what values the parsing uses, as it's very easy to miss edge cases, but I have mixed feelings on this.

@tofitsch Does this plan (the PR as is) sound reasonable? I think this is the safest solution- if there's patterns that we should be counting but aren't, they'll give errors and users can either report them (in which case we'll adjust to add them), or manually set which to use. I believe the alternative would count patterns that shouldn't be counted, which I would rather not do as it makes it possible we will process some incorrectly.

@mdhank mdhank merged commit 03f2e09 into UCATLAS:main May 16, 2024
12 checks passed
else if(tmp_name.Contains("SH_2210")) return Sherpa2210;
else if(tmp_name.Contains("SH_2211")) return Sherpa2210;
else if(tmp_name.Contains("SH_2212")) return Sherpa2210;
else if(tmp_name.Contains("SH_2214")) return Sherpa2210;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi, I realise I'm a little late now, but would it be possible to declare SH_2214 also Sherpa2212 so that the BJetEffCorr doesn't crash on the Run3 V+jets samples?
I can start a new pull request for this line if needed

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @dbuchin ,

You should be able to manually set the calibration by setting m_EfficiencyCalibration to the desired value in your configuration ("700660" to use the Sherpa 2.2.12 SFs). Would this work for you? If possible, I would prefer not to automatically declare any unless they are officially supported (see https://ftag.docs.cern.ch/algorithms/activities/mcmc/#release-22-supported-generators for the official list).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants