New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Apply regression corrections from database in PFECAL-SuperClusters, decrease PF-SC seeding thresholds #972
Conversation
A new Pull Request was created by @lgray (Lindsey Gray) for CMSSW_7_0_X. Apply regression corrections from database in PFECAL-SuperClusters It involves the following packages: RecoEcal/EgammaClusterProducers @nclopezo, @smuzaffar, @thspeer, @slava77 can you please review it and eventually sign? Thanks. |
@@ -116,21 +116,21 @@ class GBRWrapperMaker : public edm::EDAnalyzer { | |||
// iSetup.get<SetupRecord>().get(pSetup); | |||
// #endif | |||
//from Josh: | |||
TFile *infile = new TFile("/afs/cern.ch/user/b/bendavid/cmspublic/gbrv3ph.root","READ"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this code ever called other than in user tests?
(I recall seeing it once and back then concluded it's not used).
If you need the file, please request a corresponding external and add this file in package/data
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No this code is only ever called from the test directory of this package.
Can move this to test/ as well if you want?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lindsey
Yes, please, move to test.
It's a bit disturbing to see private directories/files used in the general code area.
/test is much more obviously for testing
HI Lindsey Slava |
Hi Slava, The .db file is only temporary, it's meant for testing. I'll send it in for inclusion to in the global tags tomorrow. -Lindsey |
@apfeiffer1 @ggovi Could you guys please grab the .db file in this PR and integrate it into the global tag? |
... oops, I almost overlooked this one, this kind of requests should The short answer is sorry, no, we can't. :( Assuming that this file contains new payloads, please use the dropbox Please let us know if you find any problems with these procedures. On Wed, Oct 2, 2013 at 7:33 PM, Lindsey Gray notifications@github.comwrote:
Thanks, |
@apfeiffer1 Could you please give me a link to the dropbox and standard procedures, I have never gone through them before. Thanks! |
@slava77 Thanks. In the mean time while I get this stuff in the db, let's test to make sure we see the improvements we expect? |
Thanks, Slava! And more info on handling the global tags is at: https://twiki.cern.ch/twiki/bin/viewauth/CMS/AlCaDB#Global_Tags (which includes the link to the DropBox and one to the Procedure to submit HTH, On Fri, Oct 4, 2013 at 9:13 AM, slava77 notifications@github.com wrote:
Thanks, |
With respect to "all the other stuff" becoming big enough to be an external, I think a fair amount of the other stuff in data/ is no longer used or can be phased out. I will check |
OK |
Lindsey, I was hoping that at least e.g. in workflow 1000.0 step2: process.offlinePrimaryVertices is a part of process.vertexreco Please, swap the two in your topic branch, test the short matrix (command above) and then update this pull request. Thanks Slava |
@slava77 Ahh, it passed for me since I was doing a rereco. I'll swap them out. Thanks. |
Sure, I'll fix it. Vestige of original implementation. |
@@ -15,6 +57,10 @@ | |||
#PFClusters collection | |||
PFClusters = cms.InputTag("particleFlowClusterECAL"), | |||
PFClustersES = cms.InputTag("particleFlowClusterPS"), | |||
vertexCollection = cms.InputTag("offlinePrimaryVertices"), | |||
#rechit collections for lazytools | |||
reducedEcalRecHitsEB = cms.InputTag('ecalRecHit','EcalRecHitsEB'), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, OK, these are new parameters that were not here before.
My last comment was based only on the last commit changes.
That made it look like input tags have changed wrt the current values already in the release.
Lindsey, Looking at the object diffs, I see that only energy() changes in The plot is for the TTbar PU sample, workflow 202.0 What's surprising is that this doesn't affect the downstream objects: Timing looks about the same. |
@slava77 The mustache SCs aren't yet used in PFEG-algo, so you wouldn't see any change until we flipped the switch for the inputs. It would be more informative to take a look at E_reco/E_true or di-SuperCluster mass widths using just the superclusters. Which sample is the above plot from |
That was a ttbar PU (I updated the comment). OK for the no change in particleFlowEGamma, got it. Is the size of the change in the energy expected? Note that this is a log10(energy) plot. That's a pretty large correction. |
For the barrel the correction tends to be > 1, for endcap somewhat < 1. The pT threshold for the EB/EE is 3/5 GeV. |
Matteo Sani reports that he sees improved resolution when running the SCs in the HLT, with a few bins worse possibly due to statistical fluctuations, checking further. Waiting for word from Emanuele di Marco for the offline electron seeding. I want to wait a bit before putting the .db in the dropbox, we need to make sure we don't need separate regressions for offline/online or other things along those lines. |
@slava77 Make sure timing doesn't go nuts from lowered threshold. We need to lower the threshold otherwise we see turn-on effects up to very high pT in the endcaps, O(15 GeV). |
@slava77 working on it |
@slava77 fixed, matrix tests passed |
+1 Tested f808ec7 DQM @lgray
|
@lgray I'll wait for the above mentioned documentation update before merging this. |
@ktf it is already updated. |
@@ -116,21 +116,21 @@ class GBRWrapperMaker : public edm::EDAnalyzer { | |||
// iSetup.get<SetupRecord>().get(pSetup); | |||
// #endif | |||
//from Josh: | |||
TFile *infile = new TFile("/afs/cern.ch/user/b/bendavid/cmspublic/gbrv3ph.root","READ"); | |||
TFile *infile = new TFile("/afs/cern.ch/user/l/lgray/work/public/CMSSW_7_0_0_pre4/src/RecoEgamma/EgammaTools/test/GBR_Clustering_PFMustache_results.root","READ"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs to go somewhere central.
Reco updates -- Apply regression corrections from database in PFECAL-SuperClusters, decrease PF-SC seeding thresholds
This branch includes an update to apply a BDT regression to found PFECAL-SuperClusters in an event.
The penalty to timing is marginal, about 0.003 seconds are added for this module per event in the ttbar sample.
We should see improved energy resolution for superclusters associated to electrons and photons.
Not sure of the impact on soft clusters.
The associated .db file will be moved into databases soon, for now it reads from a local file. This will be changed once it's in the GlobalTags.
This PR is also pending some final validation from the developer of the regression weights.
Also:
barrel seeding threshold goes from 3 GeV to 1 GeV
endcap seeding threshold goes from 5 GeV to 1 Gev
There is a corresponding increase in superclustering timing.