New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LowPtElectrons: final ID model weights file #16
LowPtElectrons: final ID model weights file #16
Conversation
A new Pull Request was created by @bainbrid for branch master. @perrotta, @smuzaffar, @mrodozov, @cmsbuild, @slava77, @jpata can you please review it and eventually sign? Thanks. |
We (i.e. @crovelli!) have investigated various hyperparameter and training configurations for our ID BDT model, with the aim of reducing the disk and memory footprint of the GBRForest. Attached is a plot that shows the performance of various models. Below is a table that summarises the disk footprint of the .root file that would be stored in cms-data. The "B-Parking" model in the table below is the "2020Sept15" model that is currently used by the CMSSW_11_3_X, merged from the PR #15. It is planned to be integrated within CMSSW_10_6_X as part of the back port cms-sw/cmssw#32372 for use with the UL reminiAOD compaign. It was trained based on BParking studies and it is our benchmark for all the studies below. It is indicated by the black curve in the attached plot. The other models, labeled DepthXX, are indicated in the plot as coloured dashed curves. They are trained on AOD inputs taken from B->J/psi(->ee)K samples produced with 10_6_X.
The general improvement w.r.t. the baseline "B-Parking" model is understood to be due to the improved reconstruction from the UltraLegacy AOD samples w.r.t. the Prompt conditions used by the BParking samples. Our recommendation is to use the Depth10 model, so we plan to update this PR to use the Depth10 model. Please let us know if you have any objections. |
…n 2; rm Autumn18 file
87555f6
to
01c2f68
Compare
Pull request #16 was updated. |
The last commit implements the changes discussed here #16 (comment) |
+1 based on tests in cms-sw/cmssw#32391 |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-697892/12048/summary.html Comparison SummarySummary:
|
I'm a bit surprised why the test here passed; @smuzaffar do we have a mechanism to hide the base release data area ? |
@slava77 , not currently for data externals but I can update https://github.com/cms-sw/cmssw-config/blob/scramv3/SCRAM/hooks/runtime/50-remove-release-external-lib to drop the data path from release too. let me update build rules and re-run tests for this PR |
please test with cms-sw/cmsdist#6538 |
@smuzaffar , if you agree, I will merge this PR with cms-sw/cmssw#32391 once the tests with cms-sw/cmsdist#6538 are finished |
please test with cms-sw/cmsdist#6538 |
@silviodonato , yes please wait for the results. |
-1 Failed Tests: UnitTests RelVals AddOn Unit TestsI found errors in the following unit tests: ---> test runtestPhysicsToolsPatAlgos had ERRORS ---> test testRecoMETMETProducers had ERRORS ---> test testPhase2PixelNtuple had ERRORS ---> test testTauEmbeddingProducers had ERRORS RelVals
|
merge |
this looks good now, I mean bot now properly remove data path of release. |
@smuzaffar I didn't see the PR from @cmsbuild , so I manually created the new branch (V01-03-00) on cms-data and the PR for cmsdist (cms-sw/cmsdist#6543). I hope everything is ok. |
you mean new tag V01-04-00 ... right |
the generated release already existed when it tried to create it. |
This PR contains a .root file containing weights for a BDT model used by the LowPtGsfElectronIDProducer module.
cms-sw/cmssw#32391 depends in this PR.