DNN-based Tau-Id discriminants (94X) #25385

Closed

Conversation

mbluj (Contributor) commented Nov 30, 2018

This pull request provides two new DNN-based Tau-Ids, DeepTau and DPFTau, to be produced for pat::Taus with MiniAOD.
It is a backport of #25016 to 94X for analyses based on 2016+2017 data; a detailed description can be found therein.
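For context, the new discriminators are stored as regular pat::Tau IDs, so analyses read them back like any other tau ID. A minimal sketch, assuming an illustrative discriminator name (the exact strings are defined by the producers in this PR):

```cpp
// Reading a new ID back in an analysis; the discriminator name below
// is illustrative, the exact strings are defined by the producers.
#include <string>
#include "DataFormats/PatCandidates/interface/Tau.h"

float deepTauScore(const pat::Tau& tau) {
  const std::string name = "byDeepTau2017v1VSjetraw";  // illustrative name
  // isTauIDAvailable guards against IDs missing from older samples;
  // tauID returns the stored discriminator value.
  return tau.isTauIDAvailable(name) ? tau.tauID(name) : -1.f;
}
```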

kandrosov and others added 30 commits June 15, 2018 16:33
- Defined base class for deep tau discriminators.
- Removed weight files from home cms repository. Now using weights from cms-data.
- Defined WP for both discriminators. Now all discriminators return the corresponding WP results.
- Removed cfi files; using fillDescriptions instead (see the sketch after this list).
- General code review and cleaning.
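A minimal sketch of the fillDescriptions mechanism that replaces the removed cfi files; the parameter names shown are illustrative, not the actual configuration of the discriminators:

```cpp
// Sketch of a fillDescriptions static method replacing a cfi file;
// the parameters shown are illustrative.
#include <string>
#include "FWCore/ParameterSet/interface/ConfigurationDescriptions.h"
#include "FWCore/ParameterSet/interface/ParameterSetDescription.h"
#include "FWCore/Utilities/interface/InputTag.h"

// static member of the producer class
void fillDescriptions(edm::ConfigurationDescriptions& descriptions) {
  edm::ParameterSetDescription desc;
  desc.add<edm::InputTag>("taus", edm::InputTag("slimmedTaus"));
  desc.add<std::string>("graph_file", "");  // weights resolved from cms-data
  // Registering the description lets the framework generate the cfi
  // automatically and validate the python configuration at runtime.
  descriptions.add("deepTauIdSketch", desc);
}
```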
Integration of DPFIsolation and DeepTauId
Made DeepTauId and DPFIsolation thread-safe
…es quantized

- Added a new parameter 'version' to runTauIdMVA, used by DPFIsolation
- Changes in DeepTauId to reduce memory consumption
…read and reduce the memory consumption

- Creation of the class DeepTauCache in DeepTauBase, in which the graph and session are now created
- Implementation of two new static methods inside the class DeepTauBase: initializeGlobalCache and globalEndJob. The graph and the DeepTauCache object are now created inside initializeGlobalCache
The TauWPThreshold class parses the WP cut string (or value) provided in the
python configuration. It is needed because using the standard
StringObjectFunction class to parse complex expressions results in
extensive memory usage (> 100 MB per expression); the idea is sketched below.
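A minimal sketch of that idea (not the actual TauWPThreshold implementation): a cut that parses as a plain number is stored as a constant, and the heavy StringObjectFunction machinery is only instantiated for genuine expressions:

```cpp
// Sketch: avoid StringObjectFunction for constant working-point cuts.
#include <cstdlib>
#include <memory>
#include <string>
#include "CommonTools/Utils/interface/StringObjectFunction.h"
#include "DataFormats/PatCandidates/interface/Tau.h"

class WPThresholdSketch {
public:
  explicit WPThresholdSketch(const std::string& cut) {
    char* end = nullptr;
    value_ = std::strtod(cut.c_str(), &end);
    // Constant only if the whole string was consumed as a number.
    isConstant_ = (!cut.empty() && end != nullptr && *end == '\0');
    if (!isConstant_)
      fn_ = std::make_unique<StringObjectFunction<pat::Tau>>(cut);
  }
  double operator()(const pat::Tau& tau) const {
    return isConstant_ ? value_ : (*fn_)(tau);
  }

private:
  double value_ = 0.;
  bool isConstant_ = false;
  std::unique_ptr<StringObjectFunction<pat::Tau>> fn_;
};
```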
- Implementation of a global cache to avoid reloading the graph for each thread (see the sketch after this list)
- Creation of two new static methods inside the class DeepTauBase: initializeGlobalCache and globalEndJob. The graph and the DeepTauCache object are now created inside initializeGlobalCache. The memory consumption of initializeGlobalCache for the original files, the quantized files, and files loaded with memory mapping is reported in the memory_usage.pdf file
- Implemented configuration to use the new quantized training files, and set them as default
- Implementation of configuration to load files using memory mapping. In our case it brought no improvement in memory consumption with respect to the quantized files, so this is not used, but it is available for future training files
- General code review and cleaning.
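A minimal sketch of the global-cache pattern described in this list, assuming the CMSSW TensorFlow interface (PhysicsTools/TensorFlow); the actual DeepTauBase implementation differs in detail, and names with "Sketch" are illustrative:

```cpp
#include <memory>
#include <string>

#include "FWCore/Framework/interface/Event.h"
#include "FWCore/Framework/interface/stream/EDProducer.h"
#include "FWCore/ParameterSet/interface/ParameterSet.h"
#include "PhysicsTools/TensorFlow/interface/TensorFlow.h"

struct DeepTauCacheSketch {
  std::unique_ptr<tensorflow::GraphDef> graph;
  tensorflow::Session* session = nullptr;
};

class DeepTauIdSketch
    : public edm::stream::EDProducer<edm::GlobalCache<DeepTauCacheSketch>> {
public:
  DeepTauIdSketch(const edm::ParameterSet&, const DeepTauCacheSketch*) {}

  // Called once per job, before any stream module is constructed:
  // the graph is loaded and the session created here, then shared
  // by all streams instead of being reloaded for each thread.
  static std::unique_ptr<DeepTauCacheSketch> initializeGlobalCache(
      const edm::ParameterSet& cfg) {
    auto cache = std::make_unique<DeepTauCacheSketch>();
    cache->graph.reset(
        tensorflow::loadGraphDef(cfg.getParameter<std::string>("graph_file")));
    cache->session = tensorflow::createSession(cache->graph.get());
    return cache;
  }

  // Called once per job after all streams have finished.
  static void globalEndJob(DeepTauCacheSketch* cache) {
    tensorflow::closeSession(cache->session);  // graph freed by unique_ptr
  }

  void produce(edm::Event&, const edm::EventSetup&) override {
    // run the shared session via globalCache()->session and fill the event
  }
};
```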
fabiocos (Contributor) commented Jan 8, 2019

+code-checks

fabiocos (Contributor) commented Jan 8, 2019

code-checks

fabiocos (Contributor) commented Jan 8, 2019

@mbluj as far as I can see, the list of commits here contains the unwanted big files, so we need to squash it. If we do not want to lose the review history, I suggest just opening a new PR with the commit resulting from the squash.

mbluj (Contributor, Author) commented Jan 8, 2019

@fabiocos: please squash the history as was done for the 102X version of this PR, if you don't mind. The full history is anyway preserved in the original PR to the master. Thank you.

cmsbuild (Contributor) commented Jan 8, 2019

Comparison job queued.

cmsbuild (Contributor) commented Jan 8, 2019

The code-checks are being triggered in jenkins.

cmsbuild (Contributor) commented Jan 8, 2019

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-25385/7868

  • This PR adds an extra 145200KB to repository

  • Found files with invalid states:

    • RecoTauTag/RecoTau/data/DPFIsolation_2017v1.pb:
    • RecoTauTag/RecoTau/test/runTauIdMVA.py:
    • RecoTauTag/RecoTau/data/deepTau_2017v1_20L1024N.pb:
    • RecoTauTag/RecoTau/python/DPFIsolation_cfi.py:
    • RecoTauTag/RecoTau/python/DeepTauId_cfi.py:
    • RecoTauTag/RecoTau/data/DPFIsolation_2017v0.pb:
    • RecoTauTag/RecoTau/python/runTauIdMVA.py:

cmsbuild (Contributor) commented Jan 8, 2019

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-25385/32469/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 27
  • DQMHistoTests: Total histograms compared: 2721493
  • DQMHistoTests: Total failures: 108
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2721223
  • DQMHistoTests: Total skipped: 162
  • DQMHistoTests: Total Missing objects: 0

fabiocos (Contributor) commented Jan 9, 2019

@mbluj @perrotta I would prefer to avoid losing the review history. But as 1) we need to squash anyway and 2) @smuzaffar will implement the possibility to instruct the bot to do it, I will clone this PR into another one, and try the new possibility there.

fabiocos (Contributor)

code-checks

cmsbuild (Contributor)

The code-checks are being triggered in jenkins.

cmsbuild (Contributor)

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-25385/7927

  • This PR adds an extra 145196KB to repository

  • Found files with invalid states:

    • RecoTauTag/RecoTau/data/DPFIsolation_2017v1.pb:
    • RecoTauTag/RecoTau/test/runTauIdMVA.py:
    • RecoTauTag/RecoTau/data/deepTau_2017v1_20L1024N.pb:
    • RecoTauTag/RecoTau/python/DPFIsolation_cfi.py:
    • RecoTauTag/RecoTau/python/DeepTauId_cfi.py:
    • RecoTauTag/RecoTau/data/DPFIsolation_2017v0.pb:
    • RecoTauTag/RecoTau/python/runTauIdMVA.py:

fabiocos (Contributor)

@smuzaffar I have cross-checked that the code in #25621 is identical to that in this PR, but code-checks is complaining in the new and not in the old version. If I just manually merge 25385 in a working area and run "scram b code-checks" I indeed find the complaints, and it produces the patch as in #25621. So what is going on? Is the bot missing the differences in some cases?

mbluj (Contributor, Author) commented Jan 11, 2019

It is indeed an interesting issue to understand, but maybe in parallel it makes sense to fix the code issues here and prepare small PRs with similar fixes also for master and 102X? Or start with a fix to master, with backports to 102X and here (via cherry-pick)?

fabiocos (Contributor)

In order to manage things in an ordered way, I would move forward with merging the present version, and let you add fixes where needed for all the possible releases, starting with master.

fabiocos (Contributor)

@mbluj as the squashed version has been merged, this one may be closed, and will stay as a reference for the review.

perrotta (Contributor)

-1
Superseded by #25621: please @mbluj close this PR
