Add MXNet prediction tool #4167

hqucms · 2018-07-03T15:15:10Z

This is to add MXNet to CMSSW externals following the discussions in cms-sw/cmssw#21314. It is needed for the integration of the DeepAK8 tagger into CMSSW.

This includes both the C and C++ APIs. MXNet is built in prediction-only mode (MXNET_PREDICT_ONLY=1) here, as so far we only intend to use it for the evaluation of the neural networks, not for training (which is typically done on a GPU w/ the python API). By compiling in the prediction-only mode, MXNet is also set to run in Native Engine mode such that it runs in the master thread instead of creating its own thread pool.

cmsbuild · 2018-07-03T15:15:31Z

A new Pull Request was created by @hqucms for branch IB/CMSSW_10_2_X/gcc630.

@cmsbuild, @smuzaffar, @gudrutis, @mrodozov can you please review it and eventually sign? Thanks.
You can sign-off by replying to this message having '+1' in the first line of your reply.
You can reject by replying to this message having '-1' in the first line of your reply.

hqucms · 2018-07-03T15:17:55Z

@gouskos who would also like to follow this thread.

davidlange6 · 2018-07-03T15:19:35Z

please test

cmsbuild · 2018-07-03T15:20:20Z

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/28993/console

smuzaffar · 2018-07-03T16:10:00Z

@hqucms , please also update cmssw-tool-conf.spec to depend on mxnet-predict-toolfile. This is needed to get it within cmssw env.

smuzaffar · 2018-07-03T16:11:18Z

@mrodozov , can you please check if this builds correctly on slc7/aarch64 and , gcc7/gcc8?

cmsbuild · 2018-07-03T16:20:34Z

Pull request #4167 was updated.

hqucms · 2018-07-03T16:20:56Z

@smuzaffar Done.

cmsbuild · 2018-07-03T18:14:59Z

+1
Tested at: 5168958
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4167/28993/summary.html

cmsbuild · 2018-07-03T18:15:03Z

Comparison job queued.

cmsbuild · 2018-07-03T19:47:41Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4167/28993/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 31
DQMHistoTests: Total histograms compared: 2899480
DQMHistoTests: Total failures: 1
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2899289
DQMHistoTests: Total skipped: 190
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 30 files compared)
Checked 128 log files, 14 edm output root files, 31 DQM output files

mrodozov · 2018-07-05T12:42:27Z

tested on slc6_amd64_gcc630,700 slc7_amd64_gcc700,810 slc7_aarch64_gcc700

builds fine.

mrodozov · 2018-07-05T12:45:02Z

please test

cmsbuild · 2018-07-05T12:45:17Z

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/29017/console

cmsbuild · 2018-07-05T14:56:54Z

+1
Tested at: 09b222f
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4167/29017/summary.html

cmsbuild · 2018-07-05T14:56:57Z

Comparison job queued.

smuzaffar · 2018-07-05T15:56:59Z

+externals

cmsbuild · 2018-07-05T15:57:17Z

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_10_2_X/gcc630 IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

cmsbuild · 2018-07-05T16:06:34Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4167/29017/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 6 differences found in the comparisons
DQMHistoTests: Total files compared: 31
DQMHistoTests: Total histograms compared: 2899480
DQMHistoTests: Total failures: 2
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2899288
DQMHistoTests: Total skipped: 190
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 30 files compared)
Checked 128 log files, 14 edm output root files, 31 DQM output files

slava77 · 2018-07-10T15:24:51Z

@smuzaffar @fabiocos
Please let me know what is the plan for merging this.
Thank you.

fabiocos · 2018-07-11T11:59:56Z

@slava77 as the build looks functional I think that we have no technical obstacle to integrate it, @smuzaffar please comment in case.

Anyway, looking at the discussions at the root of this request, I would like to better understand the overall strategy, as this is the 3rd deep learning-oriented external tool that we are integrating to my knowledge, besides tensorflow and lwtnn. From the slides of @Dr15Jones and @makortel at the november O&C meeting I understand that the idea is to use the DeepAK8 tagger based on MXNet as a testbed for that approach. As I did not take part to that discussion, I would like to understand whether this request is part of an overall strategy, as it seems. Do we want to have the 3 of them within CMSSW for comparison, with the idea of possibly moving in the longer term towards a single approach?

smuzaffar · 2018-07-11T12:17:59Z

@fabiocos , yes there are not technical obstacle integrating it. As this is new tool, so we can integrate it now OR we can wait for #4185 (which also include this PR changes) to be fully tested.

makortel · 2018-07-11T19:13:53Z

My feeling (could be wrong) is that at the moment we don't have enough experience on these tools on the inference side to make a clear choice (are there any other downsides than supporting yet another external?). Also, if we start restricting the inference tools, it should be clearly communicated to the developer community.

davidlange6 · 2018-07-11T19:43:52Z

Hi I agree. If we want to move out of the “let 1000 flowers bloom” mode in this problem space, a pr is not the place to decide (nor is first come first served the approach to take). I haven’t seen any research to suggest that one solution is now covering the problem space, so am biased toward continuing to integrate data science supported tools. Cheers, David On 11 Jul 2018, at 22:14, Matti Kortelainen <notifications@github.com<mailto:notifications@github.com>> wrote: My feeling (could be wrong) is that at the moment we don't have enough experience on these tools on the inference side to make a clear choice (are there any other downsides than supporting yet another external?). Also, if we start restricting the inference tools, it should be clearly communicated to the developer community. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#4167 (comment)>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AEzyw_byfnF50XCV9XKaPyeE3ZvCEHSLks5uFk5ygaJpZM4VBJSo>.

fabiocos · 2018-07-11T19:47:18Z

@davidlange6 I agree that general long term strategies goes beyond the single PR discussion. But the problem is practically posed through PRs. This tool was already discussed and mentioned, so ok, but in case more arrives, I think we need to understand where we want to go

fabiocos · 2018-07-11T19:47:27Z

+1

Add MXNet prediction tool

5168958

cmsbuild added externals-pending orp-pending pending-signatures tests-pending labels Jul 3, 2018

cmsbuild added tests-started and removed tests-pending labels Jul 3, 2018

Add mxnet-predict to cmssw-tool-conf.spec

09b222f

cmsbuild added tests-pending and removed tests-started labels Jul 3, 2018

cmsbuild added tests-started and removed tests-pending labels Jul 5, 2018

cmsbuild added tests-approved and removed tests-started labels Jul 5, 2018

cmsbuild added externals-approved fully-signed and removed externals-pending pending-signatures labels Jul 5, 2018

hqucms mentioned this pull request Jul 9, 2018

DeepAK8 tagger integration cms-sw/cmssw#23768

Merged

cmsbuild added orp-approved and removed orp-pending labels Jul 11, 2018

cmsbuild merged commit e10e505 into cms-sw:IB/CMSSW_10_2_X/gcc630 Jul 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MXNet prediction tool #4167

Add MXNet prediction tool #4167

hqucms commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

hqucms commented Jul 3, 2018

davidlange6 commented Jul 3, 2018

cmsbuild commented Jul 3, 2018 •

edited

smuzaffar commented Jul 3, 2018

smuzaffar commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

hqucms commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

mrodozov commented Jul 5, 2018 •

edited

mrodozov commented Jul 5, 2018

cmsbuild commented Jul 5, 2018 •

edited

cmsbuild commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

smuzaffar commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

slava77 commented Jul 10, 2018

fabiocos commented Jul 11, 2018

smuzaffar commented Jul 11, 2018

makortel commented Jul 11, 2018

davidlange6 commented Jul 11, 2018 via email

fabiocos commented Jul 11, 2018

fabiocos commented Jul 11, 2018

Add MXNet prediction tool #4167

Add MXNet prediction tool #4167

Conversation

hqucms commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

hqucms commented Jul 3, 2018

davidlange6 commented Jul 3, 2018

cmsbuild commented Jul 3, 2018 • edited

smuzaffar commented Jul 3, 2018

smuzaffar commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

hqucms commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

cmsbuild commented Jul 3, 2018

mrodozov commented Jul 5, 2018 • edited

mrodozov commented Jul 5, 2018

cmsbuild commented Jul 5, 2018 • edited

cmsbuild commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

smuzaffar commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

cmsbuild commented Jul 5, 2018

slava77 commented Jul 10, 2018

fabiocos commented Jul 11, 2018

smuzaffar commented Jul 11, 2018

makortel commented Jul 11, 2018

davidlange6 commented Jul 11, 2018 via email

fabiocos commented Jul 11, 2018

fabiocos commented Jul 11, 2018

cmsbuild commented Jul 3, 2018 •

edited

mrodozov commented Jul 5, 2018 •

edited

cmsbuild commented Jul 5, 2018 •

edited