
L1TMuonEndCapTrackProducer::produce() takes 96 MB memory per stream #42526

Open
makortel opened this issue Aug 9, 2023 · 10 comments
Comments

makortel commented Aug 9, 2023

Live memory profiles in #40437 (comment) show that L1TMuonEndCapTrackProducer::produce() takes about 100 MB of memory per stream.

The L1TMuonEndCapTrackProducer module itself is an edm::stream module. The memory consumption can be split into the loading of the BDT in PtAssignmentEngine::load() and of the coordinate-conversion LUTs in SectorProcessorLUT::read().

Assuming PtAssignmentEngine::load() indeed loads a BDT or similar, does its representation really need to be that large? Ideally all of this would live in the GlobalCache of the module (e.g. the Tensorflow model) or in the EventSetup (e.g. the BDT, whose content apparently depends on the L1TMuonEndCapParams and L1TMuonEndCapForest EventSetup data products).

makortel commented Aug 9, 2023

assign l1

cmsbuild commented Aug 9, 2023

New categories assigned: l1

@epalencia, @aloeliger you have been requested to review this pull request/issue and eventually sign. Thanks.

cmsbuild commented Aug 9, 2023

A new Issue was created by @makortel Matti Kortelainen.

@Dr15Jones, @perrotta, @dpiparo, @rappoccio, @makortel, @smuzaffar can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

makortel commented Aug 9, 2023

Moving the Tensorflow stuff to GlobalCache was also discussed in #32894

eyigitba commented Aug 9, 2023

Hi @makortel, as you said, the main contributions here are the loading of the BDT in PtAssignmentEngine::load and of the coordinate-conversion LUTs in SectorProcessorLUT::read. I don't know how we could reduce these easily.

Regarding the Tensorflow code and GlobalCache, I unfortunately don't currently have the time to rework it. If you and/or the L1T offline software group think this should be done, I can try to pass the task to someone within the EMTF group, and we can see how quickly we can implement it.

makortel commented Aug 9, 2023

Hi @eyigitba, I think addressing PtAssignmentEngine::load() (at least making the read-only parts shared across streams; I hope the memory there is mostly read-only) would be important.

The Tensorflow part would be nice to address as well (e.g. in case the model grows in the future, or to serve as an example for others), but at 0.5 MB per stream it is not that important today.

eyigitba commented Aug 9, 2023

Hi @makortel, OK, we'll look into the BDT loading and also see how we can improve the Tensorflow part.

By the way, how urgent is this?

VinInn commented Aug 10, 2023

The memory budget is 2 GB per stream (including shared components and I/O buffers), so L1TMuonEndCapTrackProducer alone accounts for about 5% of it.
It is up to L1 management to judge how urgent this is.

VinInn commented Aug 10, 2023

By the way: why use your own implementation of a forest instead of the highly optimized common CMS one,
https://cmssdt.cern.ch/dxr/CMSSW/source/CondFormats/GBRForest/interface/GBRForest.h#24 ?

I would advise switching to that.

@eyigitba

Thanks for the advice @VinInn. I don't know why it was implemented like this, but the code is quite old, from 2016 or so. I unfortunately don't have much time in the next couple of weeks, but we'll discuss within the EMTF group and come up with a solution soon.
