Track Kaboom #243

osschar · 2019-09-18T23:36:39Z

Track is broken into TrackBase and Track
TrackCand is used during track finding, implementing common hit-on-track storage for all candidates stemming from the same seed
Track now uses std::vector for hit storage
Hit has been extended with bit-packed module-id-in-layer, charge_per_cm, and span_X/Y
Bin-file-writer has been modified accordingly
This requires a new bin file version (now at 4, samples have to be recreated, some are available at phi1 and phi3 in /data2)

Add HitOnTrack::operator< to allow making sets and maps of them.

…ize. Make Track and TrackCand ctors from TrackBase explicit. Modify Hit and mem-file-writer to store charge-per-cm and cluster spans according to expected needs.

kmcdermo · 2019-09-18T23:39:27Z

Cool! @osschar , can you share the validation plots here?

osschar · 2019-09-18T23:45:31Z

There are difficulties creating a new version of our standard tkNtuple as input EDM files are not accessible. New tkNtuples are required as the old ones do not contain cluster size information.

For the old sample validation has been run just before actual storing of cluster sizes has been added. This was the last step and this info is not used during tracking -- so in principle there should be no changes compared to this.
http://xrd-cache-1.t2.ucsd.edu/matevz/PKF/666-TrackCand-for-HitStorage-v3/
New sample PU70 2018 sim
http://xrd-cache-1.t2.ucsd.edu/matevz/PKF/666-TrackCand-for-HitStorage-v4-2018-sim-pu70/
New sample PU50 2018 sim
http://xrd-cache-1.t2.ucsd.edu/matevz/PKF/666-TrackCand-for-HitStorage-v4-2018-sim-pu50/

cerati · 2019-09-19T13:44:26Z

So TrackCand contains a pointer to CombCandidate, which in turn is a vector of TrackCand. Do I understand correctly this vector contains the TrackCand itself and all its siblings?
Trying to digest the changes, I may have more questions later...
Thanks!

makortel · 2019-09-19T15:16:24Z

If I understand correctly the CMSSW side needs to be updated to call for a constructed Hit object

hit.setupAsPixel(...) for pixel hits
hit.setupAsStrip(...) for strip hits

(for @cerati I believe) Can you show how exactly the TrackingNtuple as filled for the arguments of these functions (i.e. pix_clustSizeCol, pix_clustSizeRow, str_clustSize branches)? (it would also be nice to get these changes in the ntuple contributed back to upstream CMSSW)

cerati · 2019-09-19T15:19:31Z

@makortel, this went into @slava77's fork: slava77/cmssw#102
It should be straightforward to replicate or cherry-pick for the upstream CMSSW.

makortel · 2019-09-19T15:20:57Z

tkNtuple/WriteMemoryFile.cc

+      unsigned int imoduleid;
+      {
+        auto ii = module_shortId_hash[ilay].emplace(pix_detId->at(ipix), (unsigned int) module_shortId_hash[ilay].size());
+        imoduleid = ii.first->second;


(For CMSSW side) Is it enough to provide a unique 12-bit number for each module on a layer? Or does the "short id" as implemented here carry any other meaning?

(I'm just thinking that for CMSSW it would be more natural to derive this kind of mapping from DetId to a "short id" once from the geometry instead of loops over hits)

Yes, it would be preferable to extract it from DetId and store that. I wasn't exactly sure how this changes with different tracker phases so I picked a lazy way out. We also have more bits if we need them.

Also, don't forget that we use stereo/mono as different layers.

Do you know how to do it in a reasonable way?

(In lossyspeak) I was thinking to just take the geometry, ask for all modules, loop over them and use a similar module_shortId_has_ trick to map DetId to a "sequence number".

By very quick look the GeomDets actually have a global indexing
https://github.com/cms-sw/cmssw/blob/43c5f0689f3839d0e430dcf63cbf78c2ece55adc/Geometry/CommonDetUnit/interface/GeomDet.h#L86-L88
that is used at least for tracker
https://github.com/cms-sw/cmssw/blob/43c5f0689f3839d0e430dcf63cbf78c2ece55adc/Geometry/TrackerGeometryBuilder/src/TrackerGeometry.cc#L141
One option would be to use that directly (the geometry seems to internally map DetIds to const GeomDet* with a std::unordered_map so it should not be much slower than going through std::unordered_map<unsigned, unsigned>). I'm not sure if everything would fit in 12 bits though (in the available 18 for sure, I don't know on the top of my head whether phase2 tracker could hit 260k modules in total). In a sense this index would be overkill (it would be unique through the tracker), but it would still be unique within a layer :)

Slava once pointed me to this: https://github.com/cms-sw/cmssw/blob/02d4198c0b6615287fd88e9a8ff650aea994412e/Geometry/TrackerNumberingBuilder/README.md

And it scared me enough to go the hash_map way :) The largest number of modules I saw for my scheme was between 860 and 880 ... for the outer TOB layer.

But you are right, we can have an id that is unique across more than just "our" layer.

On the other hand, we could also have an id, that is only unique across potentially overlapping modules ... but this then becomes a "map coloring problem" and is probably even harder.

Now, if we ever need to access per module state (orientation, dead/always-on strips), this id would also be a "natural" index into the module_state vector.

Sigh, I guess I'm not helping. :)

Slava once pointed me to this: https://github.com/cms-sw/cmssw/blob/02d4198c0b6615287fd88e9a8ff650aea994412e/Geometry/TrackerNumberingBuilder/README.md

I also stared that page a lot while writing the comments above. While the idea of taking the within-layer data from the DetId itself (with bit masks) sounds intriguing, it won't work for phase2 outer tracker, which really seems to use all 20 lowest bits (thus not fitting in the available 18 bits). Everything else seems to fit in 18 bits (note that for phase1/2 pixel barrel layer starts at bit 20, but has the lowest 2 bits unused).

I'm really fine with the current solution (CMSSW tracking uses similar "sequence numbering" as well instead of DetIds in various places). I was really after that (for now at least) the only requirements for this number are

unique within mkFit layer

fits in 12 bits

and I interpret your comment ("unique across potentially overlapping modules ") such that this is indeed the case.

osschar · 2019-09-19T16:28:44Z

@cerati Yes, CombCand is a vector of TrackCands (as it was before a vector of Tracks). We need a back pointer TrackCand->ComCand as ComCand now implements hit storage for all TrackCands it manages. Well, it could eventually also access seed-related info, if needed, but I don't think it does right now.

…amples.

osschar · 2019-09-23T16:56:49Z

The 10mu and PU50 and 70 samples are on UCSD phi machines in /data2.

Validation for all UCSD machines (and proper PR number):
http://xrd-cache-1.t2.ucsd.edu/matevz/PKF/243-TrackKaboom-2018-sim-pu50/
Note: the titles of histograms still say PU70 ... but this is 2018 PU50.

@kmcdermo, can you please check these titles and whatever else might need to be modified for validation/benchmarking, I only changed the data-sample locations on UCSD machines. Oh, and you'll probably want to copy these samples over to Cornell and modify the locations for LNX machines, too.

osschar · 2019-09-25T17:10:42Z

I fixed the PU70 on histo titles to PU50 -- the plots are on the same location:
http://xrd-cache-1.t2.ucsd.edu/matevz/PKF/243-TrackKaboom-2018-sim-pu50/

kmcdermo · 2019-09-25T22:30:02Z

Attaching some quick slides regarding this PR and then will merge: track_kaboom_validation.pdf

Overall, when apples-to-apples with the old ttbar+PU70 2017 samples, speedup increases by 5-10% in building on a single thread, and maybe 2-3% at full load for full loop time. Efficiency also increases, mostly at low pT, in the endcaps, and at low nLayers.

With the new 2018 samples, given the 40-50% loss of quadruplets, it is a bit hard to make definitive statements. For sure, the overall time goes down when compared to devel (by about 40-50%). It seems that the PU50 samples actually take slightly longer to run than the PU70. N.B. The tracking performance also improves very nicely at high eta and at low nLayers with PU50.

cerati · 2019-09-25T22:48:18Z

did we expect the efficiency to change?

…

________________________________________ From: Kevin McDermott <notifications@github.com> Sent: Wednesday, September 25, 2019 5:30 PM To: trackreco/mkFit Cc: Giuseppe B. Cerati; Mention Subject: Re: [trackreco/mkFit] Track Kaboom (#243) Attaching some quick slides regarding this PR and then will merge: track_kaboom_validation.pdf<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_trackreco_mkFit_files_3655060_track-5Fkaboom-5Fvalidation.pdf&d=DwMCaQ&c=gRgGjJ3BkIsb5y6s49QqsA&r=cZ1DN6XgZbqMf23e3rFZ6w&m=IUrn_Jg9I5fZsyJ1s2XagTeEFwoyHBhqx0AR-9SMV3U&s=n0Q0xWGLodKsF9ASE5oi-bT0d9Iy0c3E3y_P_-zlLmQ&e=> Overall, when apples-to-apples with the old ttbar+PU70 2017 samples, speedup increases by 5-10% in building on a single thread, and maybe 2-3% at full load for full loop time. Efficiency also increases, mostly at low pT, in the endcaps, and at low nLayers. With the new 2018 samples, given the 40-50% loss of quadruplets, it is a bit hard to make definitive statements. For sure, the overall time goes down when compared to devel (by about 40-50%). It seems that the PU50 samples actually take slightly longer to run than the PU70. N.B. The tracking performance also improves very nicely at high eta and at low nLayers with PU50. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_trackreco_mkFit_pull_243-3Femail-5Fsource-3Dnotifications-26email-5Ftoken-3DABEYMGQRVG45GHIXHJNWTUDQLPQ6XA5CNFSM4IYEYH3KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD7TUA2I-23issuecomment-2D535249001&d=DwMCaQ&c=gRgGjJ3BkIsb5y6s49QqsA&r=cZ1DN6XgZbqMf23e3rFZ6w&m=IUrn_Jg9I5fZsyJ1s2XagTeEFwoyHBhqx0AR-9SMV3U&s=J64DsoScUPlCla-vt4KRmP_VOiG-f0MxpoxuvMucOaQ&e=>, or mute the thread<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_ABEYMGVWUA6RPBDOMHKEER3QLPQ6XANCNFSM4IYEYH3A&d=DwMCaQ&c=gRgGjJ3BkIsb5y6s49QqsA&r=cZ1DN6XgZbqMf23e3rFZ6w&m=IUrn_Jg9I5fZsyJ1s2XagTeEFwoyHBhqx0AR-9SMV3U&s=SihrZ9_ojoQOKnbIBSw86gIW5DHd2Z1zM4WCF36WEeE&e=>.

osschar · 2019-09-25T22:53:28Z

Not significantly :) There are two possible causes:

there is no more limit on N_hits (also for cmssw sim tracks)
score is now stored as float (not quantized into Status bits)

osschar added 2 commits September 11, 2019 11:49

Use TrackCand and common hit storage during outward track search.

0cd9f57

Add HitOnTrack::operator< to allow making sets and maps of them.

Track uses std::vector for hits, Hit has fields for cluster adc and s…

db45663

…ize. Make Track and TrackCand ctors from TrackBase explicit. Modify Hit and mem-file-writer to store charge-per-cm and cluster spans according to expected needs.

makortel reviewed Sep 19, 2019

View reviewed changes

Setup cmssw benchmark and validation scripts for new PU50 2018 data s…

9451be9

…amples.

Fix histogram title (name of output files) from PU70 to PU50.

422f0d3

kmcdermo merged commit e864ed9 into devel Sep 25, 2019

makortel mentioned this pull request Oct 2, 2019

Add support pixel triplet seeds #242

Merged

slava77 mentioned this pull request Jan 13, 2020

MkFit modules and test configs from makortel:mkfit_1040p1 trackreco/cmssw#3

Merged

This was referenced Feb 7, 2020

What to do about Track #188

Closed

Possible bug in counting nFoundHits #234

Closed

makortel mentioned this pull request Sep 22, 2020

Building 2.0.1 with AVX2 fails #277

Closed

osschar deleted the store-hits-per-seed-rb2 branch June 21, 2021 22:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Track Kaboom #243

Track Kaboom #243

osschar commented Sep 18, 2019

kmcdermo commented Sep 18, 2019

osschar commented Sep 18, 2019

cerati commented Sep 19, 2019 •

edited

makortel commented Sep 19, 2019

cerati commented Sep 19, 2019

makortel Sep 19, 2019

osschar Sep 19, 2019

makortel Sep 19, 2019

osschar Sep 19, 2019

makortel Sep 19, 2019

osschar commented Sep 19, 2019

osschar commented Sep 23, 2019

osschar commented Sep 25, 2019

kmcdermo commented Sep 25, 2019

cerati commented Sep 25, 2019 via email

osschar commented Sep 25, 2019

Track Kaboom #243

Track Kaboom #243

Conversation

osschar commented Sep 18, 2019

kmcdermo commented Sep 18, 2019

osschar commented Sep 18, 2019

cerati commented Sep 19, 2019 • edited

makortel commented Sep 19, 2019

cerati commented Sep 19, 2019

makortel Sep 19, 2019

Choose a reason for hiding this comment

osschar Sep 19, 2019

Choose a reason for hiding this comment

makortel Sep 19, 2019

Choose a reason for hiding this comment

osschar Sep 19, 2019

Choose a reason for hiding this comment

makortel Sep 19, 2019

Choose a reason for hiding this comment

osschar commented Sep 19, 2019

osschar commented Sep 23, 2019

osschar commented Sep 25, 2019

kmcdermo commented Sep 25, 2019

cerati commented Sep 25, 2019 via email

osschar commented Sep 25, 2019

cerati commented Sep 19, 2019 •

edited