Patatrack integration - HCAL local reconstruction (8/N) #31720

fwyzard · 2020-10-08T21:48:55Z

PR description:

Data formats and algorithms for the HCAL local reconstruction running on GPU.
Implements the HBHE unpacking and the production of HCAL uncalibrated rechits, including the MAHI algorithm.

PR validation:

Changes in use in the Patatrack releases.

if this PR is a backport please specify the original PR and why you need to backport that PR:

Includes changes from:

cms-patatrack/cmssw#468: Implement HCAL local reconstruction on GPUs
cms-patatrack/cmssw#470: Update HCAL local reconstruction on GPUs
cms-patatrack/cmssw#483: Restructure code to work around CUDA build limitations
cms-patatrack/cmssw#486: Apply code formatting
cms-patatrack/cmssw#489: Remove duplicate dictionary definitions
cms-patatrack/cmssw#491: Fix warnings about unused variables in HCAL GPU code
cms-patatrack/cmssw#502: Update ECAL and HCAL reconstruction to run on multiple GPUs [1/3]
cms-patatrack/cmssw#505: Implement HCAL-only workflows on GPU
cms-patatrack/cmssw#508: Update ECAL and HCAL reconstruction to run on multiple GPUs [2/3]
cms-patatrack/cmssw#523: Refactor common ECAL and HCAL code
cms-patatrack/cmssw#526: Apply code formatting
cms-patatrack/cmssw#527: Remove use of boost::mpl::vector for dependent records
cms-patatrack/cmssw#531: Use up to 10 time samples for the HBHE digis in Run 3 MC
cms-patatrack/cmssw#532: Apply code checks and code format
cms-patatrack/cmssw#543: HcalDigisProducerGPU: preallocate pinned host memory only if CUDA is available
cms-patatrack/cmssw#555: Update ESProducers following cms-sw#31556 (Remove setConsumes() from ESConsumesCollector)
cms-patatrack/cmssw#557: Move multifit/MAHI common code to DataFormats/CaloRecHit
cms-patatrack/cmssw#565: Address HCAL review comments regarding CondFormats/HcalObjects
cms-patatrack/cmssw#566: Reduce code duplication in CPU and GPU modules
cms-patatrack/cmssw#567: Refactor ECAL and HCAL chi2 code
cms-patatrack/cmssw#568: Address more HCAL review comments
cms-patatrack/cmssw#569: Move common ESProducer templates to ConvertingESProducer(WithDependencies)T
cms-patatrack/cmssw#572: Move the HCALGPUAnalyzer to RecoLocalCalo/HcalRecProducers/
cms-patatrack/cmssw#574: Update GPU HCAL conditions framework
cms-patatrack/cmssw#576: Synchronise GPU code with CPU updates

cmsbuild · 2020-10-08T21:49:18Z

The code-checks are being triggered in jenkins.

fwyzard · 2020-10-08T21:49:20Z

For all questions, please contact @mariadalfonso and @vkhristenko.
For all changes, please make PRs against cms-patatrack:patatrack_integration_8_N_hcal_local_reco.

cmsbuild · 2020-10-08T21:57:40Z

-code-checks

ERROR: Build errors found during clang-tidy run.

CUDADataFormats/HcalDigi/interface/DigiCollection.h:4:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 1522 warnings (1521 in non-user code, 1 with check filters).
--
CUDADataFormats/HcalDigi/interface/DigiCollection.h:4:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 794 warnings (793 in non-user code, 1 with check filters).
--
CUDADataFormats/HcalDigi/interface/DigiCollection.h:4:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 794 warnings (793 in non-user code, 1 with check filters).
--
CUDADataFormats/HcalDigi/interface/DigiCollection.h:4:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 1214 warnings (1213 in non-user code, 1 with check filters).
--
CUDADataFormats/HcalRecHitSoA/interface/RecHitCollection.h:6:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 793 warnings (792 in non-user code, 1 with check filters).
--
CUDADataFormats/HcalDigi/interface/DigiCollection.h:4:10: error: 'CUDADataFormats/CaloCommon/interface/Common.h' file not found [clang-diagnostic-error]
#include "CUDADataFormats/CaloCommon/interface/Common.h"
         ^
Suppressed 1269 warnings (1268 in non-user code, 1 with check filters).
--
gmake: *** [config/SCRAM/GMake/Makefile.coderules:128: code-checks] Error 2
gmake: *** [There are compilation/build errors. Please see the detail log above.] Error 2

abdoulline · 2020-10-10T04:58:05Z

I guess that for HCALxxxGPU conditions (.h and .cc) a natural place would be the same one as for regular conditions:
https://cmssdt.cern.ch/lxr/source/CondFormats/HcalObjects/
and not RecoLocalCalo/HcalRecAlgos ?

See:

cms-sw/cmssw#31703: Patatrack integration - common data formats (5/N)
cms-sw/cmssw#31704: Patatrack integration - calorimeters shared code (6/N)
cms-sw/cmssw#31719: Patatrack integration - ECAL local reconstruction (7/N)
cms-sw/cmssw#31720: Patatrack integration - HCAL local reconstruction (8/N)
cms-sw/cmssw#31721: Patatrack integration - Pixel local reconstruction (9/N)
cms-sw/cmssw#31722: Patatrack integration - Pixel track reconstruction (10/N)
cms-sw/cmssw#31723: Patatrack integration - Pixel vertex reconstruction (11/N)

perrotta

Please find below a few, mostly stylistic, comments.
We also expect to see some CPU/GPU validation plots.
And we are waiting for the build errors found during the clang-tidy run to be resolved before proceeding with the automatic tests and the next steps in the review.

perrotta · 2020-10-19T06:51:54Z

CUDADataFormats/HcalDigi/interface/DigiCollection.h

+  }
+
+  template <>
+  constexpr uint32_t compute_nsamples<Flavor5>(uint32_t const nwords) {


Why this specialization, since WORDS_PER_SAMPLE = 1 / SAMPLES_PER_WORD ?

I would say this is to avoid a division of an integer by 0.5, which would involve floating point operations - while multiplying by 2 can be done with integer operations.

What I meant is: why cannot you use the previous templated compute_nsamples also for Flavor = Flavor5 ? (Then I agree with you that in all cases one could simply multiply by SAMPLES_PER_WORD instead of dividing by WORDS_PER_SAMPLE)

The other flavors, for which WORDS_PER_SAMPLE was 2 (flavor 2), were removed. For those, multiplying by SAMPLES_PER_WORD would again bring floating point in there... I just tried to keep only integer operations in there.

@perrotta

> why cannot you use the previous templated compute_nsamples also for Flavor = Flavor5 ?

The generic version is (lines 89-92)

template <typename Flavor>
constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  return (nwords - Flavor::HEADER_WORDS) / Flavor::WORDS_PER_SAMPLE;
}

For Flavor5 that would become

constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  return (nwords - 1) / 0.5;
}

which does not look very sensible.

The Flavor5 specialisation instead becomes

constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  return (nwords - 1) * 2;
}

which avoids floating point math for an integer multiplication by 2.
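As a minimal sketch of the difference between the two bodies above, the following standalone snippet checks at compile time which arithmetic each expression uses. The function names `nsamples_via_division` and `nsamples_via_multiplication` are hypothetical and exist only for this illustration; the header size of 1 word and the factor 0.5 are the Flavor5 values quoted above. The explicit `static_cast` is added here only to keep the narrowing visible.

```cpp
#include <cstdint>
#include <type_traits>

// Hypothetical helper: the generic formula with WORDS_PER_SAMPLE = 0.5 substituted.
// The division is carried out in double precision and then converted back.
constexpr uint32_t nsamples_via_division(uint32_t const nwords) {
  return static_cast<uint32_t>((nwords - 1) / 0.5);
}

// Hypothetical helper: the specialised formula, integer arithmetic only.
constexpr uint32_t nsamples_via_multiplication(uint32_t const nwords) {
  return (nwords - 1) * 2;
}

// Dividing an unsigned integer by 0.5 yields a double ...
static_assert(std::is_same_v<decltype(uint32_t{6} / 0.5), double>);
// ... while multiplying by 2 stays within the integer types.
static_assert(std::is_integral_v<decltype(uint32_t{6} * 2)>);
// Both give the same count for a payload of 6 words (1 header + 5 data words).
static_assert(nsamples_via_division(6) == nsamples_via_multiplication(6));
```

Both forms agree on the result here; the specialisation only changes how the value is computed.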

> Then I agree with you that in all cases one could simply multiply by SAMPLES_PER_WORD instead of dividing by WORDS_PER_SAMPLE

I have no idea if - in principle - there can be cases where WORDS_PER_SAMPLE is greater than 1.
If that is not foreseen, then indeed we can change the generic version to

template <typename Flavor>
constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  return (nwords - Flavor::HEADER_WORDS) * Flavor::SAMPLES_PER_WORD;
}

If we want to keep this more generic, it could become

template <typename Flavor>
constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  if constexpr (Flavor::SAMPLES_PER_WORD >= 1)
    return (nwords - Flavor::HEADER_WORDS) * Flavor::SAMPLES_PER_WORD;
  else
    return (nwords - Flavor::HEADER_WORDS) / Flavor::WORDS_PER_SAMPLE;
}
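To see how the more generic version above would behave, here is a self-contained sketch that can be compiled with C++17. The two trait structs, `PackedFlavor` and `WideFlavor`, are hypothetical stand-ins invented for this example and are not the Flavor types from DigiCollection.h; the choice of `float` for the fractional constant is likewise an assumption.

```cpp
#include <cstdint>

// Hypothetical flavor with two samples packed in each word (the Flavor5-like case).
struct PackedFlavor {
  static constexpr uint32_t HEADER_WORDS = 1;
  static constexpr uint32_t SAMPLES_PER_WORD = 2;
  static constexpr float WORDS_PER_SAMPLE = 0.5f;
};

// Hypothetical flavor where each sample spans two words.
struct WideFlavor {
  static constexpr uint32_t HEADER_WORDS = 1;
  static constexpr float SAMPLES_PER_WORD = 0.5f;
  static constexpr uint32_t WORDS_PER_SAMPLE = 2;
};

// Same logic as the generic version discussed above: the branch is selected at
// compile time, so only the integer-valued constant is ever used.
template <typename Flavor>
constexpr uint32_t compute_nsamples(uint32_t const nwords) {
  if constexpr (Flavor::SAMPLES_PER_WORD >= 1)
    return (nwords - Flavor::HEADER_WORDS) * Flavor::SAMPLES_PER_WORD;
  else
    return (nwords - Flavor::HEADER_WORDS) / Flavor::WORDS_PER_SAMPLE;
}

// 1 header word + 5 data words, 2 samples per word.
static_assert(compute_nsamples<PackedFlavor>(6) == 10);
// 1 header word + 10 data words, 2 words per sample.
static_assert(compute_nsamples<WideFlavor>(11) == 5);
```

Because the discarded branch of `if constexpr` is not instantiated inside a template, neither flavor evaluates its fractional constant, and a separate specialisation would presumably no longer be needed.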

CUDADataFormats/HcalDigi/interface/DigiCollection.h

perrotta · 2020-10-19T07:07:36Z

EventFilter/HcalRawToDigi/plugins/DecodeGPU.cu

+
+      // get to the payload
+      auto const* payload64 = buffer + 2 + namc + amcoffset;
+      //amcoffset += amcSize;


If this line is not needed, it can be removed. Otherwise, just add another comment line explaining why you want to keep it here even though it is commented out.

perrotta · 2020-10-19T07:07:50Z

EventFilter/HcalRawToDigi/plugins/DecodeGPU.cu

+#ifdef HCAL_RAWDECODE_GPUDEBUG
+      // uhtr header v1 1st 64 bits
+      auto const payload64_w0 = payload64[0];
+      //uint32_t const data_length64 = payload64_w0 & 0xfffff;


If this line is not needed, it can be removed. Otherwise, just add another comment line explaining why you want to keep it here even though it is commented out.

perrotta · 2020-10-19T08:50:17Z