Speed up the use of DDFilteredViews #17778

Dr15Jones · 2017-03-05T19:25:28Z

Profiling of multi-threaded GEN+SIM jobs showed that the initialization time in RunManagerMTWorker::initializeThread was severely impacting the total event throughput for jobs with large numbers of streams (>100) and only modest numbers of events per stream. Using 12 cores on a standard Xeon system under CMSSW_9_0_0_pre5 the call to initializeThread took 517 seconds. After all the changes in this pull request it now takes 51 seconds.

The changes were
-Replace DDSpecificsFilter with filters that are specifics to the needs of the query.
-Have DDFilteredView only take one DDFilter (no code used more than one every anyway).
-Test DDFilters with a standard unit test.

Generalized the usefulness of DDFilteredViewAnalyzer by adding a few more options. This allowed it to be used for a wider range of testing and for profiling.

All code that used DD filtering only used the default DDLogOp::AND. Removing the OR option allows for a great deal of simplification.

All calls to DDSpecificsFilter.addCriteria used the default values for the arguments which had defaults. Therefore those options were removed and the logic only supports the old default values. In addition, only DDCompOp::equals and not_equals are actually used so the other choices were removed.

Created unit test for DDFilter to be used in conjunction with DDFilteredView.

Decreased the time filtering by avoiding making a temporary container through the call to DDLogicalPart::mergedSpecificsV. In tests with Upgrade SIM, the initialization time of OscarProducer was sped up by a 3x.

Removed the redundant DDCompOp::matches and DDCompOp::not_matches

The original DDSpecificsFilter is capable of doing many different kinds of queries which caused the code to be extremely complex and slow. The new DDFilters do one type of query and can be composed using the DDAndFilter.

From profiling the initialization time for OscarProducer it was found that many temporaries of type MuonDDDConstants were being produced. This was taking significant amounts of time. It is now possible to build just one MuonDDDConstants per stream.

The original implementation of DDFilteredView allowed for multiple DDFilters to be used. However, DDFilters are already composible so that functionality is unneeded and just complicates the design and slows the code. Changed the constructor of DDFilteredView to enforce that one and only one DDFilter is ever attached.

Rearranged the logic in the DDFilters to minimize the calls to DDCompareEqual since profiling showed this to be a very expensive call.

cmsbuild · 2017-03-05T19:25:50Z

A new Pull Request was created by @Dr15Jones (Chris Jones) for master.

It involves the following packages:

Alignment/CocoaApplication
CalibMuon/DTCalibration
DetectorDescription/Core
DetectorDescription/RegressionTest
Geometry/CMSCommonData
Geometry/CSCGeometryBuilder
Geometry/CaloEventSetup
Geometry/DTGeometryBuilder
Geometry/EcalTestBeam
Geometry/GEMGeometryBuilder
Geometry/HGCalCommonData
Geometry/HcalCommonData
Geometry/MuonNumbering
Geometry/RPCGeometryBuilder
Geometry/TrackerNumberingBuilder
Geometry/VeryForwardGeometryBuilder
GeometryReaders/XMLIdealGeometryESSource
SLHCUpgradeSimulations/Geometry
SimG4CMS/Calo
SimG4CMS/CherenkovAnalysis
SimG4CMS/Forward
SimG4CMS/HcalTestBeam
SimG4CMS/Muon
SimG4Core/PrintGeomInfo
SimTracker/TrackerMaterialAnalysis
Validation/Geometry
Validation/HGCalValidation

@ghellwig, @civanch, @Dr15Jones, @arunhep, @ianna, @mdhildreth, @dmitrijus, @cmsbuild, @franzoni, @kpedro88, @cerminar, @mmusich, @vanbesien, @davidlange6 can you please review it and eventually sign? Thanks.
@ghellwig, @cseez, @vandreev11, @argiro, @sethzenz, @makortel, @sdevissc, @GiacomoSguazzoni, @rovere, @lgray, @Martin-Grunewald, @bsunanda, @ptcox, @mschrode, @pfs, @tocheng, @VinInn, @threus, @dgulhan, @kpedro88, @venturia this is something you requested to watch as well.
@Muzaffar, @davidlange6, @smuzaffar you are the release manager for this.

cms-bot commands are listed here #13028

Dr15Jones · 2017-03-05T19:39:55Z

please test

cmsbuild · 2017-03-05T19:40:13Z

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/18146/console Started: 2017/03/05 20:41

Dr15Jones · 2017-03-05T19:40:31Z

@ianna I'd appreciate it if you'd do a careful code review of this pull request.

Dr15Jones · 2017-03-05T20:31:43Z

@ianna is there any reason to keep DetectorDescription/RegressionTest/test/tutorial.cc? It looks like it only works if someone externally fills the singletons for a DDCompactView.

Dr15Jones · 2017-03-05T21:04:10Z

Geometry/CSCGeometryBuilder/src/CSCGeometryParsFromDD.cc

@@ -34,14 +34,10 @@ bool CSCGeometryParsFromDD::build( const DDCompactView* cview

  DDSpecificsFilter filter;


I should have changed this to DDSpecificsMatchesValueFilter.

cmsbuild · 2017-03-06T18:03:43Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-17778/18157/summary.html

Dr15Jones · 2017-03-07T01:46:46Z

As far as I can tell, all the comparisons are normal.

ianna · 2017-03-07T08:10:22Z

@Dr15Jones - Yes, I'm just running some extra tests.

ianna

The following files can be removed:
DetectorDescription/RegressionTest/test/tutorial.cc
DetectorDescription/Core/interface/DDQuery.h
DetectorDescription/Core/src/DDQuery.cc

ianna · 2017-03-07T13:01:07Z

+1

kpedro88 · 2017-03-07T16:41:21Z

+1

civanch · 2017-03-07T16:46:29Z

For some reasons my local comparisons show differences for muon hits (nothing for calorimeters) but official comparison are OK. May this be due to some change in rotations?

Another concern: there is a report about broken CMS code rules in this PR (?) - this is unusual.

Dr15Jones · 2017-03-07T18:58:34Z

@civanch the coding rule violations are ones already in the IB. From this pull request the violations are:

/CalibMuon/DTCalibration/plugins/DTGeometryParserFromDDD.cc
[34]
/Geometry/EcalTestBeam/plugins/EcalTBGeometryBuilder.cc
[60, 66]
/Geometry/VeryForwardGeometryBuilder/src/RPAlignmentCorrectionsMethods.cc
[63]
/Geometry/VeryForwardGeometryBuilder/src/RPAlignmentCorrectionsDataSequence.cc
[98]
/Geometry/VeryForwardGeometryBuilder/plugins/TotemRPGeometryESModule.cc
[181, 212]

If you look at the most recent IB Q&A you'll see those same items:
https://cmssdt.cern.ch/SDT/html//rc//slc6_amd64_gcc530/www/mon/9.1-mon-23/CMSSW_9_1_X_2017-03-06-2300//codeRules/cmsCRPage.html

Dr15Jones · 2017-03-07T19:00:33Z

@civanch How can one reproduce the comparison problems you are seeing locally?

civanch · 2017-03-07T23:22:14Z

@Dr15Jones , I run 600 MeanBias events at recent IB with 8 threads, then merge locally this PR and run again. Number of hits in CSC is not big for this statistics (~500) but the difference is visible. It may be a false alarm, because these hits may be due to few events. My little worry is due to expectation that nothing should be affected and sim history should not change at all. Probably, I have to rerun with 1 thread.

In calorimeters number of hits is significant and distributions are really very similar.

Dr15Jones · 2017-03-08T02:30:37Z

@civanch do you happen to have the exact cmsDriver command you used to generate the configuration?

civanch · 2017-03-08T14:46:25Z

+1
when I run with 1 thread (or stream?) the hits are identical. The false alarm was connected with 8-thread runs.

Dr15Jones · 2017-03-08T15:26:18Z

@civanch David Dagenhart and I discussed this and one would expect the possibility of different results using multiple threads. We tie random number seeds to StreamIDs. However, when running different jobs using the same configuration with multiple threads, the streams in the two different jobs are likely to process different number of events and therefore use different random number seeds.

As an example, say you are going to generate 8 events and use 4 streams. In the first job you might get the following stream to #events processed

stream 1: 2 events
stream 2: 2 events
stream 3: 2 events
stream 4: 2 events

in another job you could get

stream 1: 1 event
stream 2: 1 event
stream 3: 3 events
stream 4: 3 events

So between these two jobs, you'd have 6 events which saw the same random numbers and 2 events which saw different random numbers.

Dr15Jones · 2017-03-08T15:46:08Z

@dmitrijus @vanbesien @cerminar @franzoni @mmusich @ghellwig @arunhep please review this pull request.

dmitrijus · 2017-03-08T15:52:11Z

+1

Dr15Jones · 2017-03-09T15:38:04Z

@cerminar @franzoni @mmusich @ghellwig @arunhep please review. alca is the last signature needed.

Dr15Jones added 11 commits March 2, 2017 13:45

Generalize DDFilteredViewAnalyzer

69d2d92

Generalized the usefulness of DDFilteredViewAnalyzer by adding a few more options. This allowed it to be used for a wider range of testing and for profiling.

Removed DDLogOp::OR option from DD filtering

2758393

All code that used DD filtering only used the default DDLogOp::AND. Removing the OR option allows for a great deal of simplification.

Added unit test for DDFilter

5c93d01

Created unit test for DDFilter to be used in conjunction with DDFilteredView.

Sped up DDSpecificsFilter

7b697b3

Decreased the time filtering by avoiding making a temporary container through the call to DDLogicalPart::mergedSpecificsV. In tests with Upgrade SIM, the initialization time of OscarProducer was sped up by a 3x.

Only allow DDCompOp::equals or DDCompOp::not_equals

674389e

Removed the redundant DDCompOp::matches and DDCompOp::not_matches

Create single purpose DDFilters

2e129ad

The original DDSpecificsFilter is capable of doing many different kinds of queries which caused the code to be extremely complex and slow. The new DDFilters do one type of query and can be composed using the DDAndFilter.

Use specialized DDFilters in Simulation

6e5ba78

Avoid calling DDCompareEqual because it is slow

f84f3a4

Rearranged the logic in the DDFilters to minimize the calls to DDCompareEqual since profiling showed this to be a very expensive call.

cmsbuild added this to the Next CMSSW_9_1_X milestone Mar 5, 2017

cmsbuild added alca-pending comparison-pending dqm-pending geometry-pending orp-pending pending-signatures simulation-pending tests-pending upgrade-pending labels Mar 5, 2017

Renamed DDSpecificsAnyValueFilter to DDSpecificsHasNamedValueFilter

8edde43

cmsbuild added tests-started and removed tests-pending labels Mar 5, 2017

Dr15Jones commented Mar 5, 2017

View reviewed changes

cmsbuild added tests-approved and removed tests-started labels Mar 6, 2017

cmsbuild added comparison-available and removed comparison-pending labels Mar 6, 2017

ianna approved these changes Mar 7, 2017

View reviewed changes

cmsbuild added geometry-approved and removed geometry-pending labels Mar 7, 2017

cmsbuild added upgrade-approved and removed upgrade-pending labels Mar 7, 2017

cmsbuild added simulation-approved and removed simulation-pending labels Mar 8, 2017

cmsbuild added dqm-approved and removed dqm-pending labels Mar 8, 2017

davidlange6 merged commit b215603 into cms-sw:master Mar 10, 2017

Dr15Jones deleted the fasterDDFiltering branch March 14, 2017 16:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up the use of DDFilteredViews #17778

Speed up the use of DDFilteredViews #17778

Dr15Jones commented Mar 5, 2017

cmsbuild commented Mar 5, 2017

Dr15Jones commented Mar 5, 2017

cmsbuild commented Mar 5, 2017 •

edited

Dr15Jones commented Mar 5, 2017

Dr15Jones commented Mar 5, 2017

Dr15Jones Mar 5, 2017

cmsbuild commented Mar 6, 2017

Dr15Jones commented Mar 7, 2017

ianna commented Mar 7, 2017

ianna left a comment

ianna commented Mar 7, 2017

kpedro88 commented Mar 7, 2017

civanch commented Mar 7, 2017

Dr15Jones commented Mar 7, 2017

Dr15Jones commented Mar 7, 2017

civanch commented Mar 7, 2017

Dr15Jones commented Mar 8, 2017

civanch commented Mar 8, 2017

Dr15Jones commented Mar 8, 2017 •

edited

Dr15Jones commented Mar 8, 2017

dmitrijus commented Mar 8, 2017

Dr15Jones commented Mar 9, 2017

		@@ -34,14 +34,10 @@ bool CSCGeometryParsFromDD::build( const DDCompactView* cview

		DDSpecificsFilter filter;

Speed up the use of DDFilteredViews #17778

Speed up the use of DDFilteredViews #17778

Conversation

Dr15Jones commented Mar 5, 2017

cmsbuild commented Mar 5, 2017

Dr15Jones commented Mar 5, 2017

cmsbuild commented Mar 5, 2017 • edited

Dr15Jones commented Mar 5, 2017

Dr15Jones commented Mar 5, 2017

Dr15Jones Mar 5, 2017

Choose a reason for hiding this comment

cmsbuild commented Mar 6, 2017

Dr15Jones commented Mar 7, 2017

ianna commented Mar 7, 2017

ianna left a comment

Choose a reason for hiding this comment

ianna commented Mar 7, 2017

kpedro88 commented Mar 7, 2017

civanch commented Mar 7, 2017

Dr15Jones commented Mar 7, 2017

Dr15Jones commented Mar 7, 2017

civanch commented Mar 7, 2017

Dr15Jones commented Mar 8, 2017

civanch commented Mar 8, 2017

Dr15Jones commented Mar 8, 2017 • edited

Dr15Jones commented Mar 8, 2017

dmitrijus commented Mar 8, 2017

Dr15Jones commented Mar 9, 2017

cmsbuild commented Mar 5, 2017 •

edited

Dr15Jones commented Mar 8, 2017 •

edited