DataFormats -- changes for large object passed by value found by clang s... #612

Merged
merged 1 commit into from Sep 19, 2013
gartung
Member

@gartung gartung commented Aug 26, 2013

...tatic analyzer cms.ArgSizeChecker

@cmsbuild
Contributor

A new Pull Request was created by @gartung (Patrick Gartung) for CMSSW_7_0_X.

DataFormats -- changes for large object passed by value found by clang s...

It involves the following packages:

DataFormats/CSCRecHit
DataFormats/TrackingRecHit
DataFormats/PatCandidates
DataFormats/ParticleFlowCandidate
DataFormats/BTauReco
DataFormats/L1Trigger
DataFormats/L1CSCTrackFinder
DataFormats/EgammaCandidates
DataFormats/VZero
DataFormats/DTRecHit
DataFormats/METReco
DataFormats/HepMCCandidate
DataFormats/TauReco
DataFormats/CaloTowers
DataFormats/RecoCandidate
DataFormats/Common

@vciulli, @Dr15Jones, @thspeer, @slava77, @vadler, @ktf can you please review it and eventually sign? Thanks.
You can sign-off by replying to this message having '+1' in the first line of your reply.
You can reject by replying to this message having '-1' in the first line of your reply.

@cmsbuild
Contributor

Pull request #612 was updated. Signatures reset, please check and sign again.

@Dr15Jones
Contributor

+1

@slava77
Contributor

slava77 commented Aug 26, 2013

working @slava77

@slava77
Contributor

slava77 commented Aug 26, 2013

I tried in CMSSW_7_0_X_2013-08-26-1400
git cms-merge-topic --unsafe 612
==>
91 files changed, 134 insertions(+), 156 deletions(-)
What's happening?

This commit parent is e448454
which looks like it comes from the Aug 25 IB
git describe --tags e448454
CMSSW_7_0_X_2013-08-25-1700-18-ge448454

What's the reference to use for testing?

@Dr15Jones
Contributor

Let's ask the developer to rebase to the present head

@slava77
Contributor

slava77 commented Aug 27, 2013

+1

tested relative to CMSSW_7_0_X_2013-08-26-1400 + e448454
The incremental test area also included #623

No changes, as expected.

@cmsbuild
Contributor

The following categories have been signed by @slava77: Reconstruction

@cms-git-reconstruction

@vadler

vadler commented Aug 27, 2013

+1
DataFormats/PatCandidates is the only analysis package.
Tested with CMSSW_7_0_X_2013-08-27-1400, #622 , #623 (RecoMET/METFilters).

@cmsbuild
Contributor

The following categories have been signed by @vadler: Analysis

@cms-git-analysis

@vciulli
Contributor

vciulli commented Sep 2, 2013

+1

@slava77
Contributor

slava77 commented Sep 11, 2013

Is L1 going to sign this? @mulhearn

@mulhearn
Contributor

Sorry, just getting this now... as you can see I wasn't in the original notification.

@mulhearn
Contributor

There's a problem: CMSSW_7_0_X_2013-08-27-1400 is no longer available...

@slava77
Contributor

slava77 commented Sep 11, 2013

On 9/11/13 1:38 PM, mulhearn wrote:

There's a problem: CMSSW_7_0_X_2013-08-27-1400 is no longer available...

I think the pull can be merged into a later IB as well.
So, you can pick 09-11-0200

    --slava



@mulhearn
Contributor

Oh, yes, you are correct. When I saw the large number of packages that "git cms-merge-topic 612" under 09-11-0200 checked out I mistakenly thought something went wrong. In fact, there are just a lot of dependencies from these changes... recompiling will take a little while...

@mulhearn
Contributor

Compiles under CMSSW_7_0_X_2013-09-11-0200...

@mulhearn
Contributor

Under vanilla CMSSW_7_0_X_2013-09-11-0200 I am able to run e.g. runTheMatrix.py -l 30. But after git cms-merge-topic 612, which checks out a whole bunch of dependent packages, and then rebuilding, I get complaints about a missing HLTrigger/HLTfilters/triggerResultsFilter_cfi (full listing below).

mulhearn@lxplus0411>cmsDriver.py ZMM_8TeV_cfi --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM
GEN,SIM,ENDJOB
We have determined that this is simulation (if not, rerun cmsDriver.py with --data)
Step: GEN Spec:
Loading generator fragment from Configuration.Generator.ZMM_8TeV_cfi
Traceback (most recent call last):
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/bin/slc6_amd64_gcc481/cmsDriver.py", line 43, in <module>
run()
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/bin/slc6_amd64_gcc481/cmsDriver.py", line 15, in run
configBuilder.prepare()
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/Applications/ConfigBuilder.py", line 1946, in prepare
self.addStandardSequences()
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/Applications/ConfigBuilder.py", line 669, in addStandardSequences
getattr(self,"prepare_"+stepName)(sequence = getattr(self,stepName+"DefaultSeq"))
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/Applications/ConfigBuilder.py", line 1279, in prepare_GEN
self.loadDefaultOrSpecifiedCFF(sequence,self.GENDefaultCFF)
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/Applications/ConfigBuilder.py", line 1118, in loadDefaultOrSpecifiedCFF
l=self.loadAndRemember(defaultCFF)
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/Applications/ConfigBuilder.py", line 282, in loadAndRemember
self.process.load(includeFile)
File "/afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_X_2013-09-11-0200/python/FWCore/ParameterSet/Config.py", line 508, in load
module = __import__(moduleName)
File "/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/python/Configuration/StandardSequences/Generator_cff.py", line 74, in <module>
import HLTrigger.HLTfilters.triggerResultsFilter_cfi
ImportError: No module named triggerResultsFilter_cfi

@slava77
Contributor

slava77 commented Sep 11, 2013

sounds like fun
@gartung @mulhearn
Maybe Patrick can rebase this one as well.
Then we can all have more fun resigning it :)

@cmsbuild
Contributor

Pull request #612 was updated. @vciulli, @Dr15Jones, @ktf, @thspeer, @slava77, @vadler, @mulhearn can you please check and sign again.

@gartung
Member Author

gartung commented Sep 11, 2013

Seems to work after rebase

Validation/RecoMET (header)
Validation/RecoMuon (header)
Validation/RecoParticleFlow (header)
Validation/RecoTau (header)
Validation/RecoVertex (header)
[lxplus439] /tmp/gartung/CMSSW_7_0_X_2013-09-11-0200 $ cmsDriver.py ZMM_8TeV_cfi --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM
GEN,SIM,ENDJOB
We have determined that this is simulation (if not, rerun cmsDriver.py with --data)
Step: GEN Spec:
Loading generator fragment from Configuration.Generator.ZMM_8TeV_cfi
Step: SIM Spec:
Step: ENDJOB Spec:
globaltag = PRE_ST62_V8::All
Starting cmsRun ZMM_8TeV_cfi_GEN_SIM.py
globaltag = PRE_ST62_V8::All
271 DQMStore::DQMStore
MSTU(12) changed from 0 to 12345
WARNING: MCParticlePairFilter : size of some vectors not matching with 2!!
Set Driver verbosity to -2
New QGSP_FTFP_BERT physics list, replaces LEP with FTF/P for p/n/pi (/K?) Thresholds:
1) between BERT and FTF/P over the interval 6 to 8 GeV.
2) between FTF/P and QGS/P over the interval 12 to 25 GeV.
-- quasiElastic was asked to be 1
Changed to 1 for QGS and to 0 (must be false) for FTF

@Dr15Jones
Contributor

+1

@gartung
Member Author

gartung commented Sep 11, 2013

****************** PYINIT: initialization of PYTHIA routines *****************
 ==== PYTHIA WILL USE LHAPDF ====
 *************************************
 *       LHAPDF Version 5.8.5        *
 *   Configured for the following:   *
 *             All PDFs              *
 *          LOW MEMORY option        *
 *    Maximum  1 concurrent set(s)   *
 *************************************
 >>>>>> PDF description: <<<<<<
 CTEQ6L1 - LO with LO alpha_s
 Reference:
 J. Pumplin, D.R. Stump, J. Huston, H.L. Lai, P. Nadolsky,
 W.K. Tung
 hep-ph/0201195
 >>>>>>                   <<<<<<
 Parametrization: CTEQ6
 ==============================================
 PDFset name /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/external/lhapdf/5.8
 with          1 members
 ====  initialized. ===========================
 Strong coupling at Mz for PDF is:  0.12978
 ==============================================================================
 I                                                                            I
 I              PYTHIA will be initialized for a p on p collider              I
 I                  at   8000.000 GeV center-of-mass energy                   I
 I                                                                            I
 ==============================================================================
 ******** PYMAXI: summary of differential cross-section maximum search ********
           ==========================================================
           I                                      I                 I
           I  ISUB  Subprocess name               I  Maximum value  I
           I                                      I                 I
           ==========================================================
           I                                      I                 I
           I    1   f + fbar -> gamma*/Z0         I    2.1618D-06   I
           I   96   Semihard QCD 2 -> 2           I    8.0723D+03   I
           I                                      I                 I
           ==========================================================
 ****** PYMULT: initialization of multiple interactions for MSTP(82) = 4 ******
        pT0 = 2.70 GeV gives sigma(parton-parton) = 4.56D+02 mb: accepted
 ****** PYMIGN: initialization of multiple interactions for MSTP(82) = 4 ******
        pT0 = 2.70 GeV gives sigma(parton-parton) = 1.80D+02 mb: accepted
 ********************** PYINIT: initialization completed **********************
Begin processing the 1st record. Run 1, Event 1, LumiSection 1 at 11-Sep-2013 19:42:45.841 CEST
Begin processing the 2nd record. Run 1, Event 2, LumiSection 1 at 11-Sep-2013 19:42:45.873 CEST
Begin processing the 3rd record. Run 1, Event 3, LumiSection 1 at 11-Sep-2013 19:42:45.877 CEST
Begin processing the 4th record. Run 1, Event 4, LumiSection 1 at 11-Sep-2013 19:42:45.931 CEST
Begin processing the 5th record. Run 1, Event 5, LumiSection 1 at 11-Sep-2013 19:42:59.548 CEST
Begin processing the 6th record. Run 1, Event 6, LumiSection 1 at 11-Sep-2013 19:43:31.565 CEST
Begin processing the 7th record. Run 1, Event 7, LumiSection 1 at 11-Sep-2013 19:44:28.384 CEST
Begin processing the 8th record. Run 1, Event 8, LumiSection 1 at 11-Sep-2013 19:45:34.736 CEST
Begin processing the 9th record. Run 1, Event 9, LumiSection 1 at 11-Sep-2013 19:46:11.248 CEST
Begin processing the 10th record. Run 1, Event 10, LumiSection 1 at 11-Sep-2013 19:46:46.052 CEST
1********* PYSTAT:  Statistics on Number of Events and Cross-sections *********
 ==============================================================================
 I                                  I                            I            I
 I            Subprocess            I      Number of points      I    Sigma   I
 I                                  I                            I            I
 I----------------------------------I----------------------------I    (mb)    I
 I                                  I                            I            I
 I N:o Type                         I    Generated         Tried I            I
 I                                  I                            I            I
 ==============================================================================
 I                                  I                            I            I
 I   0 All included subprocesses    I           10            16 I  1.144D-06 I
 I   1 f + fbar -> gamma*/Z0        I           10            16 I  1.144D-06 I
 I                                  I                            I            I
 ==============================================================================
 ********* Total number of errors, excluding junctions =        0 *************
 ********* Total number of errors, including junctions =        0 *************
 ********* Total number of warnings =                           0 *************
 ********* Fraction of events that fail fragmentation cuts =  0.00000 *********
=============================================
MessageLogger Summary
Severity    # Occurrences   Total Occurrences
--------    -------------   -----------------
336.219u 1.691s 5:41.51 98.9%   0+0k 0+0io 886pf+0w

@mulhearn
Contributor

Working

@mulhearn
Contributor

Sorry, @gartung, after rebuilding, the problem with runTheMatrix.py persists for me. I noticed in your above listing that you ran runTheMatrix.py without rebuilding first. I can run fine that way too. It looks to me like something is going wrong in the dependent packages during the build that breaks runTheMatrix.py / cmsDriver.py, but I haven't sorted it out...

mulhearn@lxplus0404>cmsrel CMSSW_7_0_X_2013-09-11-0200
mulhearn@lxplus0404>cd CMSSW_7_0_X_2013-09-11-0200/
mulhearn@lxplus0404>cd src/
mulhearn@lxplus0404>cmsenv
mulhearn@lxplus0404>git cms-merge-topic 612
No release tags specified, using default CMSSW_7_0_X_2013-09-11-0200.
You are on branch CMSSW_7_0_X
Checking out FWCore/Version in tag CMSSW_7_0_X_2013-09-11-0200.
Cloning into '/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/src'...
remote: Counting objects: 638, done.
remote: Compressing objects: 100% (242/242), done.
remote: Total 322 (delta 263), reused 136 (delta 77)
Receiving objects: 100% (322/322), 53.12 KiB | 0 bytes/s, done.
Resolving deltas: 100% (263/263), completed with 234 local objects.
Branch from-CMSSW_7_0_X_2013-09-11-0200 set up to track remote branch CMSSW_7_0_X from official-cmssw.
Switched to a new branch 'from-CMSSW_7_0_X_2013-09-11-0200'
...
[SNIP]
...
Validation/RecoMET (header)
Validation/RecoMuon (header)
Validation/RecoParticleFlow (header)
Validation/RecoTau (header)
Validation/RecoVertex (header)
mulhearn@lxplus0404>scram b -j 8
***WARNING: Multiple usage of "SimDataFormats/PileupSummaryInfo". Please cleanup "use" in "non-export" section of "src/Validation/RecoVertex/BuildFile".

Local Products Rules ..... started
Local Products Rules ..... done
****WARNING: Invalid tool HLTrigger/HLTanalyzers. Please fix src/DQM/HcalMonitorTasks/BuildFile.xml file.
****WARNING: Invalid tool CondCore/EgammaPlugins. Please fix src/RecoEgamma/EgammaTools/BuildFile.xml file.
****WARNING: Invalid tool DQMOffline/JetMET. Please fix src/DQMOffline/JetMET/plugins/BuildFile.xml file.
Building CMSSW version CMSSW_7_0_X_2013-09-11-0200 ----
------- copying files from src/Alignment/CommonAlignmentProducer/scripts -------
copied AlCaHLTBitMon_ParallelJobs.py
copied AlCaHLTBitMon_QueryRunRegistry.py
------- copying files from src/Alignment/KalmanAlignmentAlgorithm/scripts -------
copied kaps.sh
...
[SNIP]
...
@@@@ ----> OK EDM Class Version
mulhearn@lxplus0404>runTheMatrix.py -l 30
processing relval_standard
request for INPUT for None
processing relval_highstats
request for INPUT for None
processing relval_pileup
request for INPUT for None
processing relval_generator
request for INPUT for None
processing relval_production
request for INPUT for None
processing relval_ged
request for INPUT for None
ignoring relval_upgrade from default matrix
processing relval_identity
request for INPUT for None
Running in 4 thread(s)

Preparing to run 30.0 ZMM+ZMM+DIGI+RECO+HARVEST

in: /afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/src going to execute cd 30.0_ZMM+ZMM+DIGI+RECO+HARVEST

cmsDriver.py ZMM_8TeV_cfi --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM --fileout file:step1.root > step1_ZMM+ZMM+DIGI+RECO+HARVEST.log 2>&1

ERROR executing cd 30.0_ZMM+ZMM+DIGI+RECO+HARVEST; cmsDriver.py ZMM_8TeV_cfi --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM --fileout file:step1.root > step1_ZMM+ZMM+DIGI+RECO+HARVEST.log 2>&1; ret= 256
30.0_ZMM+ZMM+DIGI+RECO+HARVEST Step0-FAILED Step1-NOTRUN Step2-NOTRUN Step3-NOTRUN - time date Thu Sep 12 10:23:40 2013-date Thu Sep 12 10:23:34 2013; exit: 256 0 0 0
0 0 0 0 tests passed, 1 0 0 0 failed

@mulhearn
Contributor

Sorry forgot the pre block:

mulhearn@lxplus0404>cmsrel CMSSW_7_0_X_2013-09-11-0200 
mulhearn@lxplus0404>cd CMSSW_7_0_X_2013-09-11-0200/
mulhearn@lxplus0404>cd src/
mulhearn@lxplus0404>cmsenv
mulhearn@lxplus0404>git cms-merge-topic 612
No release tags specified, using default CMSSW_7_0_X_2013-09-11-0200.
You are on branch CMSSW_7_0_X
Checking out FWCore/Version in tag CMSSW_7_0_X_2013-09-11-0200.
Cloning into '/afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/src'...
remote: Counting objects: 638, done.
remote: Compressing objects: 100% (242/242), done.
remote: Total 322 (delta 263), reused 136 (delta 77)
Receiving objects: 100% (322/322), 53.12 KiB | 0 bytes/s, done.
Resolving deltas: 100% (263/263), completed with 234 local objects.
Branch from-CMSSW_7_0_X_2013-09-11-0200 set up to track remote branch CMSSW_7_0_X from official-cmssw.
Switched to a new branch 'from-CMSSW_7_0_X_2013-09-11-0200'
...
[SNIP]
...
Validation/RecoMET (header)
Validation/RecoMuon (header)
Validation/RecoParticleFlow (header)
Validation/RecoTau (header)
Validation/RecoVertex (header)
mulhearn@lxplus0404>scram b -j 8
***WARNING: Multiple usage of "SimDataFormats/PileupSummaryInfo". Please cleanup "use" in "non-export" section of "src/Validation/RecoVertex/BuildFile".
>> Local Products Rules ..... started
>> Local Products Rules ..... done
****WARNING: Invalid tool HLTrigger/HLTanalyzers. Please fix src/DQM/HcalMonitorTasks/BuildFile.xml file.
****WARNING: Invalid tool CondCore/EgammaPlugins. Please fix src/RecoEgamma/EgammaTools/BuildFile.xml file.
****WARNING: Invalid tool DQMOffline/JetMET. Please fix src/DQMOffline/JetMET/plugins/BuildFile.xml file.
>> Building CMSSW version CMSSW_7_0_X_2013-09-11-0200 ----
------- copying files from src/Alignment/CommonAlignmentProducer/scripts -------
>> copied AlCaHLTBitMon_ParallelJobs.py
>> copied AlCaHLTBitMon_QueryRunRegistry.py
------- copying files from src/Alignment/KalmanAlignmentAlgorithm/scripts -------
>> copied kaps.sh
...
[SNIP]
...
@@@@ ----> OK  EDM Class Version 
mulhearn@lxplus0404>runTheMatrix.py -l 30
processing  relval_standard
request for INPUT for  None
processing  relval_highstats
request for INPUT for  None
processing  relval_pileup
request for INPUT for  None
processing  relval_generator
request for INPUT for  None
processing  relval_production
request for INPUT for  None
processing  relval_ged
request for INPUT for  None
ignoring relval_upgrade from default matrix
processing  relval_identity
request for INPUT for  None
Running in 4 thread(s)
Preparing to run 30.0 ZMM+ZMM+DIGI+RECO+HARVEST
# in: /afs/cern.ch/work/m/mulhearn/relval/branch_7_0_X/CMSSW_7_0_X_2013-09-11-0200/src going to execute cd 30.0_ZMM+ZMM+DIGI+RECO+HARVEST
 cmsDriver.py ZMM_8TeV_cfi  --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM --fileout file:step1.root  > step1_ZMM+ZMM+DIGI+RECO+HARVEST.log  2>&1
 
ERROR executing  cd 30.0_ZMM+ZMM+DIGI+RECO+HARVEST; cmsDriver.py ZMM_8TeV_cfi  --conditions auto:startup -s GEN,SIM --datatier GEN-SIM -n 10 --relval 18000,300 --eventcontent RAWSIM --fileout file:step1.root  > step1_ZMM+ZMM+DIGI+RECO+HARVEST.log  2>&1;  ret= 256
30.0_ZMM+ZMM+DIGI+RECO+HARVEST Step0-FAILED Step1-NOTRUN Step2-NOTRUN Step3-NOTRUN  - time date Thu Sep 12 10:23:40 2013-date Thu Sep 12 10:23:34 2013; exit: 256 0 0 0
0 0 0 0 tests passed, 1 0 0 0 failed

@slava77
Contributor

slava77 commented Sep 12, 2013

@mulhearn @gartung
Hi Mike,

It seems to me that the rebase was made on top of CMSSW_7_0_X_2013-09-11-1400
I don't have any problems running the short++ matrix with "--useInput all" (which bypasses the sim step).
These are workflows (1000.0 1001.0 1102.0 200.0 201.0 25.0 4.17 4.22 4.29 4.37 4.38 4.39 4.53 5.1 8.0)

...
also, wf 30
runTheMatrix.py -l 30
completes just fine

@mulhearn
Contributor

Thanks Slava.

I am cutting and pasting your sequence at: /afs/cern.ch/work/m/mulhearn/relval/CMSSW_7_0_X_2013-09-11-1400-test612

I am running exactly the same sequence, with the only caveat that I use tcsh (so "setenv SCRAM_ARCH slc5_amd64_gcc481"). Where is your working directory? I am at
/afs/cern.ch/work/m/mulhearn/relval/CMSSW_7_0_X_2013-09-11-1400-test612

@slava77
Contributor

slava77 commented Sep 12, 2013

The last steps after it built successfully were

runTheMatrix.py -s --useInput all >& matrix.log &
as I see it, 1000.0 already passed the steps where it failed for you.
lxplus445:/tmp/slava77/CMSSW_7_0_X_2013-09-11-1400-test612/src

So, something else is going on in your case

@mulhearn
Contributor

After cutting and pasting exactly as posted, it still fails:

/afs/cern.ch/work/m/mulhearn/relval/CMSSW_7_0_X_2013-09-11-1400-test612/src/matrix.log

@slava77
Contributor

slava77 commented Sep 12, 2013

@mulhearn

Weird. Something is going wrong with your build.
The code is the same in your src/ directory.
I can reproduce your error if I cmsenv in your area.

Looking at details:
in my case it looks for hltHighLevel_cfi file and finds it in
$CMSSW_BASE/cfipython/slc5_amd64_gcc481/HLTrigger/HLTfilters/hltHighLevel_cfi.py

in your case in lxplus445:/tmp/mulhearn/CMSSW_7_0_X_2013-09-11-1400 it doesn't even find the directory
$CMSSW_BASE/cfipython/slc5_amd64_gcc481/HLTrigger/HLTfilters

Note that in my case I have the following directories in cfipython:
Geometry HLTrigger PhysicsTools RecoEgamma RecoTauTag RecoTracker

you have only
Geometry RecoEgamma

So, you should be looking for some error messages in the build log file.
Could you put yours somewhere?
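
The comparison of the two cfipython areas described above can be sketched as follows. This is a minimal illustration, not a CMSSW tool: `listSubdirs` and `missingFrom` are hypothetical helper names, and the code assumes a POSIX system where `opendir`/`readdir`/`stat` are available.

```cpp
#include <dirent.h>
#include <sys/stat.h>

#include <algorithm>
#include <cassert>
#include <iterator>
#include <set>
#include <string>

// Names of the immediate subdirectories of `dir`.
// Returns an empty set if `dir` cannot be opened.
inline std::set<std::string> listSubdirs(const std::string& dir) {
  assert(!dir.empty());
  std::set<std::string> names;
  DIR* handle = opendir(dir.c_str());
  if (handle == nullptr) return names;
  while (const dirent* entry = readdir(handle)) {
    const std::string name = entry->d_name;
    if (name == "." || name == "..") continue;
    struct stat info;
    // stat() follows symlinks, so a link to a directory counts as a directory.
    if (stat((dir + "/" + name).c_str(), &info) == 0 && S_ISDIR(info.st_mode)) {
      names.insert(name);
    }
  }
  closedir(handle);
  return names;
}

// Entries present in `reference` but absent from `candidate`.
inline std::set<std::string> missingFrom(const std::set<std::string>& reference,
                                         const std::set<std::string>& candidate) {
  std::set<std::string> missing;
  std::set_difference(reference.begin(), reference.end(),
                      candidate.begin(), candidate.end(),
                      std::inserter(missing, missing.end()));
  return missing;
}
```

Calling `missingFrom(listSubdirs(goodArea), listSubdirs(badArea))` with the two `$CMSSW_BASE/cfipython/slc5_amd64_gcc481` paths would list the packages whose generated cfi files never appeared in the failing build; with the directory names quoted above, that set would be HLTrigger, PhysicsTools, RecoTauTag and RecoTracker.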

@slava77
Contributor

slava77 commented Sep 12, 2013

(I can't read files in your afs/work)

@gartung
Member Author

gartung commented Sep 12, 2013

@mulhearn @Slava

I built and installed CMSSW_7_0_X_2013-09-11-1400 on a Fermilab build machine. I created a project area for CMSSW_7_0_X_2013-09-11-1400 and followed the steps Slava did. I also see many directories under cfipython. I am running runTheMatrix.py right now and it did not fail immediately as in your case.

@Slava

Slava commented Sep 12, 2013

I think you refer to @slava77, rather than me.

@mulhearn
Contributor

Had to step out for a minute... @slava77: changed AFS permissions, but also you can find it here lxplus445:/tmp/mulhearn/abuild.log

@mulhearn
Contributor

+1 Above problem is clearly on my end, not replicated by @slava77, so trusting the above reports and signing off.

@vciulli
Contributor

vciulli commented Sep 13, 2013

+1

@vciulli
Contributor

vciulli commented Sep 13, 2013

Testing on CMSSW_7_0_X_2013-09-09-1400 showed some problems in all generator workflows using LHE files

501.0_MinBias_TuneZ2star_8TeV_pythia6+MinBias_TuneZ2star_8TeV_pythia6+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:10:19 2013-date Fri Sep 13 13:09:45 2013; exit: 0 0
502.0_QCD_Pt-30_TuneZ2star_8TeV_pythia6+QCD_Pt-30_TuneZ2star_8TeV_pythia6+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:10:23 2013-date Fri Sep 13 13:09:49 2013; exit: 0 0
503.0_TT_TuneZ2star_8TeV_pythia6-evtgen+TT_TuneZ2star_8TeV_pythia6-evtgen+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:10:25 2013-date Fri Sep 13 13:09:51 2013; exit: 0 0
504.0_DYToLL_M-50_TuneZ2star_8TeV_pythia6-tauola+DYToLL_M-50_TuneZ2star_8TeV_pythia6-tauola+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:10:28 2013-date Fri Sep 13 13:09:55 2013; exit: 0 0
505.0_WToLNu_TuneZ2star_8TeV_pythia6-tauola+WToLNu_TuneZ2star_8TeV_pythia6-tauola+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:11:00 2013-date Fri Sep 13 13:10:28 2013; exit: 0 0
506.0_MinBias_8TeV_pythia8+MinBias_8TeV_pythia8+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:11:07 2013-date Fri Sep 13 13:10:29 2013; exit: 0 0
507.0_QCD_Pt-30_8TeV_pythia8+QCD_Pt-30_8TeV_pythia8+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:11:06 2013-date Fri Sep 13 13:10:32 2013; exit: 0 0
508.0_QCD_Pt-30_8TeV_herwig6+QCD_Pt-30_8TeV_herwig6+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:11:09 2013-date Fri Sep 13 13:10:36 2013; exit: 0 0
509.0_QCD_Pt-30_8TeV_herwigpp+QCD_Pt-30_8TeV_herwigpp+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:12:30 2013-date Fri Sep 13 13:11:08 2013; exit: 0 0
510.0_GluGluTo2Jets_M-100_8TeV_exhume+GluGluTo2Jets_M-100_8TeV_exhume+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:11:48 2013-date Fri Sep 13 13:11:10 2013; exit: 0 0
517.0_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:11:55 2013-date Fri Sep 13 13:11:11 2013; exit: 16640 0
518.0_QCD_Ht-250To500_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-250To500_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:11:45 2013-date Fri Sep 13 13:11:12 2013; exit: 16640 0
519.0_QCD_Ht-500To1000_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-500To1000_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:12:18 2013-date Fri Sep 13 13:11:53 2013; exit: 16640 0
520.0_TTJets_TuneZ2star_8TeV_madgraph-tauola+TTJets_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:12:14 2013-date Fri Sep 13 13:11:58 2013; exit: 16640 0
521.0_WJetsLNu_TuneZ2star_8TeV_madgraph-tauola+WJetsLNu_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:12:31 2013-date Fri Sep 13 13:12:03 2013; exit: 16640 0
522.0_ZJetsLNu_TuneZ2star_8TeV_madgraph-tauola+ZJetsLNu_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:12:51 2013-date Fri Sep 13 13:12:16 2013; exit: 16640 0
539.0_ZJetsLNu_Tune4C_8TeV_madgraph-pythia8+ZJetsLNu_Tune4C_8TeV_madgraph-pythia8+HARVGEN Step0-FAILED Step1-NOTRUN - time date Fri Sep 13 13:12:44 2013-date Fri Sep 13 13:12:27 2013; exit: 16640 0
540.0_ReggeGribovPartonMC_EposLHC_5TeV_pPb+ReggeGribovPartonMC_EposLHC_5TeV_pPb+HARVGEN Step0-PASSED Step1-PASSED - time date Fri Sep 13 13:13:14 2013-date Fri Sep 13 13:12:38 2013; exit: 0 0
11 11 tests passed, 7 0 failed

We need some further investigation
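
The "exit:" and "ret=" numbers in these listings (256, 16640) are multiples of 256, which is what a raw Linux wait status looks like when the child exited normally. Assuming that is what runTheMatrix.py prints (an assumption, not confirmed in this thread), the underlying exit code can be recovered as in the sketch below; `DecodedStatus` and `decodeWaitStatus` are hypothetical names.

```cpp
#include <cassert>
#include <cstdio>

// Result of splitting a raw Linux wait status into its two parts.
struct DecodedStatus {
  int exitCode;  // bits 8-15: the value the child passed to exit()
  int signal;    // bits 0-6: terminating signal number, 0 if none
};

// Decode a raw wait status such as the one returned by system().
// Assumption: the status uses the conventional Linux encoding.
inline DecodedStatus decodeWaitStatus(int status) {
  assert(status >= 0);  // -1 means the status itself could not be obtained
  return DecodedStatus{(status >> 8) & 0xff, status & 0x7f};
}

// Print one decoded status on a line of its own.
inline void printDecoded(int status) {
  const DecodedStatus d = decodeWaitStatus(status);
  std::printf("status %d -> exit code %d, signal %d\n", status, d.exitCode, d.signal);
}
```

Under that assumption, 256 corresponds to exit code 1 and 16640 to exit code 65, both with no terminating signal. What a given exit code means is defined by the program that produced it and is not decoded here.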

@vciulli
Contributor

vciulli commented Sep 13, 2013

-1

@ktf
Contributor

ktf commented Sep 13, 2013

Ok, I'll move this to pre5.

@slava77
Contributor

slava77 commented Sep 13, 2013

@vciulli
Hi Vitaliano,

How do you get it to fail?
I tried "runTheMatrix.py -l 517.0,518.0,520.0,521.0,522.0,539.0,540.0"
all passed, both in the original CMSSW_7_0_X_2013-09-11-1400 and that with #612 added

@Dr15Jones
Contributor

I suggest trying to run valgrind on this. Perhaps there is an uninitialized value and based on the memory configuration you get a different value which can cause it to fail.

@ktf
Contributor

ktf commented Sep 13, 2013

@nclopezo, can you add a bit in the jenkins pull request tests which runs
valgrind on some given runTheMatrix workflow? Thanks.

Ciao,

Giulio

Office: 6-2-25 CERN
Tel: +41227671799
GitHub: http://github.com/ktf/

@slava77
Contributor

slava77 commented Sep 13, 2013

this comes out of
runTheMatrix.py --command '-n 10 --prefix "time valgrind --tool=memcheck --suppressions=$CMSSW_RELEASE_BASE/src/Utilities/ReleaseScripts/data/cms-valgrind-memcheck.supp --num-callers=20 --xml=yes --xml-file=valgrind-%p.xml " ' -l 517.0

Overlap
Source and destination overlap in memcpy(0xffeff8520, 0xffeff8521, 9)
0x480B5C3 /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/external/valgrind/3.9.0-13418/lib/valgrind/vgpreload_memcheck-amd64-linux.so memcpy
0x147E4C23 /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/external/gcc/4.8.1/lib64/libgfortran.so.3.0.0 formatted_transfer /build/davidlt/build-BOOTSTRAP_slc5_amd64_gcc481/b/BUILD/slc5_amd64_gcc481/external/gcc/4.8.1/gcc-gcc-4_8-branch-199526/obj/x86_64-redhat-linux-gnu/libgfortran/../../../libgfortran/io transfer.c 1782
0x2F1C7276 /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/cms/cmssw/CMSSW_7_0_X_2013-09-11-1400/lib/slc5_amd64_gcc481/libGeneratorInterfacePythia6Interface.so pyslha_ /build/davidlt/481all/a/BUILD/slc5_amd64_gcc481/external/pythia6/426/pythia6/426/src pyslha.f 306
0x2F11C8A2 /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/cms/cmssw/CMSSW_7_0_X_2013-09-11-1400/lib/slc5_amd64_gcc481/libGeneratorInterfacePythia6Interface.so gen::Pythia6Service::setSLHAFromHeader(std::vector<std::string, std::allocator<std::string> > const&)
0x2E6DA28D /afs/cern.ch/cms/sw/ReleaseCandidates/vol1/slc5_amd64_gcc481/cms/cmssw/CMSSW_7_0_X_2013-09-11-1400/lib/slc5_amd64_gcc481/pluginGeneratorInterfacePythia6Filters.so gen::Pythia6Hadronizer::initializeForExternalPartons()
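
For readers unfamiliar with this class of valgrind report: `memcpy` has undefined behaviour when the source and destination ranges share bytes, while `memmove` is specified to handle that case. The sketch below reproduces the same shape of call (destination one byte before the source, 9 bytes) in isolation. It is illustrative only, with hypothetical helper names, and says nothing about whether the libgfortran call reported above is actually harmful.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// True when [dst, dst+n) and [src, src+n) share at least one byte.
inline bool rangesOverlap(const void* dst, const void* src, std::size_t n) {
  const auto d = reinterpret_cast<std::uintptr_t>(dst);
  const auto s = reinterpret_cast<std::uintptr_t>(src);
  return n != 0 && d < s + n && s < d + n;
}

// Copy n bytes from buf+1 to buf, i.e. shift the data left by one byte.
// The two ranges overlap whenever n > 1, so memmove is used, not memcpy.
inline void shiftLeft(char* buf, std::size_t n) {
  assert(buf != nullptr);
  std::memmove(buf, buf + 1, n);
}
```

With a buffer holding "0123456789", `shiftLeft(buf, 9)` leaves "1234567899": the nine bytes starting at index 1 move down by one and the last byte is untouched.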

@@ -131,7 +131,7 @@ class FlavorHistoryEvent {
   const_reverse_iterator rend() const { return histories_.rend(); }
   // here is the proper mutable interface... this is done so that the cache is
   // set by us, not the user
-  void push_back( value_type v ) { cached_ = false; histories_.push_back(v); }
+  void push_back( const value_type& v ) { cached_ = false; histories_.push_back(v); }
Member Author

I have a hunch that this is the change that caused the error.

Member Author

Turns out not to be the case.
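
For context on why the checker flags signatures like the one in this hunk: passing an lvalue by value copies it into the parameter, and `push_back` then copies it again into the container, whereas a `const&` parameter leaves only the second copy. A minimal sketch that counts the copies is below; `Tracked`, `HistoryList` and `copiesFor` are hypothetical stand-ins, not the real `FlavorHistoryEvent` types.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a large value_type; counts copy constructions.
struct Tracked {
  std::array<double, 64> payload{};
  Tracked() = default;
  Tracked(const Tracked& other) : payload(other.payload) { ++copyCount(); }
  static int& copyCount() { static int n = 0; return n; }
};

// Hypothetical container mirroring the two push_back signatures in the diff.
class HistoryList {
public:
  HistoryList() { items_.reserve(16); }  // avoid reallocation copies in the count
  void pushBackByValue(Tracked v) { cached_ = false; items_.push_back(v); }
  void pushBackByConstRef(const Tracked& v) { cached_ = false; items_.push_back(v); }
  std::size_t size() const { return items_.size(); }
  bool cached() const { return cached_; }
private:
  bool cached_ = false;
  std::vector<Tracked> items_;
};

// Number of copy constructions triggered by one push of an lvalue.
inline int copiesFor(bool byValue) {
  HistoryList list;
  Tracked t;
  Tracked::copyCount() = 0;
  if (byValue) list.pushBackByValue(t); else list.pushBackByConstRef(t);
  assert(list.size() == 1);
  return Tracked::copyCount();
}
```

Here `copiesFor(true)` reports two copies and `copiesFor(false)` one. The change is behaviour-preserving for callers, which is consistent with the follow-up comment that it did not cause the failure.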

@cmsbuild
Contributor

Pull request #612 was updated. @Dr15Jones, @mulhearn, @thspeer, @slava77, @vadler, @ktf can you please check and sign again.

@gartung
Member Author

gartung commented Sep 13, 2013

Appears to be broken even in CMSSW_7_0_0_pre3. Note that the failure here is that step2 cannot find the ROOT file from step1. I ran the command for step1 and it dies without producing any ROOT file.

 runTheMatrix.py -l 517
processing  relval_standard
request for INPUT for  None
processing  relval_highstats
request for INPUT for  None
processing  relval_pileup
request for INPUT for  None
processing  relval_generator
request for INPUT for  None
processing  relval_production
request for INPUT for  None
processing  relval_ged
request for INPUT for  None
ignoring relval_upgrade from default matrix
processing  relval_identity
request for INPUT for  None
Running in 4 thread(s)
Preparing to run 517.0 QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN
# in: /storage/local/data1/gartung/CMSSW_7_0_0_pre3 going to execute cd 517.0_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN
 cmsDriver.py Hadronizer_MgmMatchTuneZ2star_8TeV_madgraph_tauola_cff  --conditions auto:startup --filein lhe:5475 -s GEN,VALIDATION:genvalid_all --datatier GEN --relval 1000000,20000 -n 10 --eventcontent RAWSIM --fileout file:step1.root  > step1_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN.log  2>&1
ERROR executing  cd 517.0_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN; cmsDriver.py Hadronizer_MgmMatchTuneZ2star_8TeV_madgraph_tauola_cff  --conditions auto:startup --filein lhe:5475 -s GEN,VALIDATION:genvalid_all --datatier GEN --relval 1000000,20000 -n 10 --eventcontent RAWSIM --fileout file:step1.root  > step1_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN.log  2>&1;  ret= 18688
517.0_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN Step0-FAILED Step1-NOTRUN  - time date Fri Sep 13 12:59:04 2013-date Fri Sep 13 12:58:16 2013; exit: 18688 0
0 0 tests passed, 1 0 failed
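
The `ret= 18688` reported in the log above is too large to be a plain exit code. A minimal sketch of how to read it, assuming the value is a raw POSIX wait status (the form `os.system` returns on Linux); `decode_wait_status` is a hypothetical helper, not part of runTheMatrix.py:

```python
def decode_wait_status(status):
    """Split a raw POSIX wait status into (exit_code, signal_number).

    Assumption: status uses the usual Linux layout, with the exit code
    in the high byte and the terminating signal in the low 7 bits.
    """
    exit_code = (status >> 8) & 0xFF  # high byte: exit code of the process
    signal_number = status & 0x7F     # low 7 bits: signal, 0 if none
    return exit_code, signal_number


exit_code, signal_number = decode_wait_status(18688)
print(exit_code, signal_number)  # prints "73 0"
```

Under that assumption, step1 exited on its own with code 73 rather than being killed by a signal.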

Addendum:

This happens at Fermilab because cmsDriver fails silently: it does not find the input for "--filein lhe:5475" and leaves the LHESource fileNames empty.

python -i Hadronizer_MgmMatchTuneZ2star_8TeV_madgraph_tauola_cff_GEN_VALIDATION.py
globaltag = PRE_ST62_V8::All

print process.source
cms.Source("LHESource",
fileNames = cms.untracked.vstring()
)
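
Since cmsDriver is reported to fail silently here, an explicit check on the file list would surface the problem before step1 runs. A minimal sketch in plain Python so that it runs without CMSSW; `require_input_files` and the example file name are hypothetical and not part of cmsDriver.py:

```python
def require_input_files(file_names, source_label="LHESource"):
    """Return the non-blank file names, or raise if there are none.

    Hypothetical guard: turns an empty input list into a loud error
    instead of letting the job start with nothing to read.
    """
    files = [name for name in file_names if name.strip()]
    if not files:
        raise RuntimeError(
            f"{source_label} has no input files; the --filein lookup may have failed"
        )
    return files


# A populated list passes through unchanged.
print(require_input_files(["file:events.lhe"]))

# An empty list, as in the session above, is rejected.
try:
    require_input_files([])
except RuntimeError as err:
    print("rejected:", err)
```

In a real configuration the argument would come from `process.source.fileNames`, assuming it converts cleanly to a list of strings.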

@vciulli
Contributor

vciulli commented Sep 13, 2013

Everything works fine for me on CMSSW_7_0_X_2013-09-13-1400.
Sorry, my previous test was based on the slc5_amd64_gcc472 build.
If pre5 is going to be on slc5_amd64_gcc481, then both the previous and the latest version of this pull request worked for me.

@nclopezo
Contributor

Hi all,

I added an option to the Jenkins tests that runs valgrind for a given workflow. You can see the outputs of all the tests here:

https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/525

The valgrind results for workflow 517.0 are in the file Valgrind.xml here:

https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/525/valgrindResults-517.0/517.0_QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+QCD_Ht-100To250_TuneZ2star_8TeV_madgraph-tauola+HARVGEN/

I can run valgrind on Jenkins in case you need it again.

@ktf
Contributor

ktf commented Sep 17, 2013

@vciulli @vadler @mulhearn are you ok now with this? Can you please sign off?

@mulhearn
Contributor

+1

@vciulli
Contributor

vciulli commented Sep 17, 2013

+1

On Sep 17, 2013, at 10:02, Giulio Eulisse wrote:

@vciulli @vadler @mulhearn are you ok now with this? Can you please sign off?

@vadler

vadler commented Sep 17, 2013

+1
I did not really test it again (crappy internet here), but nothing changed for AT compared to my former signature

@cmsbuild
Contributor

This pull request is fully signed and will be integrated in one of the next IBs unless it is changed or it breaks tests. @ktf can you please take care of it?

ktf added a commit that referenced this pull request Sep 19, 2013
Misc cleanups -- Cleanup large object passed by value
@ktf ktf merged commit 07bfc13 into cms-sw:CMSSW_7_0_X Sep 19, 2013
fwyzard added a commit to fwyzard/cmssw that referenced this pull request Apr 6, 2021
Use std::clamp(...) in device code now that CUDA supports c++17.
Name reused constants in the vertex fitting and splitting.