DAQ file locking compatibility fix for CC 7.4 #22215

Merged
merged 2 commits into cms-sw:CMSSW_10_0_X from fix-locking-10X
Feb 20, 2018

Conversation

smorovic
Contributor

@smorovic smorovic commented Feb 14, 2018

In CERN CentOS 7.4, file locking on NFS (v4) was observed to behave differently: modifications to the locked file are not properly synchronized over NFS between client hosts.
The workaround, which has been proven to fix the problem in production DAQ, is to open another file descriptor and file stream for the same lock file within the critical section of the file locking code.

This modification is necessary for functional HLT processing on hosts with CC 7.4.

The problem was observed as an inconsistent file or LS counter being read by
processes on different FUs. The fix consists of opening another file within
the critical section of the file locking code.
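
For readers unfamiliar with the pattern, the workaround described above can be sketched roughly as follows. This is a minimal illustration and not the code in EventFilter/Utilities: the function name `incrementLockedCounter`, the plain-text counter format, and the use of a whole-file `fcntl()` write lock are all assumptions made for the example.

```cpp
#include <cstdio>
#include <fcntl.h>
#include <string>
#include <unistd.h>

// Hypothetical helper (not the CMSSW implementation): increments a counter
// stored as text in `path` under an exclusive lock and returns the new value,
// or -1 on error.
long incrementLockedCounter(const std::string& path) {
  // First descriptor: used only to take and release the lock.
  int lockFd = open(path.c_str(), O_RDWR | O_CREAT, 0644);
  if (lockFd == -1)
    return -1;

  struct flock lk {};
  lk.l_type = F_WRLCK;
  lk.l_whence = SEEK_SET;
  lk.l_start = 0;
  lk.l_len = 0;  // 0 means "lock the whole file"
  if (fcntl(lockFd, F_SETLKW, &lk) == -1) {  // blocks until the lock is granted
    close(lockFd);
    return -1;
  }

  long value = -1;
  // Workaround pattern: inside the critical section, open a second
  // descriptor and stream for the same file and do all I/O through it.
  int dataFd = open(path.c_str(), O_RDWR);
  if (dataFd != -1) {
    FILE* stream = fdopen(dataFd, "r+");
    if (stream != nullptr) {
      long current = 0;
      if (fscanf(stream, "%ld", &current) != 1)
        current = 0;  // an empty or unreadable file counts as 0
      rewind(stream);  // reposition before switching from reading to writing
      bool ok = ftruncate(dataFd, 0) == 0 &&
                fprintf(stream, "%ld\n", current + 1) > 0 &&
                fflush(stream) == 0 && fsync(dataFd) == 0;
      if (ok)
        value = current + 1;
      // With POSIX record locks, closing any descriptor for the file drops
      // this process's locks on it, so this close is the last step of the
      // critical section, after the data has been flushed.
      fclose(stream);  // also closes dataFd
    } else {
      close(dataFd);
    }
  }

  lk.l_type = F_UNLCK;
  fcntl(lockFd, F_SETLK, &lk);  // explicit unlock on the locking descriptor
  close(lockFd);
  return value;
}
```

Called repeatedly on a fresh file, the helper returns 1, 2, 3, and so on. The sketch shows only the shape of the pattern: the lock is taken on one descriptor, while the reads and writes go through a second descriptor that is opened and closed inside the locked region.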
@cmsbuild
Contributor

A new Pull Request was created by @smorovic (Srecko Morovic) for CMSSW_10_0_X.

It involves the following packages:

EventFilter/Utilities

@mommsen, @perrotta, @cmsbuild, @emeschi, @slava77 can you please review it and eventually sign? Thanks.
@Martin-Grunewald this is something you requested to watch as well.
@davidlange6, @slava77, @fabiocos you are the release manager for this.

cms-bot commands are listed here

@perrotta
Contributor

please test

@cmsbuild
Contributor

cmsbuild commented Feb 14, 2018

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/26067/console Started: 2018/02/14 17:07

@perrotta
Contributor

perrotta commented Feb 14, 2018

@smorovic : please submit to the master branch before backporting to 10_0_X

@smorovic
Contributor Author

@perrotta, done: submitted to master as #22217.

@perrotta
Contributor

perrotta commented Feb 14, 2018 via email

@perrotta
Contributor

abort test

@cmsbuild
Contributor

Jenkins tests are aborted.

@cmsbuild
Contributor

Pull request #22215 was updated. @mommsen, @perrotta, @cmsbuild, @emeschi, @slava77 can you please check and sign again.

@cmsbuild
Contributor

Comparison job queued.

@cmsbuild
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22215/26067/summary.html

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /build/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/JR-comparison/PR-22215/1325.7_TTbar_13_94XNanoAODINPUT+TTbar_13_94XNanoAODINPUT+NANOEDMMC2017+HARVESTNANOAODMC2017

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 27
  • DQMHistoTests: Total histograms compared: 2469361
  • DQMHistoTests: Total failures: 1
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2469191
  • DQMHistoTests: Total skipped: 169
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.809999999801 KiB( 22 files compared)
  • Checked 111 log files, 9 edm output root files, 27 DQM output files

@slava77
Contributor

slava77 commented Feb 16, 2018

+1

for #22215 412822a

@emeschi
Contributor

emeschi commented Feb 19, 2018

+1

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next CMSSW_10_0_X IBs (tests are also fine) and once validation in the development release cycle CMSSW_10_1_X is complete. This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

@fabiocos
Contributor

+1

@cmsbuild cmsbuild merged commit 5e54d3b into cms-sw:CMSSW_10_0_X Feb 20, 2018
@smorovic smorovic deleted the fix-locking-10X branch February 13, 2019 11:52