Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add per-site Xrootd statistics to the HTCondor monitoring. #20143

Merged
merged 3 commits into from Sep 6, 2017

Conversation

bbockelm
Copy link
Contributor

Per popular demand, this will allow us to see per-site bytes read and read time, aggregated by site (as identified by the remote server; may be different from the CMS site name).

@osschar

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @bbockelm (Brian Bockelman) for master.

It involves the following packages:

FWCore/Framework
FWCore/Services
Utilities/XrdAdaptor

@cmsbuild, @smuzaffar, @Dr15Jones can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @wddgit this is something you requested to watch as well.
@davidlange6 you are the release manager for this.

cms-bot commands are listed here

@cmsbuild
Copy link
Contributor

-code-checks

Logs: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pr-code-checks/PR-20143/73

Code check has found code style and quality issues which could be resolved by applying a patch in https://cmssdt.cern.ch/SDT/jenkins-artifacts/pr-code-checks/PR-20143/73/git-diff.patch

In future, you can run scram build code-checks to apply code checks

@davidlange6
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 12, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/22249/console Started: 2017/08/12 15:36

@cmsbuild
Copy link
Contributor

@cmsbuild
Copy link
Contributor

Comparison job queued.

@cmsbuild
Copy link
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-20143/22249/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 25
  • DQMHistoTests: Total histograms compared: 2600817
  • DQMHistoTests: Total failures: 250
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2600388
  • DQMHistoTests: Total skipped: 179
  • DQMHistoTests: Total Missing objects: 0
  • Checked 103 log files, 14 edm output root files, 25 DQM output files

Per popular demand, this will allow us to see per-site bytes read and
read time, aggregated by site (as identified by the remote server;
may be different from the CMS site name).
@cmsbuild
Copy link
Contributor

Pull request #20143 was updated. @cmsbuild, @smuzaffar, @Dr15Jones can you please check and sign again.

@cmsbuild
Copy link
Contributor

// Returns a mapping of "site name" to transfer statistics. The "site name" is
// as self-identified by the Xrootd host; may not necessarily match up with the
// "CMS site name".
std::map<std::string, CondorIOStats> condorUpdate();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A std::vector<std::pair<std::string, CondorIOStats> would be more memory efficient and probably faster to fill.

XrdSiteStatisticsInformation *instance = XrdSiteStatisticsInformation::getInstance();
if (!instance) {return result;}

std::lock_guard<std::mutex> lock(instance->m_mutex);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For future: exposing a lock is a poor interface. We should rethink this service.

@@ -23,7 +23,8 @@ namespace edm {
"AdaptorConfig",
"SiteLocalConfigService",
"StatisticsSenderService",
"CondorStatusService"};
"CondorStatusService",
"XrdAdaptor::XrdStatisticsService"};
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not overly happy about adding more default services to jobs where the framework itself does not need those services to operate.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hi @Dr15Jones - perhaps you need two lists here - one for the things the FWK needs and one for the things that CMS wants by default?

@Dr15Jones
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 5, 2017

The code-checks are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 5, 2017

Pull request #20143 was updated. @cmsbuild, @smuzaffar, @Dr15Jones can you please check and sign again.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 5, 2017

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/PR-20143/511

@bbockelm
Copy link
Contributor Author

bbockelm commented Sep 6, 2017

@Dr15Jones - I tackled the item that had a concrete action item; however, it's not clear from your other comments what you'd like me to do (or if they're just notes for the future).

@smuzaffar smuzaffar modified the milestones: CMSSW_9_4_X, CMSSW_9_3_X Sep 6, 2017
@davidlange6
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 6, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/22745/console Started: 2017/09/06 14:12

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 6, 2017

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 6, 2017

Comparison job queued.

@Dr15Jones
Copy link
Contributor

+1
We need to keep an eye on this since we are now causing all test configurations of cmsRun to force loading of xrootd client.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 6, 2017

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar (and backports should be raised in the release meeting by the corresponding L2)

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 6, 2017

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-20143/22745/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 26
  • DQMHistoTests: Total histograms compared: 2656498
  • DQMHistoTests: Total failures: 205
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2656104
  • DQMHistoTests: Total skipped: 189
  • DQMHistoTests: Total Missing objects: 0
  • Checked 107 log files, 14 edm output root files, 26 DQM output files

@davidlange6
Copy link
Contributor

+1

@cmsbuild cmsbuild merged commit c72da29 into cms-sw:master Sep 6, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants