Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
Already on GitHub? Sign in to your account
Store performance data not working #4941
Comments
|
If you check the pcp archives from the command line do they show collected data? |
petervo
added
the
question
label
Aug 29, 2016
JGjorgji
commented
Aug 29, 2016
|
Good question, i looked into the logs for the logger and found a permissions denied error for the config. These were the permissions:
I changed the SELinux context of the default config to be the same as the other ones, let's wait and see. |
JGjorgji
commented
Aug 30, 2016
|
Apparently not but i noticed there were errors in the logs, here they are: http://paste.fedoraproject.org/417455/77100147/ |
|
You may want to check the permissions on /var/lib/pcp/config/pmlogconf/tools/cockpit. Since there were selinux issues with the pmlogger files, there might be again here. |
|
So it looks like for some reason in cockpit we are getting Unknown metric name errors when pminto reports those metrics are available. http://paste.fedoraproject.org/417481/14725785/ There was also another error:
That was fixed by
But the missing metrics problem remains. @mvollmer or @fche any ideas about what might cause this? |
fche
commented
Aug 31, 2016
|
Those libpcp segvs should never happen. The pcp metrics being requested should always be available. Something peculiar is going on. Does this problem occur reproducibly on multiple machines, or is it a one-off? What does |
JGjorgji
commented
Sep 1, 2016
|
I just tried this on a fresh Centos 7 VM and it's the same result. I did try restarting the pmcd service but it didn't help, both before resolving some of the issues above and after. I did not try any of the fixes mentioned before on the VM, just installing the packages and turning on logging, here are the results from /usr/bin/pcp . Original physical host:
Test VM:
|
JGjorgji
commented
Sep 5, 2016
•
|
Any thoughts on this? Can you try replicating this by installing a fresh Centos 7 wtih only the packages i installed? Maybe it's some configuration provided by the main package? Any other info you would find helpful? |
fche
commented
Sep 5, 2016
•
|
Short of reproducing the problem here on a new VM, what comes to mind is that maybe the way |
It's been some time and I might misremember, but when accessing an archive, only the metrics that are actually stored in the archive are "known". So this error might mean that the archive is empty. |
mvollmer
self-assigned this
Sep 9, 2016
|
I had a similar situation on my development VM, where the machinery all seemed to be working okay, but no actual data would be collected. The reason was that Cockpit installs the file In fact, none of the metrics in The reason seems to be that
Removing
I don't know how I ended up with a broken |
fche
commented
Sep 9, 2016
|
FWIW, service-pmmgr always deletes and recomputes pmlogger configuration files. The cron-job-based service-pmlogger is less predictable. @natoscott |
|
Thanks! A system crash might indeed be the reason how I got an empty config.default file. I am pretty harsh with that VM and force reboot it all the time. |
I see. How do we decide which one to enable/start? Right now the UI button enables/starts |
fche
commented
Sep 9, 2016
•
Way back when y'all were first building out the pcp bridge, we talked about it. Not quite sure why we didn't go that way. In principle, it is only:
|
natoscott
commented
Sep 11, 2016
|
@mvollmer to balance things a little (@fche is pmmgr's author so may be slightly biased :) ...
IOW, YMMV - all code has bugs & trade-offs of course. There is no one approach to rule them all. Also, pmmgr is feature-rich - you may wish to expose other functionality it offers too - host discovery via Avahi, and support for running other helper daemons like pcp2graphite(1), pcp2influxdb(1), etc. Hence, I'd recommend a dual approach - where existing setups can be preserved & supported, rather than a breaking-switch to pmmgr (perhaps have new UI options for pmmgr, alongside existing pmlogger and pmie services?) - and people can opt-in to the more resource-heavy (i.e. permanent daemon) pmmgr use if they wish. Not to ignore the original problem, I'll look into fixing up that empty file issue this week too. |
fche
commented
Sep 12, 2016
I don't know what you are referring to. The config files that pmmgr-invoked pmlogger / pmie use are the exact same format that shell-invoked pmlogger / pmie use, because they are the same programs. Their output files are the same format because they are the same programs.
Please identify the documentation where it spells out under what conditions pmlogconf is rerun by service-pmlogger's scripts (pmlogger_check ?). |
natoscott
commented
Sep 12, 2016
|
| be aware switching to pmmgr means incompatible file formats with existing pmlogger and pmie
Take time, read carefully. |
natoscott
added a commit
to performancecopilot/pcp
that referenced
this issue
Sep 12, 2016
natoscott
commented
Sep 12, 2016
|
@JGjorgji @mvollmer the empty file issue is resolved in PCP now and the fix will be in pcp-3.11.5 onward (3.11.5 is scheduled within a couple of weeks). commit 8e9f44151e47abacd93dbb75b843996d50458652
|
Yeah, I remember. I think at the time pmmgr was still pretty new, maybe not even ready yet, and I made a note to look at it more closely once I got the basics working. Of course, I never did that... mostly because pmlogger.service was working well enough after all. I really don't want to decide between pmlogger and pmmgr, given that you guys don't agree. Can we hide that choice from Cockpit and it's users, while still making it accessible for experts? Maybe via a |
A powerful, PCP specific UI would be awesome. But that's a different topic altogether and we would need to look at ManageIQ etc as well, I guess. We don't want to compete against ourselves, really. :-) |
natoscott
commented
Sep 12, 2016
+1 - probably best to consider them completely orthogonal - the direct pmlogger/pmie use will always be available with PCP, and pmmgr is there for people who need the additional extras (and compromises to achive those) that it offers. If Cockpit could offer up a rich UI for optional pmmgr use someday, I'm all for that.
BTW, we do get requests for features sometimes that Cockpit could probably help with - e.g. the Red Hat customer support folk would like an easy way to query and set the default pmlogger recording interval for a given site. Being able to see which pmie(1) rules are enabled, and enable/disable individual rules via a clean UI is something we get occasionally prodded for too. |
The thing is, none of the additional extras of pmmgr are visible in the Cockpit UI, so we would allow people to make an obscure choice without any visible effect, no?
Interesting. Should we start writing down some use cases and make some mock ups? What you mention doesn't seem difficult, we only have to find nice place in the UI for it... |
JGjorgji
commented
Sep 12, 2016
|
So i reinstalled this and the issue with the configuration file is missing (still needed the permission fix on /var/lib/pcp/config/pmlogger/config.default ). Performance metrics are still no shown on the graphs and cockpit-pcp segfaults. Though now the errors about missing metrics are gone. |
natoscott
commented
Sep 13, 2016
|
| Interesting. Should we start writing down some use cases and make some mock ups? I'll steer some of the interested customer support folk towards this issue, see if that want to chime in or start a discussion elsewhere directly with you. Thanks @mvollmer |
Ouch. Is the crash repeatable? E.g., does it happen everytime you log into Cockpit? Is there any chance that you can get a stack trace of the crash? Can you make the core dump available to us somewhow? |
JGjorgji
commented
Sep 15, 2016
|
It crashes every time but it's after i select a longer time period to review (say 1 week). For which service would coredumps need to be enabled? The main cockpit one? |
stefwalter
removed
the
question
label
Nov 30, 2016
JGjorgji
commented
Oct 15, 2017
|
Tried this again today and it's still happening, same server cockpit 148, pcp 3.11.8. |
JGjorgji commentedAug 27, 2016
It still only collects data when I'm logged in.
OS:
Centos 7 with latest updates as of this moment
Cockpit version:
cockpit-pcp-0.114-2.el7.centos.x86_64
PCP version:
pcp-3.10.6-2.el7.x86_64
I only installed the following packages since i didn't want docker and the default package was pulling that in:
cockpit-pcp
cockpit-bridge
cockpit-shell
cockpit-ws