New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Service origin-master-controllers crash when service systemd-journald reloads #40
Comments
@jperville The issue was on our side...:-1: |
Cheers ! |
I will test this when I return to work tomorrow morning. Thanks for the quick investigation and fix @IshentRas |
Let me know if it does work and I'll merge, upload the code... |
Hello @IshentRas, I got success with your fix. After restarting systemd-journald, the origin-master daemons come back as expected. |
Perfect, will be merged soon 👍 |
Following the release of v 1.10.26 of this cookbook, I want to submit my last issue with running this cookbook in openshift_HA mode (with external etcd).
Some context: In my environment cookbook, at some point the systemd-journald configuration gets reloaded (I enable persisting journal to disk and setup some max sizes).
The bug: Reloading systemd-journald has the repeatable effect of crashing the origin-master-controllers service which won't come back until I run chef again or manually restart the service. I still don't know if this is openshift issue or this cookbook's issue (the origin-master-* systemd units are created by this cookbook).
Here is how to reproduce, step by step:
checkout https://github.com/PerfectMemory/origin-provision-bug-demo.git
vagrant up master
This boots a Vagrant VM with a working openshift3 1.3.1 master configured:
You may need to install Vagrant, the latest chef-dk and the vagrant-berkshelf plugin to make it work.
Once the VM is provisioned, ssh into it for the rest of the reproduction steps.
vagrant ssh
terminal, tail system messages[vagrant@master ~]$ sudo systemctl restart systemd-journald.service
In the log we tailed in step 4, a message "Started Flush Journal to Persistent Storage" appears and just after that point the openshift some master services will be crashed.
The text was updated successfully, but these errors were encountered: