Process instability observed when running inside a container launched with a DaemonSet #30106

Closed
sbezverk opened this issue Aug 4, 2016 · 6 comments
Labels
area/kubelet, lifecycle/rotten, sig/node

Comments

@sbezverk (Contributor) commented Aug 4, 2016

I noticed instability when launching a container using a DaemonSet. The same container launched with a ReplicationController works 100% of the time. The problems are mostly process restarts, assorted errors, and an inability to create a named socket. It is not a config issue: as soon as I add strace to the process's command line, it starts working fine, which suggests some sort of race condition with the DaemonSet. The issue is 100% reproducible, and I am ready to offer access to the impacted test bed for further troubleshooting. Here is the error generated when the container starts under the DaemonSet:

[19202.800318] traps: ovsdb-server[24147] general protection ip:7fb18600be37 sp:7ffef3da0d00 error:0 in libc-2.17.so[7fb185fd5000+1b7000]

I am on CentOS 7.2, Kubernetes 1.3.4-dirty (I needed this version for another issue that is fixed in this release). Please let me know if somebody would be interested to check it out.
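For reference, the strace workaround looks roughly like this (the trace flags and output path below are illustrative, not the exact command line I used):

# Illustrative only: prefixing the entrypoint with strace is enough to make
# ovsdb-server start cleanly, which points at a startup timing/race issue
# rather than a configuration problem.
strace -f -o /tmp/ovsdb-server.trace \
    /usr/sbin/ovsdb-server /etc/openvswitch/conf.db \
    -vconsole:emer -vsyslog:err \
    --remote=punix:/run/openvswitch/db.sock \
    -vfile:info --log-file=/var/log/kolla/openvswitch/ovsdb-server.log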

@k8s-github-robot added the area/kubelet and sig/node labels Aug 4, 2016
@dchen1107 (Member) commented:

@sbezverk If possible, could you please provide the following information:

  1. Run your container through the rc, then run kubectl get -o yaml pods
  2. Run your container as a daemonset, then run kubectl get -o yaml pods
  3. Run your container as a daemonset, then run kubectl logs --previous <pod-id> <container> to retrieve the previously terminated container's log
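For example (the pod and container names below are hypothetical placeholders):

# 1. With the ReplicationController in place:
kubectl get -o yaml pods

# 2. With the DaemonSet in place:
kubectl get -o yaml pods

# 3. Pull the log of the previously terminated container:
kubectl logs --previous ovsdb-server-abc12 ovsdb-server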

@sbezverk (Contributor, Author) commented Aug 5, 2016

@dchen1107 It turned out that the replication controller also shows a similar issue :(
Here is the link to the requested information:
Collected logs
Let me know if additional info is required.

@sbezverk (Contributor, Author) commented Aug 5, 2016

@dchen1107 Interesting observation: if I start the process inside the container with this script,
#!/bin/bash

sleep 1
/usr/sbin/ovsdb-server /etc/openvswitch/conf.db \
    -vconsole:emer -vsyslog:err \
    --remote=punix:/run/openvswitch/db.sock \
    -vfile:info --log-file=/var/log/kolla/openvswitch/ovsdb-server.log

it works perfectly on both compute nodes.
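If the problem really is a startup race around the socket path, a variant like the one below would make that dependency explicit instead of relying on a fixed one-second delay (untested sketch; it assumes the crash happens because /run/openvswitch is not ready when ovsdb-server tries to create db.sock):

#!/bin/bash
# Untested sketch: create the runtime directory for the named socket up
# front instead of waiting an arbitrary second, then exec the server so it
# remains the container's main process and the kubelet sees its exits.
mkdir -p /run/openvswitch

exec /usr/sbin/ovsdb-server /etc/openvswitch/conf.db \
    -vconsole:emer -vsyslog:err \
    --remote=punix:/run/openvswitch/db.sock \
    -vfile:info --log-file=/var/log/kolla/openvswitch/ovsdb-server.log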

@fejta-bot commented:

Issues go stale after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Dec 16, 2017
@fejta-bot commented:

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Jan 15, 2018
@fejta-bot commented:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/close
