-
Notifications
You must be signed in to change notification settings - Fork 92
investigate: node registrator doesn't play well with slaves that die and come back #778
Comments
@ravlir do you happen to know about the conditions in which the slave came back? for example, did it come back up but with a different slaveID or the same slaveID as before? |
the previously reported scenario involved mesos slave registering back with a different slaveId: I0129 02:30:33.423883 2985 slave.cpp:859] Registered with master master@1.1.1.1:5050; given slave ID 20160129-022011-3340029194-5050-31106-S5 |
some more details: I0302 22:11:20.818402 1 nodecontroller.go:450] Deleting node (no longer present in cloud provider): xyz.com https://github.com/mesosphere/kubernetes/blob/v0.7.2-v1.1.5/pkg/controller/node/nodecontroller.go#L449 |
[EDIT] Thanks for investigating this further. What should happen in this On Wed, Mar 2, 2016 at 9:46 PM, ravilr notifications@github.com wrote:
|
yes, the mesos slave node which come back up with different slaveID, never seems to be registered back as k8s api.node object, unless the k8sm scheduler is restarted. but, once the scheduler is restarted, the node appears in the k8s node registry. sequence of steps to repro:
|
I found the problem, a bug in the queue/ package. Will push a fix shortly On Thu, Mar 3, 2016 at 3:25 PM, ravilr notifications@github.com wrote:
|
xref kubernetes/kubernetes#22500 On Thu, Mar 3, 2016 at 11:08 PM, James DeFelice james@mesosphere.io wrote:
|
reported here: #768 (comment)
/cc @ravlir
The text was updated successfully, but these errors were encountered: