Pod created from on-disk manifest not in API #14992

Closed
caseydavenport opened this issue Oct 2, 2015 · 9 comments
Labels
sig/node Categorizes an issue or PR as relevant to SIG Node.

Comments

@caseydavenport
Member

When deploying a pod using an on-disk kubelet manifest (a la /etc/kubernetes/manifests), it appears that the network plugin's setUpPod is notified of the new pod before the apiserver is.

The network plugin API passes limited information about the pod, with the expectation that the network plugin will look up the pod object in the API if needed. However, in the case above, the apiserver has not been informed of the existence of this pod at the time the network plugin's setUpPod is called, so the network plugin is unable to get the information it needs from the apiserver.
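
For context, here is a minimal sketch of the lookup such a plugin performs and the failure mode described above. It uses today's client-go API for brevity (the 2015-era client differed), and the function name, pod name, and error handling are illustrative stand-ins, not actual plugin code:

```go
// Illustrative sketch only, not Calico or kubelet code.
package main

import (
	"context"
	"fmt"

	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// setUpPod receives only the namespace and name from the kubelet, so it has
// to ask the apiserver for the rest of the pod object.
func setUpPod(ctx context.Context, client kubernetes.Interface, namespace, name string) error {
	pod, err := client.CoreV1().Pods(namespace).Get(ctx, name, metav1.GetOptions{})
	if apierrors.IsNotFound(err) {
		// The failure mode described above: the kubelet already knows about
		// the static pod, but no mirror pod exists in the API yet.
		return fmt.Errorf("pod %s/%s not in apiserver yet: %w", namespace, name, err)
	}
	if err != nil {
		return err
	}
	fmt.Printf("setting up networking for %s/%s, labels=%v\n", namespace, name, pod.Labels)
	return nil
}

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)
	_ = setUpPod(context.Background(), client, "kube-system", "etcd-mynode")
}
```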

@Symmetric

FWIW, I think this issue might not be hit if #14938 were fixed. If the APIServer eventually gets informed of the pod creation, we'll keep retrying until then. The linked bug prevents the failed pod from being retried. (There may still be a pathological case if the APIServer update takes a long time, but I think the mainline case would be fixed.)

@thockin
Member

thockin commented Oct 6, 2015

@yujuhong if the net plugin fails, will a mirror pod be created?

@yujuhong
Contributor

yujuhong commented Oct 6, 2015

@yujuhong if the net plugin fails, will a mirror pod be created?

No, syncPod will return upon error.

@yujuhong
Contributor

yujuhong commented Oct 6, 2015

Static pods (pods created from on-disk manifests) are designed to work regardless of the apiserver's availability. If I understand correctly, with this network plugin the kubelet would not be able to run static pods when the apiserver is unreachable?

What information do you need from the pod object? I'd like to make sure that mirror pods actually contain the information you want.

@Symmetric

That's correct: with the way the network plugin interface is currently designed, our plugin needs to hit the apiserver before any pods can be started.

We use the Label, Annotation, Name, and Namespace metadata. We also use the Spec.Containers.Ports field in some cases.

In general, I think that plugins could require any pod state, but if there is some piece of the Spec that will be hard to set in this case, we should raise it with SIG Network to see if anyone else will be inconvenienced at this point.
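
To make that field list concrete, here is a small sketch of pulling exactly those pieces out of a v1.Pod. The struct and function names are made up for illustration; this is not Calico's code:

```go
// Sketch of the pod fields mentioned above, read from a v1.Pod object.
package main

import (
	"fmt"

	v1 "k8s.io/api/core/v1"
)

// podNetworkInputs is a hypothetical view of the metadata a plugin consumes.
type podNetworkInputs struct {
	Name, Namespace string
	Labels          map[string]string
	Annotations     map[string]string
	Ports           []v1.ContainerPort
}

// extractInputs copies the Name, Namespace, Labels, Annotations, and
// Spec.Containers[].Ports fields referenced in the comment above.
func extractInputs(pod *v1.Pod) podNetworkInputs {
	in := podNetworkInputs{
		Name:        pod.Name,
		Namespace:   pod.Namespace,
		Labels:      pod.Labels,
		Annotations: pod.Annotations,
	}
	for _, c := range pod.Spec.Containers {
		in.Ports = append(in.Ports, c.Ports...)
	}
	return in
}

func main() {
	pod := &v1.Pod{}
	pod.Name, pod.Namespace = "demo", "default"
	fmt.Printf("%+v\n", extractInputs(pod))
}
```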

@thockin
Member

thockin commented Oct 6, 2015

In this case the user/admin has installed a network plugin that needs to talk to the apiserver. There's nothing that can be done if the user (mistakenly) tries to do this in a no-apiserver installation. It has to fail.

The interesting part is how we can close the loop when the apiserver doesn't (yet) know about a pod but we need the pod spec for the network plugin.

We could make the pod spec available to the plugin from the kubelet directly, either streamed on stdin or via a kubelet API. The latter doesn't really exist AFAIU, and the former has all the API versioning problems (and isn't part of CNI).

Would it be unreasonable to create a mirror pod before instantiating the pod via docker? From an intent point of view, the user intends the Pod to exist. If it fails to launch for some reason, it seems reasonable to expose it in the API, no?

@yujuhong
Contributor

yujuhong commented Oct 6, 2015

Yes, I was going to suggest creating the mirror pod first as the solution. We'll ignore the creation error (if there is any) and proceed with the rest of sync pod. This, together with #14938, should fix the problem.
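
A rough sketch of that ordering, with stand-in names rather than the real kubelet code: the mirror pod is pushed to the apiserver first, a creation failure is logged but ignored, and only then is the network plugin invoked:

```go
// Illustrative sketch of the proposed ordering; not the actual kubelet syncPod.
package main

import "log"

// kubeletStandIn groups the three steps relevant to this issue.
type kubeletStandIn struct {
	createMirrorPod func(namespace, name string) error // POST the mirror pod to the apiserver
	setUpPod        func(namespace, name string) error // network plugin hook
	startContainers func(namespace, name string) error // run the containers
}

func (kl *kubeletStandIn) syncStaticPod(namespace, name string) error {
	if err := kl.createMirrorPod(namespace, name); err != nil {
		// Non-fatal: the apiserver may be unreachable, and static pods must
		// still start. A plugin that needs the API would retry (see #14938).
		log.Printf("mirror pod creation for %s/%s failed: %v", namespace, name, err)
	}
	// By the time the network plugin runs, the apiserver normally knows about
	// the pod, so a lookup like the one in the first comment can succeed.
	if err := kl.setUpPod(namespace, name); err != nil {
		return err
	}
	return kl.startContainers(namespace, name)
}

func main() {
	kl := &kubeletStandIn{
		createMirrorPod: func(ns, n string) error { return nil },
		setUpPod:        func(ns, n string) error { return nil },
		startContainers: func(ns, n string) error { return nil },
	}
	if err := kl.syncStaticPod("kube-system", "etcd-mynode"); err != nil {
		log.Fatal(err)
	}
}
```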

@bprashanth
Contributor

I'm not suggesting this for the near term, there are probably easier ways to fix this now.

In general, I think that plugins could require any pod state, but if there is some piece of the Spec that will be hard to set in this case, we should raise it with SIG Network to see if anyone else will be inconvenienced at this point.

This is going to cause confusion. Take the example of flannel: it has a daemon half, which allocates subnets, and a plugin half, which does the plugin work. The daemon half runs off and makes decisions out of band, which causes problems.

daemon -----> private etcd
      \
       \
kube -> apiserver

I had a related PR: #13877. What it does is essentially this (the plugin doesn't talk to the server in my PR, but that's the idea):

daemon -> flannel server in a privileged pod <-> apiserver
                 ^
plugin ----------|

At this point, we are isolated from what different "flannel servers in privileged pods" need. If that pod needs to read file manifests and serve them up, it can. If the network plugin needs arbitrarily more information than it's given, it would request it from its server, not the kubelet or the apiserver.

I like this because it really streamlines network plugins. I can start the kubelet with --plugin=calico, and it would just pull and run the server with the right settings (like my PR does with flannel).
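
A sketch of that indirection from the plugin's side, with a made-up endpoint and payload purely for illustration: the plugin asks the local server pod for whatever pod state it needs, instead of talking to the apiserver itself:

```go
// Illustrative only: the URL path, port, and response shape are hypothetical.
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// podInfo is a hypothetical payload the network server might return; it could
// be sourced from the apiserver or from on-disk manifests, the plugin doesn't care.
type podInfo struct {
	Labels      map[string]string `json:"labels"`
	Annotations map[string]string `json:"annotations"`
}

// lookupPod queries the privileged "network server" pod running on the node.
func lookupPod(serverURL, namespace, name string) (*podInfo, error) {
	resp, err := http.Get(fmt.Sprintf("%s/pods/%s/%s", serverURL, namespace, name))
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("network server returned %s", resp.Status)
	}
	var info podInfo
	if err := json.NewDecoder(resp.Body).Decode(&info); err != nil {
		return nil, err
	}
	return &info, nil
}

func main() {
	// The server address would come from plugin configuration, e.g. a
	// well-known localhost port exposed by the privileged pod.
	info, err := lookupPod("http://127.0.0.1:9098", "default", "nginx")
	fmt.Println(info, err)
}
```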

@thockin
Member

thockin commented Oct 7, 2015

Maybe I am missing your point: if the network plugin needs to know some fields of the Pod in order to properly link to some external control plane, how does it get that info?

alexhersh pushed commits to alexhersh/kubernetes that referenced this issue on Nov 20, 2015
RichieEscarez pushed a commit to RichieEscarez/kubernetes that referenced this issue on Dec 4, 2015
shyamjvs pushed a commit to shyamjvs/kubernetes that referenced this issue on Dec 1, 2016
shouhong pushed a commit to shouhong/kubernetes that referenced this issue on Feb 14, 2017