Proposal on how to bring a VM into a container #2
Conversation
Looks good to me in general.
I think a binary - or this search mechanism - is the best solution we have today.
Few things:
- Please describe the complete flow from the controller to the launched VM, and how each component passes and retrieves the secret
- Let's keep numa out and focus on setting ns and cgroup
process with the uuid and the vm name in the commandline like this:

```bash
virt-launcher -kubevirt.vm.uuid 1234-5678-1234-1234 -kubevirt.vm.name testvm
```
Instead of using these flags, the ENV vars suggested below can be used.
Here we would find everything already in `/proc/<pid>/cmdline`. Then we can just look in `/environ` (like you suggested) or a specific file location, if the right secret is there.
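A minimal sketch of that idea, assuming a Linux `/proc` and using a sleeping shell as a stand-in for virt-launcher (the flag name is taken from the proposal; everything else here is illustrative):

```shell
#!/bin/sh
# Sketch: find a launcher pid by reading /proc/<pid>/cmdline directly
# instead of parsing `ps` output.
uuid="1234-5678-1234-1234"

# Stand-in for virt-launcher: a sleeping shell carrying the uuid flag
# in its argument list.
sh -c 'sleep 5' sh -kubevirt.vm.uuid "$uuid" &
target=$!
sleep 1  # let the child exec so its cmdline is populated

found=""
for f in /proc/[0-9]*/cmdline; do
  # cmdline entries are NUL-separated; translate to spaces for matching.
  if tr '\0' ' ' < "$f" 2>/dev/null | grep -q -- "-kubevirt.vm.uuid $uuid"; then
    p="${f#/proc/}"
    found="${p%/cmdline}"
  fi
done

echo "found pid: $found"
kill "$target" 2>/dev/null
```

The matching pid can then be verified further (e.g. against the secret) before its namespaces and cgroups are used.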
```xml
</domain>
```

Note that we have to specify the qemu namespace to use these features.
You mean the target namespace the qemu instance should live in?
I just mean the `xmlns` namespace; when it is not present, libvirt does not accept these qemu tags. Will clarify it.
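For illustration, a minimal sketch of where that declaration goes (the URI is libvirt's documented qemu schema namespace; the rest of the domain definition is elided):

```xml
<domain type='kvm' xmlns:qemu='http://libvirt.org/schemas/domain/qemu/1.0'>
  <!-- ... rest of the domain definition ... -->
  <qemu:commandline>
    <qemu:env name='kubevirt.io.secret' value='3098tFJoswfwkjp4'/>
  </qemu:commandline>
</domain>
```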
```xml
<qemu:commandline>
  <qemu:env name='kubevirt.io.secret' value='3098tFJoswfwkjp4'/>
</qemu:commandline>
```
What about aligning the namespace: `kubevirt.io.vm.secret`?
Hm, in contrast to the VM name and the VM uuid, the secret is not a property of the VM; that is why I did not add `vm` there, but I don't really care.
2. Introduce a shared secret between the VM target container and kubevirt
3. This secret can be unique per pod creation and only valid for a specific
   amount of time
4. The binary checks that the secret is present in the kubernetes metadata
"the kubernetes metadata file" what file are you referring to?
namespace where libvirt is running. They should be delivered via an init
container to the host via `hostDir` mount (if libvirt is running on the host),
or via an `emptyDir` mount to a container (if libvirt is running in a
container).
What about shipping it directly in the libvirtd container?
Would be ok too for the start; the disadvantage is that we would need to rebuild the libvirt container then. So far I only had to change the emulator when kubevirt introduced new features, not when we changed the libvirt configuration. It seems to be influenced more by how we integrate with kubernetes than by how we integrate with libvirt. If we ship it, for example, with the virt-handler daemon set as an init container, we would not have to roll out or restart libvirt. But both solutions would work.
Further, virt-handler controls the additional qemu commandline and environment flags, and therefore how the emulator is called. So for me it is more like a part of virt-handler, and it is more important that it is kept in sync with virt-handler than with libvirt. What do you think?
when we implement the secret properly.

### NUMA
Let's keep numa out of this for now. It deserves its own proposal, I'd say.
Sounds good to me. I will make one for numa and cpu pinning.
```bash
# Getting the pid, container-runtime independent
pid=$(ps aux | grep virt-launcher | grep -- "-kubevirt.vm.uuid 1234-5678-1234-1234" | tr -s ' ' | cut -d ' ' -f2)
```
We can also set the secret on the launcher pod using an environment variable. Then we can scan all `/proc/*/environ` files, each of which is already a key-value map, and find the correct process.
Being picky: It would probably not work with rkt stage2-lkvm runtime, because there the launcher would be run in a VM.
> Being picky: It would probably not work with rkt stage2-lkvm runtime, because there the launcher would be run in a VM.

Right.

> We can also set the secret on the launcher pod using an environment variable. Then we can scan all /proc/*/environ files which is already a map, and find the correct process.

I guess you mean just scanning the environ file of the target virt-launcher, to make sure it is the right one.
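That check could look roughly like the following sketch, assuming a Linux `/proc` and a hypothetical `KUBEVIRT_SECRET` variable name (a sleeping process stands in for the launcher):

```shell
#!/bin/sh
# Sketch: verify a candidate pid really is the right launcher by
# checking its environ for the shared secret.
secret="3098tFJoswfwkjp4"

# Stand-in launcher process with the secret in its environment.
env KUBEVIRT_SECRET="$secret" sleep 5 &
pid=$!
sleep 1  # let the child exec so /proc/<pid>/environ is populated

verified=no
# environ entries are NUL-separated key=value pairs.
if tr '\0' '\n' < "/proc/$pid/environ" | grep -qx "KUBEVIRT_SECRET=$secret"; then
  verified=yes
fi

echo "secret verified: $verified"
kill "$pid" 2>/dev/null
```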
hook](https://libvirt.org/hooks.html#qemu) will check if the Domain XML has an
emulator tag which points to the right binary.

### Flow
Would be nice if the flow would cover
- controller part (setting the secret?)
- launcher part
- handler part
- binary part
### Running it inside a container

To run the binary from inside a container, it needs access to the host `/proc`
directory. The recommended implementation is to have a config file for the
I think it only needs access to the host `/proc` if the container it is launched in has its own pid namespace. If the surrounding container is using the host's pid namespace, then `/proc` should be the host's `/proc`.
You are right
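That is the `hostPID` pod setting; a sketch of what that could look like (the image name is illustrative, not from the proposal):

```yaml
# With hostPID, the container shares the host's pid namespace, so its
# /proc already is the host's /proc and no extra hostPath mount is needed.
apiVersion: v1
kind: Pod
metadata:
  name: virt-handler
spec:
  hostPID: true
  containers:
  - name: virt-handler
    image: kubevirt/virt-handler:latest
```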
From a security standpoint, this approach seems risky. It means that the virt-handler code can assume control of any other process on the system via a VM launch.
This is an overkill response to the desire to limit the power of virt-launcher, which we've decided should not have direct access to libvirt. We don't want a random VM controlling other VMs, either.
It seems to me that the right solution involves cooperation between the virt-launcher and virt-handler processes. virt-launcher should open a domain socket that gets passed to virt-handler, and finally to libvirt, for launching the vm. This socket should be the only means of communication between the processes.
The virt-launcher process already has the proper cgroups and namespaces set up. It should be the sole source of truth for this information.
When launching a vm, libvirt uses a fork/exec approach. It opens the socket used to manage the process, and execs either qemu, or the command as passed from the caller, such as what virt-handler passes in the wrapper script. Since domain sockets can be passed via domain sockets (really!), the launched process could pass the management socket to virt-launcher, as well as any other parameters required for the start of the virtual machine.
@fabiand forgot to hit the "submit review" button. They were pending for a few weeks now. Sorry.
Closing this according to offline discussions.