
VMs should allow being created without running #169

Closed
fabiand opened this issue Mar 24, 2017 · 17 comments

@fabiand
Member

fabiand commented Mar 24, 2017

Currently the assumption is that VMs are running as long as they are defined in the cluster. To stop them, the object needs to be removed.
This was chosen to mirror a pod's behavior.

But eventually it makes sense to allow stopped VMs in KubeVirt. The reason is that VMs are stateful, and thus their state outlives their life cycle.

My suggestion: allow VMs to be stopped, and allow them to be created in a stopped state. With such a change, KubeVirt could also act as a VM store.
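
A minimal sketch of how this could look, purely to illustrate the idea - the field names (in particular `running`) and the layout are assumptions for this proposal, not an existing KubeVirt API:

```yaml
# Hypothetical sketch: "running" and the overall layout are assumptions,
# not an agreed-upon API.
apiVersion: kubevirt.io/v1alpha1
kind: VM
metadata:
  name: testvm
spec:
  running: false   # keep the definition in the cluster, but do not start a domain
  domain:
    memory:
      unit: MB
      value: 64
```

Creating such an object would only store the VM definition; flipping `running` to `true` (or an equivalent mechanism) would then schedule and start the domain.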

@fabiand
Member Author

fabiand commented Mar 24, 2017

/cc @rmohr @stu-gott @admiyo

@rmohr
Member

rmohr commented Mar 24, 2017

I would argue that only the disks of a VM are stateful, not the VM itself. It's the same as with pods using persistent storage: the disk outlives the lifetime of the VM.

Long-term storage (that is what I would call this) is a purely administrative task, the same as with pod definitions, daemon sets, and so on. They all only stay in the cluster for as long as they have a task (something active) to fulfill.

Anyway, if that really happens, we also have to understand how etcd would react in scaled environments (we should also check the Kubernetes bugs regarding etcd scalability), where all the active watches are probably much busier providing the data. I would really want to be sure that something which is just a convenience is not hurting us.

@mykaul

mykaul commented Mar 24, 2017

This is exactly the distinction between persistent and non-persistent domains in libvirt - I remember talking to Michal about it and his wish to move to persistent domains (I've looked at it from a Lago perspective). It's not only about storage, but also about persisting the configuration of the VM.
Of course, locally storing it per host is nice, but insufficient.

@rmohr
Member

rmohr commented Mar 24, 2017

    This is exactly the distinction between persistent and non-persistent domains in libvirt - I remember talking to Michal about it and his wish to move to persistent domains (I've looked at it from a Lago perspective). It's not only about storage, but also about persisting the configuration of the VM.
    Of course, locally storing it per host is nice, but insufficient.

Yes, libvirt does extensive defaulting. Further, we have, in one way or another, to support all the special fields in the cluster-wide representation of the VM (rather than doing something like saving the valuable added defaults only on the host). Therefore, the important bit here would be more about providing a nice way of exporting the information from the cluster. I don't see the need to always keep it in.

One way to keep the relevant bits is to add something like the export header, which Kubernetes also provides for its entities. Then you only get the important information for reposting later; everything non-relevant is filtered out.
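
As a rough illustration (an assumption about what such an export could keep, in the spirit of the Kubernetes export mechanism), only the user-intent part of the object would survive, while cluster-populated fields are dropped:

```yaml
# Illustration only: an exported VM keeps the user-supplied definition and
# filters out fields the cluster fills in.
apiVersion: kubevirt.io/v1alpha1
kind: VM
metadata:
  name: testvm            # kept, needed to re-post the object later
  # uid, resourceVersion, selfLink, creationTimestamp: filtered out
spec:                     # kept: the valuable, user-provided definition
  domain:
    memory:
      unit: MB
      value: 64
# status: filtered out
```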

Just as the actual Pod or Deployment configuration is valuable, a VM configuration is valuable, so making sure that it can be retrieved is very important; but saving it for convenience does not seem so important from my perspective. KubeVirt should definitely not stand in the way of adding something like this on top of it (if that were not the case, I would have a problem with it too). You can save it in another TPR, in a directory, in a relational database, whatever you prefer.

@fabiand
Member Author

fabiand commented Mar 25, 2017

The benefit of having all defined VMs (regardless of their state) in the cluster, as the same kind of objects, is that it would allow external components to discover all VMs.
And this is a true benefit IMO.

If the VM registry is outside of the cluster (or uses non-VM objects), then it is not possible to just discover the cluster.
The picture would probably be more consistent with the VMs in the cluster.

Previously I was considering keeping it like we have it today, but the argument of being able to discover the cluster makes sense to me.

@fabiand
Member Author

fabiand commented Mar 25, 2017

/cc @michalskrivanek

@michalskrivanek

@fabiand - it is an interesting thought. In practice we're talking about thousands (or even tens of thousands) of VMs in a cluster. I would really like to know about etcd scalability before going there.
But as a disaster-recovery-ready cache it makes sense to have a libvirt XML cache of VMs from when they last ran on a host - provided they contain a complete VM definition equivalent to the one in the "long-term storage" - but I wouldn't go for more than that.

@fabiand
Member Author

fabiand commented Mar 27, 2017

@michalskrivanek I am not so worried about this number; after all, I'd expect that Kubernetes, and thus etcd, needs to handle an order of magnitude more containers and related objects.

And another point besides discoverability: a user expects to be able to shut down a VM and keep it shut down. But right now, IIUIC, the VM would be restarted.

@michalskrivanek

@fabiand DR would assume that it is replicated to all hosts. Overall, the required CPU and bandwidth should be negligible enough not to affect the VMs. I'm not worried that bare etcd cannot handle that, but all components need to scale along with it.
About the restart - I don't follow - why?

@rmohr
Member

rmohr commented Mar 27, 2017

    The benefit of having all defined VMs (regardless of their state) in the cluster, as the same kind of objects, is that it would allow external components to discover all VMs.

From my perspective, you are then not only discovering what is part of the cluster but also what might be part of the cluster if one chooses (which, in my understanding, means it is not part of the cluster).

I would prefer to only work with things in the cluster which meet one of two conditions:

  1. acquire resources (disks, paused VMs, suspended VMs, but not stopped VMs)
  2. are actively doing something (Controllers, active Pods, running VMs, ...)

That is where I would draw the line. As with a replication controller (which allows a count of 0), there might be side effects of technical necessities, e.g. upgrading something, which you can 'exploit' for storing something for later use, but that is not why it is there in the first place.
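
For comparison, a minimal sketch of that replication controller case - the object stays in the cluster with a count of 0 because the controller is still actively reconciling, not because it is meant as long-term storage:

```yaml
# A ReplicationController scaled to zero: it runs no pods right now,
# but the controller object itself is still active in the cluster.
apiVersion: v1
kind: ReplicationController
metadata:
  name: example-rc
spec:
  replicas: 0
  selector:
    app: example
  template:
    metadata:
      labels:
        app: example
    spec:
      containers:
        - name: example
          image: nginx
```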

@mpolednik
Contributor

@fabiand @michalskrivanek the restart should be handled by #75. It's just a limitation of the current code.

@fabiand
Member Author

fabiand commented Mar 31, 2017

OTOH we do have entities like ConfigMaps and Secrets, which exist but don't consume resources.

@rmohr
Member

rmohr commented Apr 3, 2017

@fabiand right. They are special in the sense that they are never active and never consume resources.

@rmohr
Member

rmohr commented Apr 3, 2017

Had a discussion with @michalskrivanek about this, and if I understood him right, he was more interested in the direction of having the VM definition on the node, in case a central entity can't be reached. That way, for example, virt-handler can decide on the host alone if a VM should be restarted, and does not lose the VM definition just because the VM went down. @michalskrivanek correct me if I am wrong.

@michalskrivanek

Yes, nothing more than a convenience in that case; for all other cases the only source of truth is elsewhere. It assumes the local definitions are complete, at least from the host perspective. They don't have to contain the cluster-related data, though that might be handy for DR. Alternatively, that is not necessary either if the storage persists the whole VM spec.

@fabiand
Member Author

fabiand commented Apr 5, 2017

@rmohr The VM definition itself does not - like a ConfigMap - consume resources; it's only the domain which will be consuming resources.

I could imagine that we keep the domain XML on the host in some cases. But this obviously opens up a range of problems related to getting out of sync and to making decisions in the absence of network connectivity.

@fabiand
Member Author

fabiand commented Jul 6, 2017

Closing this issue in favor of #267, which also has the relevant proposal attached.

fabiand closed this as completed Jul 6, 2017