QEMU orchestrator: implement support for remote hosts #1760

anirudhrb · 2022-02-21T09:57:06Z

The current qemu orchestrator treats the local machine as the host for the VMs. Add support to allow specifying a remote
host in the runbook. The remote host should have libvirtd running and configured to allow remote connections. VMs under test are spawned on this remote host.

Example runbook for specifying a remote host

name: qemu default
...<snip>...
platform:
  - type: qemu
    admin_private_key_file: $(admin_private_key_file)
    keep_environment: $(keep_environment)
    qemu:
      hosts:
        - address: "10.77.0.5"
          username: "anirudh"
          private_key_file: $(admin_private_key_file)
    requirement:
      qemu:
        qcow2: $(qcow2)
        cloud_init:
          extra_user_data: $(extra_user_data)

In case host is not specified in the runbook, the local machine is treated as host (existing behavior).

squirrelsc · 2022-02-22T05:09:35Z

@cwize1 Can you have a look on this change too?

lisa/tools/firewall.py

lisa/sut_orchestrator/qemu/schema.py

lisa/sut_orchestrator/qemu/platform.py

cwize1 · 2022-02-22T18:04:50Z

lisa/sut_orchestrator/qemu/platform.py

+            node_addr = address
+            node_port = 22
+            if self.qemu_platform_runbook.is_host_remote():
+                self.host_node.tools[Iptables].start_forwarding(10022, address, 22)


This doesn't seem like it'll support testing multiple VMs on the same host.

What I recommend you do is allow the user to set the libvirt network that the VM connects to. Then the user can setup their own network bridge on the host, create a libvirt "network" that points to that bridge, and then set tell LISA to use that "network" for the VMs. Then the test runner will be able to access the VMs directly without needing to deal with port forwarding.

We can use the consistent way for local and remote hosts. It will simplify the code logic. The only difference is the localhost doesn't need connection info.

What do you mean by "the consistent way"?

In case of port forwarding, we do have special logic for remote hosts (set up iptable rules for forwarding). So consistent way is not possible.

Anyway, I think it can make the logic simpler, if the port forwarding or other approaches are used on local too.
If to use port forwarding, one way is to set a start port range from high end like 30000, and map the ports one by one incrementally, if a port is unused.

I need to figure out the bridged network thing. I had tried it at first but it didn't work. I am inclined to doing it in a follow up PR.

Just to understand, in what cases do we need multiple VMs on the same host? When I do implement it how do I test it?

You can create a simple test function with the following annotation:

@TestCaseMetadata( description="", requirement=node_requirement( node=schema.NodeSpace( node_count=2, ) ), )

There are a bunch of "Microsoft suite" tests that use multiple nodes, I think mainly around testing networking. I am using it to test multi-node Kubernetes clusters.

@cwize1 The field schema.Environment.topology is used to differentiate different network topology, but it's not used so far. The value is always subnet. Please let me know, if you need to test different topology, we can discuss the requirement, and think about how to support it.

@squirrelsc Unfortunately, unlike with Hyper-V which has a fairly unified networking API, libvirt/QEMU doesn't provide much assistance for networking. Unless you are doing something super simple (e.g. using the default NAT network), then a developer has to manually create the network outside of libvirt and then point the libvirt VM at that network. Typically, either Linux kernel bridges or OVS (Open vSwitch) are used but both have their challenges. OVS is a big piece of software that is complicated to use. And every Linux distro has their own networking manager API that you have to us to provision networks, such as kernel bridges. I don't think it is worth trying to pull that complexity into LISA.

Thank you for the thoughts. IMO, it depends on test scenarios, and doesn't need to fully automate all things. The lab environments is complex, and it's acceptable for preconfigured steps. If some servers are setup with some topologies, LISA just needs to use it, and keep it unchanged. The LISA needs to be aware of the topology, and assign test cases by its requirements. If the different topology tests is needed from a test case, the platform checks the settings (from runbook) of each server, and then allocate VMs from matched servers. The ADO agents supports it by "capabilities". The capabilities just a couple of key/value pairs, it's very similar like what I want to do.

lisa/sut_orchestrator/qemu/platform.py

cwize1

.

lisa/sut_orchestrator/qemu/platform.py

lisa/sut_orchestrator/qemu/schema.py

lisa/sut_orchestrator/qemu/platform.py

cwize1

.

squirrelsc · 2022-02-25T01:47:03Z

BTW, please update document for the new schema.

lisa/sut_orchestrator/qemu/platform.py

lisa/sut_orchestrator/qemu/schema.py

lisa/util/shell.py

lisa/tools/firewall.py

lisa/sut_orchestrator/qemu/platform.py

cwize1

.

squirrelsc · 2022-02-28T17:12:41Z

lisa/sut_orchestrator/qemu/platform.py

@@ -280,12 +317,21 @@ def _configure_nodes(self, environment: Environment, log: Logger) -> None:
            node_context.cloud_init_file_path = os.path.join(
                vm_disks_dir, f"{node_context.vm_name}-cloud-init.iso"
            )
-            node_context.os_disk_base_file_path = qemu_node_runbook.qcow2
+
+            if self.host_node.is_remote:


the is_remote is incorrect here, and always be True, because it's the method itself. The is_remote() is right.

I'm not sure if the "remote" check is useful here. Can remote and local use the same folder structure?

is_remote is correct based on usages in other files. is_remote() throws the error "bool is not callable".

This is an optimization. We avoid copying OS disk image for local node. That's why remote check is useful here.

is_remote is marked with @property. So, it behaves like a C# style property.

Sorry, I'm confused on the LibVirtHost, I thought its type is LibVirtHost. BTW, what's the reason it needs a LibVirtHost schema? Can it reuse the RemoteNode schema?

I think there is some utility in having a separate schema because we could have libvirt specific fields in it. Right now we have the lisa_working_dir property that is not part of RemoteNode. In the future we could have a supported_hypervisors property for example to indicate the hypervisors (qemu, ch etc) that the host supports.

The Qemu can inherit the RemoteNode, so it's easy to obtain the general supports of node. The Node has working_path, and it can be overwritten to read from the schema.

lisa/sut_orchestrator/qemu/platform.py

lisa/tools/qemu_img.py

cwize1

.

lisa/sut_orchestrator/qemu/platform.py

lisa/tools/qemu_img.py

cwize1 · 2022-03-02T19:48:49Z

lisa/sut_orchestrator/qemu/schema.py

+    # The directory where lisa will store VM related files (such as disk images).
+    # This directory must already exist and the test user should have write permission
+    # to it.
+    lisa_working_dir: str = "/var/tmp"


Is it possible to leave the default as the base VM image's directory for the local case?

I guess it could be done if needed. Is it for backward compatibility?

When running on a local dev box, it is nice to be able to easily see all the files created, particularly for cases where you keep the environment after the test completes/fails (e.g. while debugging).

cwize1

.

QEMU orchestrator: implement support for remote hosts

8cb0716

anirudhrb requested a review from squirrelsc as a code owner February 21, 2022 09:57

squirrelsc reviewed Feb 22, 2022

View reviewed changes

lisa/tools/firewall.py Show resolved Hide resolved

squirrelsc reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/schema.py Outdated Show resolved Hide resolved

squirrelsc reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/schema.py Outdated Show resolved Hide resolved

squirrelsc reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/schema.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 22, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 suggested changes Feb 22, 2022

View reviewed changes

QEMU orchestrator: use host_node for both local and remote hosts

8ada809

cwize1 reviewed Feb 24, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 24, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/schema.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 24, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 24, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 suggested changes Feb 24, 2022

View reviewed changes

Code review fixes: round 1

4578314

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Show resolved Hide resolved

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/schema.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/util/shell.py Show resolved Hide resolved

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/tools/firewall.py Show resolved Hide resolved

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 25, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 suggested changes Feb 25, 2022

View reviewed changes

anirudhrb added 4 commits February 28, 2022 07:07

Address a few more comments

b6111a2

QEMU orchestration: remove unused 'local_node' param

5c32700

QEMU orchestrator: support multiple hosts in schema

235147f

QEMU orchestrator: don't copy os disk file for local host

184b8c2

squirrelsc reviewed Feb 28, 2022

View reviewed changes

cwize1 reviewed Feb 28, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Feb 28, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Show resolved Hide resolved

cwize1 reviewed Feb 28, 2022

View reviewed changes

lisa/tools/qemu_img.py Outdated Show resolved Hide resolved

cwize1 suggested changes Feb 28, 2022

View reviewed changes

Address some more comments

de610f7

cwize1 reviewed Mar 2, 2022

View reviewed changes

lisa/sut_orchestrator/qemu/platform.py Outdated Show resolved Hide resolved

cwize1 reviewed Mar 2, 2022

View reviewed changes

lisa/tools/qemu_img.py Show resolved Hide resolved

cwize1 reviewed Mar 2, 2022

View reviewed changes

cwize1 suggested changes Mar 2, 2022

View reviewed changes

anirudhrb added 4 commits March 3, 2022 12:07

Use node_context.vm_disks_dir instead of recalculating it

e09dd64

Use ssh transport

6c7dca2

Force run qemu-img command

5a112a5

Fix type check issue

9b6fa53

cwize1 approved these changes Mar 3, 2022

View reviewed changes

squirrelsc approved these changes Mar 3, 2022

View reviewed changes

squirrelsc merged commit 9e72070 into microsoft:main Mar 3, 2022

anirudhrb deleted the qemu_remote_host_support branch December 22, 2023 05:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QEMU orchestrator: implement support for remote hosts #1760

QEMU orchestrator: implement support for remote hosts #1760

anirudhrb commented Feb 21, 2022 •

edited

Loading

squirrelsc commented Feb 22, 2022

cwize1 Feb 22, 2022

squirrelsc Feb 28, 2022

cwize1 Feb 28, 2022

anirudhrb Feb 28, 2022

squirrelsc Feb 28, 2022

anirudhrb Mar 2, 2022

cwize1 Mar 2, 2022

squirrelsc Mar 2, 2022

cwize1 Mar 2, 2022

squirrelsc Mar 2, 2022

cwize1 left a comment

cwize1 left a comment

squirrelsc commented Feb 25, 2022

cwize1 left a comment

squirrelsc Feb 28, 2022

anirudhrb Feb 28, 2022

cwize1 Feb 28, 2022

squirrelsc Feb 28, 2022

anirudhrb Mar 2, 2022 •

edited

Loading

squirrelsc Mar 2, 2022

cwize1 left a comment

cwize1 Mar 2, 2022

anirudhrb Mar 3, 2022

cwize1 Mar 3, 2022

cwize1 left a comment

QEMU orchestrator: implement support for remote hosts #1760

QEMU orchestrator: implement support for remote hosts #1760

Conversation

anirudhrb commented Feb 21, 2022 • edited Loading

squirrelsc commented Feb 22, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cwize1 left a comment

Choose a reason for hiding this comment

cwize1 left a comment

Choose a reason for hiding this comment

squirrelsc commented Feb 25, 2022

cwize1 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anirudhrb Mar 2, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cwize1 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cwize1 left a comment

Choose a reason for hiding this comment

anirudhrb commented Feb 21, 2022 •

edited

Loading

anirudhrb Mar 2, 2022 •

edited

Loading