
block: Leverage multiqueue for virtio-block #4503

Merged
merged 1 commit into kata-containers:main on Jun 23, 2022

Conversation

amshinde (Member) commented:
Similar to network, we can use multiple queues for virtio-block
devices. This can help improve storage performance.
This commit changes the number of queues for block devices to
the number of cpus for cloud-hypervisor and qemu.

Today the default number of cpus a VM starts with is 1,
so only one queue will be used. This change improves
performance when the default number of cold-plugged cpus
is set greater than one in the config file. It may also help
when the sandboxing feature is used with k8s, which passes
the sum of the required resources down to Kata.

Fixes #4502

Signed-off-by: Archana Shinde archana.m.shinde@intel.com

@katacontainersbot katacontainersbot added the size/tiny Smallest and simplest task label Jun 21, 2022
-	if err = q.qmpMonitorCh.qmp.ExecutePCIDeviceAdd(q.qmpMonitorCh.ctx, drive.ID, devID, driver, addr, bridge.ID, romFile, 0, true, defaultDisableModern); err != nil {
+	queues := int(q.config.NumVCPUs)
+
+	if err = q.qmpMonitorCh.qmp.ExecutePCIDeviceAdd(q.qmpMonitorCh.ctx, drive.ID, devID, driver, addr, bridge.ID, romFile, queues, true, defaultDisableModern); err != nil {
fengwang666 (Member) commented:

What about virtio-scsi? Can we also enable multi-queue for virtio-scsi?

amshinde (Member, Author) replied:

@fengwang666 Yes, I am planning to add this to scsi as well, maybe in a follow-up PR. I do want to do some performance testing before that, like figuring out if we need to cap the number of queues. Beyond a point, increasing the queues may not give any benefits.
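The capping idea mentioned above can be sketched in Go. Note that `maxBlockQueues` and `blockQueueCount` are hypothetical names for illustration, not actual Kata code; the PR itself simply uses `queues := int(q.config.NumVCPUs)` with no cap.

```go
package main

import "fmt"

// maxBlockQueues is a hypothetical cap, not a value from this PR: as noted
// above, increasing the number of queues beyond some point may not give
// any benefit.
const maxBlockQueues = 8

// blockQueueCount is an illustrative helper mirroring the PR's
// `queues := int(q.config.NumVCPUs)` while capping the result.
func blockQueueCount(numVCPUs int) int {
	if numVCPUs < 1 {
		return 1 // virtio-blk needs at least one queue
	}
	if numVCPUs > maxBlockQueues {
		return maxBlockQueues
	}
	return numVCPUs
}

func main() {
	for _, vcpus := range []int{1, 4, 32} {
		fmt.Printf("vcpus=%d -> queues=%d\n", vcpus, blockQueueCount(vcpus))
	}
}
```

The right cap value would need the performance testing mentioned above; 8 here is purely a placeholder.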

@amshinde amshinde requested a review from sboeuf June 21, 2022 20:39
amshinde (Member, Author) commented:

/test

fengwang666 (Member) left a comment:

LGTM

@@ -753,6 +753,11 @@ func (clh *cloudHypervisor) hotplugAddBlockDevice(drive *config.BlockDrive) erro
clhDisk.Readonly = &drive.ReadOnly
clhDisk.VhostUser = func(b bool) *bool { return &b }(false)

queues := int32(clh.config.NumVCPUs)
queueSize := int32(1024)
liubin (Member) commented:

@amshinde Can you add a comment describing why 1024 was chosen as the queue size here?

A reviewer commented:

Yep, I agree with @liubin; it'd be good to understand why you picked 1024. A deeper queue might bring benefits or drawbacks depending on the use case.
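For context on the two values being discussed, cloud-hypervisor exposes both the queue count and the queue depth as `num_queues` and `queue_size` parameters on its `--disk` option. A sketch of a standalone invocation (paths and values are illustrative, not from this PR):

```shell
# Illustrative cloud-hypervisor invocation: 4 virtio-blk queues,
# each 1024 descriptors deep, matching a 4-vcpu guest.
cloud-hypervisor \
    --kernel ./vmlinux \
    --cpus boot=4 \
    --memory size=1G \
    --disk path=rootfs.img,num_queues=4,queue_size=1024
```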

bergwolf (Member) commented:

Nice work @amshinde! Is this the only missing piece needed for multi-queue support throughout the block IO stack, or do we need to change more things inside the guest to get more performance out of it (e.g., enable blk-mq in the guest virtio-block driver)?

amshinde (Member, Author) commented Jun 23, 2022:

@bergwolf The required kernel config is CONFIG_BLK_MQ_VIRTIO. We had to enable it explicitly before we moved to the fragments structure for building the kernel. We don't explicitly enable it now, but it defaults to "y" when VIRTIO is turned on: https://github.com/torvalds/linux/blob/master/block/Kconfig#L215
I verified this with a kernel I built using our kernel setup script and saw CONFIG_BLK_MQ_VIRTIO=y in the resulting kernel config file.
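The verification step described above can be sketched as a grep against the kernel config. A sample config fragment stands in for the real .config produced by the Kata kernel build scripts, since the actual path depends on your build tree:

```shell
# Sketch: confirming blk-mq virtio support in a built guest kernel config.
# sample.config is a stand-in for the real .config from the kernel build.
cat > sample.config <<'EOF'
CONFIG_VIRTIO=y
CONFIG_BLK_MQ_VIRTIO=y
EOF

if grep -q '^CONFIG_BLK_MQ_VIRTIO=y' sample.config; then
    echo "blk-mq virtio enabled"
else
    echo "blk-mq virtio missing"
fi
```

Against a real build, point the grep at the generated .config (or at /proc/config.gz inside the guest, via zgrep, if CONFIG_IKCONFIG_PROC is enabled).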

sboeuf left a comment:

LGTM

@fidencio fidencio merged commit 133528d into kata-containers:main Jun 23, 2022
@amshinde amshinde deleted the multi-queue-block branch November 15, 2022 21:26
Successfully merging this pull request may close: Add multiple queues for virtio-block devices (#4502)

7 participants