
Tune iscsid.conf for better performance #5818

Merged

rene merged 1 commit into lf-edge:master from zedi-pramodh:increase-iscsi-queue-depth on Apr 27, 2026

Conversation

@zedi-pramodh

During a POC with a customer we found a regression: the iSCSI queue depth had been set to 1 during the initial development phase of eve-k and was never restored.

So set defaults tuned for SSD devices, which we expect as the minimum spec for clustering nodes. For comparison:

HDD spinning disks:
queue_depth = 64 ← HDDs can't handle deep queues well
cmds_max = 128

SSD (SATA):
queue_depth = 128 ← SSDs handle deeper queues well
cmds_max = 256

NVMe:
queue_depth = 256 ← NVMe has native deep queue support
cmds_max = 512
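For reference, these settings live in the open-iscsi daemon config. A fragment of iscsid.conf with the SATA-SSD values above might look like this (the key names are the standard open-iscsi ones; the values are the tuning proposed in this PR, not universal recommendations):

```
# /etc/iscsi/iscsid.conf (fragment) -- SATA SSD tuning from this PR
node.session.queue_depth = 128
node.session.cmds_max = 256
```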

How to test and validate this PR

  1. Create a 3-node cluster.
  2. Deploy multiple VMs.
  3. Run some activity in the VMs and watch the performance.
  4. Do a few failovers and see how quickly the replicas sync.

You should see improved response times compared to before this PR.
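As a quick sanity check during validation, the configured values can be pulled out of an iscsid.conf-style file with a small script. This is a hypothetical helper for illustration, not part of the PR:

```python
def parse_iscsi_tuning(conf_text):
    """Return the queue-depth tuning keys found in iscsid.conf-style text."""
    wanted = {"node.session.queue_depth", "node.session.cmds_max"}
    values = {}
    for line in conf_text.splitlines():
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        if key in wanted:
            values[key] = int(value.strip())
    return values

sample = """
# tuned for SATA SSDs
node.session.queue_depth = 128
node.session.cmds_max = 256
"""
print(parse_iscsi_tuning(sample))
```

Running this against the deployed /etc/iscsi/iscsid.conf confirms the new defaults actually landed on the node.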

Changelog notes

None

Checklist

  • I've provided a proper description

  • I've added the proper documentation

  • I've tested my PR on amd64 device

  • I've written the test verification instructions

  • I've set the proper labels to this PR

  • I've checked the boxes above, or I've provided a good reason why I didn't
    check them.

Please check the boxes above after submitting the PR in interactive mode.

@zedi-pramodh
Author

@andrewd-zededa can you review this one.

@zedi-pramodh
Author

@andrewd-zededa can you look into this PR.

@andrewd-zededa
Contributor

@zedi-pramodh my only worry here is changing the default for all systems to unlock the queue depth for the user/app volumes. On lower-resource systems where the k3s DB shares the same disk and filesystem, this could let user workloads starve k3s of I/O. Can the default be set but also exposed as an EVE config property, so that other systems can be debugged without a new EVE build? I would propose having zedkube:

  1. read the config property
  2. set it as a value in types.KubeConfig
  3. then the kube service container can read the value and overwrite iscsid.conf
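A rough sketch of step 3 in Go (EVE's implementation language): the kube service container takes the queue depth it received via the proposed config plumbing and rewrites the matching keys in iscsid.conf. `applyIscsiTuning` and the surrounding flow are hypothetical names for illustration; only the iscsid.conf key names are real open-iscsi settings.

```go
package main

import (
	"fmt"
	"strings"
)

// applyIscsiTuning rewrites the queue-depth keys in iscsid.conf-style text
// with values received from the (assumed) types.KubeConfig config property.
// All other lines pass through unchanged.
func applyIscsiTuning(conf string, queueDepth, cmdsMax int) string {
	var out []string
	for _, line := range strings.Split(conf, "\n") {
		trimmed := strings.TrimSpace(line)
		switch {
		case strings.HasPrefix(trimmed, "node.session.queue_depth"):
			out = append(out, fmt.Sprintf("node.session.queue_depth = %d", queueDepth))
		case strings.HasPrefix(trimmed, "node.session.cmds_max"):
			out = append(out, fmt.Sprintf("node.session.cmds_max = %d", cmdsMax))
		default:
			out = append(out, line)
		}
	}
	return strings.Join(out, "\n")
}

func main() {
	conf := "node.session.queue_depth = 1\nnode.session.cmds_max = 128"
	fmt.Println(applyIscsiTuning(conf, 128, 256))
}
```

The point of the sketch is that the rewrite is idempotent and needs no EVE rebuild: a changed config property just triggers another pass over iscsid.conf.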

@zedi-pramodh
Author

> @zedi-pramodh my only worry here is changing the default for all systems to unlock the queue depth for the user/app volumes. […] I would propose having zedkube read the config property, set it in types.KubeConfig, and have the kube service container overwrite iscsid.conf.

I am not clear on your suggestion. The default values I set are actually the recommended defaults for SSD drives. Even if we change this through an EVE config property, are we saying we have to restart all the apps or reboot the nodes for the change to take effect? Also, leaving this for customers to tune is a bit of overhead, in my opinion.

@andrewd-zededa
Contributor

> I am not clear on your suggestion. The default values I set are actually recommended as default for SSD drives. […]

The intended user of the config property would be for dev/debug on new systems, not end-user operators.

@zedi-pramodh
Author

> The intended user of the config property would be for dev/debug on new systems, not end-user operators.

For that, we can modify it locally, right?
iscsiadm -m node -o update -n -v

@andrewd-zededa
Contributor

> For that we can modify locally right ? iscsiadm -m node -o update -n -v

Ok that can work for non-persistent testing. I think long term there should be some facility to configure settings from the controller and persist them.

@zedi-pramodh zedi-pramodh force-pushed the increase-iscsi-queue-depth branch from 2c599d1 to ae8e48c Compare April 25, 2026 03:56
@zedi-pramodh
Author

Changed the values after testing on slower disks. Please check.

@eriknordmark eriknordmark left a comment
Contributor

LGTM

During a POC with a customer we found a regression: the iSCSI queue depth
had been set to 1 during the initial development phase of eve-k and was
never restored.

So set defaults tuned for SSD devices, which we expect as the minimum spec
for clustering nodes.

HDD spinning disks:
  queue_depth = 64    ← HDDs can't handle deep queues well
  cmds_max    = 128

SSD (SATA):
  queue_depth = 128   ← SSDs handle deeper queues well
  cmds_max    = 256

NVMe:
  queue_depth = 256   ← NVMe has native deep queue support
  cmds_max    = 512

Signed-off-by: Pramodh Pallapothu <pramodh@zededa.com>
@zedi-pramodh zedi-pramodh force-pushed the increase-iscsi-queue-depth branch from ae8e48c to 1e7806c Compare April 26, 2026 17:14
@github-actions github-actions Bot requested a review from eriknordmark April 26, 2026 17:15
@zedi-pramodh
Author

> Ok that can work for non-persistent testing. I think long term there should be some facility to configure settings from the controller and persist them.

I agree that long term we should look at a config value and tune-test it on different HW systems; maybe we can provide it to the HW certification team.

@rene rene merged commit 5636816 into lf-edge:master Apr 27, 2026
36 of 42 checks passed

4 participants