[IMPROVEMENT] Disable `Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly` for RWX volumes #5017

derekbit · 2022-12-07T09:18:58Z

Is your improvement request related to a feature? Please describe (👍 if you like this request)

Longhorn has a feature, `Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly`, which is enabled by default. For an RWX volume, when the volume is unexpectedly detached, it will be recreated on another node. Additionally, the NFS server already has a dedicated recovery backend. Thus, there is no need to delete the workload pods using the volume, and IO should continue after the new volume is started.
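As a rough illustration of the proposed behavior, the check could look like the Go sketch below. The `Volume` type, the `AccessMode` constants, and the `shouldDeleteWorkloadPods` function are hypothetical names invented for this example; they are not taken from longhorn-manager, and the actual change in the linked PR may be structured differently.

```go
package main

import "fmt"

// AccessMode is a hypothetical stand-in for a volume's access mode.
type AccessMode string

const (
	AccessModeReadWriteOnce AccessMode = "rwo" // assumed label for RWO
	AccessModeReadWriteMany AccessMode = "rwx" // assumed label for RWX
)

// Volume carries only the fields this sketch needs (hypothetical type).
type Volume struct {
	Name       string
	AccessMode AccessMode
}

// shouldDeleteWorkloadPods reports whether the workload pods using v
// should be deleted after an unexpected detachment.
func shouldDeleteWorkloadPods(settingEnabled bool, v Volume) bool {
	if !settingEnabled {
		// The global setting is off: never delete workload pods.
		return false
	}
	if v.AccessMode == AccessModeReadWriteMany {
		// RWX: the volume is recreated elsewhere and the NFS recovery
		// backend lets clients resume, so the pods are left alone.
		return false
	}
	// RWO and any other mode keep the existing behavior.
	return true
}

func main() {
	volumes := []Volume{
		{Name: "rwo-vol", AccessMode: AccessModeReadWriteOnce},
		{Name: "rwx-vol", AccessMode: AccessModeReadWriteMany},
	}
	for _, v := range volumes {
		fmt.Printf("%s: delete workload pods = %v\n", v.Name, shouldDeleteWorkloadPods(true, v))
	}
}
```

Keeping the access-mode check inside a single predicate means the existing setting continues to govern RWO volumes unchanged, which matches the intent described above.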

innobead · 2022-12-07T09:21:34Z

So this is specific for RWX only. I feel we need to include it in 1.4.0? WDYT?

derekbit · 2022-12-07T09:24:41Z

> So this is specific for RWX only. I feel we need to include it in 1.4.0? WDYT?

Yes. Should be in v1.4.0
Waiting for e2e https://ci.longhorn.io/job/private/job/longhorn-tests-regression/2651/

innobead · 2022-12-07T09:32:39Z

If we don't have this improvement, users will just see the workload restarted and reattached to the RWX volume from scratch, without the previous client session (maintained in our newly built recovery backend). But there will be no data loss, because the current NFS mount mode has been changed to hard mode, correct?

derekbit · 2022-12-07T11:29:16Z

If we don't have this patch, the pod (client side) will be stopped, then recreated and reattached to the volume. The disconnection can still cause data loss. Hard mode only helps while the client keeps running without being stopped.

I'm wondering whether we need a separate setting for `Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly` for RWX volumes. Not sure if any use case needs this function.

innobead · 2022-12-07T11:49:14Z

I see. Also, I know you would like to have a setting for keeping the original function. It's quite weird.

But I do feel this setting should originally be for RWO, because the volume/engine is next to the workload.

@shuo-wu @joshimoo WDYT?

derekbit · 2022-12-07T11:51:12Z

Agree. Then, we can keep the current PR and see if there is any feedback from users. WDYT?

innobead · 2022-12-07T11:53:50Z

Sounds good. BTW, could you check the commit which introduced this setting to see the context of the feature? See if this was actually a concern for RWO or also RWX included.

shuo-wu · 2022-12-07T12:18:47Z

That setting is mainly for RWO volumes. I don't think we need a separate setting for RWX volumes after the improvement.
We can see if there are any community users/customers requesting it in the future.

derekbit · 2022-12-07T12:19:09Z

> Sounds good. BTW, could you check the comment which introduced this setting to see the context of the feature? See if this was actually a concern for RWO or also RWX included.

Yes, I've checked the PR, but it doesn't mention any concern for RWO or RWX volume.
But the test cases are only for RWO volume.
Ref: #1719

innobead · 2022-12-07T12:22:09Z

Then let's do this improvement ;)

longhorn-io-github-bot · 2022-12-08T03:55:58Z

Pre Ready-For-Testing Checklist

Where is the reproduce steps/test steps documented?
The reproduce steps/test steps are at:

The steps are the same as test_rwx.py::test_rwx_delete_share_manager_pod updated in longhorn/longhorn-tests#1221.

The workload can be a DaemonSet or Deployment.

Does the PR include the explanation for the fix or the feature?
[IMPROVEMENT] Disable Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly for RWX volumes #5017 (comment)
Have the backend code been merged (Manager, Engine, Instance Manager, BackupStore etc) (including backport-needed/*)?

longhorn/longhorn-manager#1589

Which areas/issues this PR might have potential impacts on?
Area: RWX volume
Issues

derekbit · 2022-12-08T04:44:16Z

e2e after updating test_rwx.py::test_rwx_delete_share_manager_pod

https://ci.longhorn.io/job/private/job/longhorn-tests-regression/2657/

chriscchien · 2022-12-08T09:56:44Z

Verified in longhorn master 13ac2a with test steps
Result Pass

After deleting the share-manager pod and waiting for it to work again, I wrote extra data into the StatefulSet pod's mount point. The newly added data can be seen under /export/<pv_name> in the share-manager pod.
Test case passed in master-head pipeline.
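
One additional check that this thread does not spell out, but that follows from the improvement, is confirming the workload pods were left in place while the share-manager pod was recreated. The Go sketch below assumes pod names and UIDs were captured before and after the share-manager deletion (for example from the Kubernetes API) and compares the two snapshots. The `recreatedPods` helper and the sample data are hypothetical and are not part of longhorn-tests.

```go
package main

import (
	"fmt"
	"sort"
)

// recreatedPods returns the names of pods that disappeared or whose UID
// changed between two snapshots (pod name -> pod UID). A changed UID
// means the pod object was deleted and created again.
func recreatedPods(before, after map[string]string) []string {
	var changed []string
	for name, uid := range before {
		if newUID, ok := after[name]; !ok || newUID != uid {
			changed = append(changed, name)
		}
	}
	sort.Strings(changed) // deterministic order for reporting
	return changed
}

func main() {
	// Hypothetical snapshots taken around the share-manager pod deletion.
	before := map[string]string{"web-0": "uid-a", "web-1": "uid-b"}
	after := map[string]string{"web-0": "uid-a", "web-1": "uid-b"}

	if changed := recreatedPods(before, after); len(changed) == 0 {
		fmt.Println("workload pods were not recreated")
	} else {
		fmt.Println("recreated workload pods:", changed)
	}
}
```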

derekbit added area/volume-rwx Volume RWX related kind/improvement Request for improvement of existing function labels Dec 7, 2022

derekbit self-assigned this Dec 7, 2022

innobead added area/stability System or volume stability area/resilience System or volume resilience labels Dec 7, 2022

derekbit mentioned this issue Dec 7, 2022

Disable Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly for RWX volumes longhorn/longhorn-manager#1589

Merged

innobead added the priority/0 Must be fixed in this release (managed by PO) label Dec 7, 2022

innobead added this to the v1.4.0 milestone Dec 7, 2022

innobead added the area/setting Global setting or volume setting label Dec 7, 2022

derekbit mentioned this issue Dec 8, 2022

Update test_rwx_delete_share_manager_pod longhorn/longhorn-tests#1221

Merged

khushboo-rancher assigned chriscchien Dec 8, 2022

chriscchien closed this as completed Dec 8, 2022

innobead mentioned this issue Dec 23, 2022

Update test_rwx_delete_share_manager_pod (backport #1221) longhorn/longhorn-tests#1229

Merged

innobead mentioned this issue May 10, 2023

[TEST] Update test cases to exclude RWX from physical node down test cases #5900

Closed

derekbit mentioned this issue Oct 4, 2023

[TASK] Revert "Disable Automatically Delete Workload Pod when The Volume Is Detached Unexpectedly for RWX volumes" #6838

Closed

innobead mentioned this issue Oct 27, 2023

[RELEASE] 1.5.2 #6981

Closed

13 tasks

w13915984028 mentioned this issue Apr 13, 2024

[ENHANCEMENT] Review LH setting auto-delete-pod-when-volume-detached-unexpectedly harvester/harvester#5580

Open
