
Traffic disturbance 2 minutes after node restart #11787

Open
ljkiraly opened this issue Apr 12, 2024 · 4 comments

@ljkiraly
Contributor

ljkiraly commented Apr 12, 2024

Expected Behavior

A node restart should not impact traffic between elements running on other nodes.

Current Behavior

Two minutes after a worker restart there was a traffic outage.

Failure Information

Cannot reproduce this, but it fails often in nightly tests. Logs from a failed test run are in
traffic_outage_after_node_reboot_log.tar.gz

The node reboot is at:
[2024-04-05T13:45:03.923Z] robustness-node-restart-test.sh: Rebooting node: worker-pool1-1dn6k2vc-n121-vpod1-pnes8010-ipv4

Traffic was stopped between [2024-04-05T13:47:06.910Z] and [2024-04-05T13:48:12.064Z].
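
For reference, the delay and outage length implied by the timestamps above can be worked out with a short Python sketch. The helper and variable names are illustrative only and are not taken from the test scripts; the three timestamps are copied from the log lines quoted in this report.

```python
from datetime import datetime


def parse_ts(ts: str) -> datetime:
    # The log timestamps end in "Z" (UTC). Replace it with an explicit offset
    # so datetime.fromisoformat also accepts it on Python versions before 3.11.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


# Timestamps copied from the report.
reboot = parse_ts("2024-04-05T13:45:03.923Z")
outage_start = parse_ts("2024-04-05T13:47:06.910Z")
outage_end = parse_ts("2024-04-05T13:48:12.064Z")

# Time from the reboot command to the start of the outage, in seconds.
delay_s = (outage_start - reboot).total_seconds()
# Length of the outage itself, in seconds.
outage_s = (outage_end - outage_start).total_seconds()

print(f"outage began {delay_s:.3f} s after reboot")
print(f"outage lasted {outage_s:.3f} s")
```

With these values, traffic stopped about 123 seconds after the reboot was triggered and stayed down for about 65 seconds.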

Context

  • NSM Version: v1.13.0-rc2
    The issue can be seen with NSM v1.12.1-rc.1 also.
@denis-tingaikin
Member

NSM Version: v1.13.0-rc1
The issue can be seen with NSM v1.12.1-rc.1 also.

Hm, as far as I know, we fixed something similar in v1.13.0.
Have you tried it on v1.13.0? 

@denis-tingaikin denis-tingaikin self-assigned this Apr 12, 2024
@denis-tingaikin denis-tingaikin added the bug Something isn't working label Apr 12, 2024
@ljkiraly
Contributor Author

NSM Version: v1.13.0-rc1
The issue can be seen with NSM v1.12.1-rc.1 also.

Hm, as far as I know, we fixed something similar in v1.13.0. Have you tried it on v1.13.0?

The logs are from a test run with v1.13.0-rc1. Is there a difference between v1.13.0 and v1.13.0-rc1? I just mentioned NSM v1.12.1-rc.1 to clarify that this is not a new bug. It is considered as a medium priority issue.

@denis-tingaikin
Member

It is considered as a medium priority issue.

OK, good that it's not critical.

The logs are from a test run with v1.13.0-rc1. Is there a difference between v1.13.0 and v1.13.0-rc1?

Yes, there is a difference. We fixed a few bugs in v1.13.0, such as #11372. v1.13.0-rc.1 doesn't contain that fix; v1.13.0-rc.2 does.

@ljkiraly
Contributor Author

Ah, I missed the version, sorry: the logs are from a test run with NSM v1.13.0-rc2. Fixing in description.

Labels
bug Something isn't working
Projects
Status: Todo
Development

No branches or pull requests

2 participants