subctl: Enhance diagnose to troubleshoot state of submariner #961

vthapar · 2023-10-03T13:53:34Z

What would you like to be added:
Enhance subctl diagnose to do more analysis than it currently does. It currently focuses on finding out if something has gone wrong, not why it has gone wrong. Some enhancements that can be done are:

For OVN-CI, make sure legacy ports etc. are not present.
Make sure OVN flows, router policies etc. are using correct IPs as per endpoints.
Check of IP Tables rules programed are using correct IPs.
For Globalnet, make sure exported services are using same IPs as GlobalIngressIPs allocated to them.
Check the logs for frequency of logs. Too frequent logs can cause log overflow in long running setups, losing crucial information. This shold help catch any overzealous logs.
Check if pod logs are about to runover, so user can back them up for future troubleshooting. Note: This should probably be an alert.
Check if any multicluster objects match in contents on source, broker and destination.

Why is this needed:
Currently subctl diagnose only does basic diagnosis. Checks for deployments and pods states, run firewall test etc. But lot of troubleshooting still requires dev team to gather logs and analyze them. Some of the analysis done manually can be easily automated. Aim is to minimize effort and time dev team has to spend on troubleshooting.

The text was updated successfully, but these errors were encountered:

maayanf24 · 2023-10-30T14:35:25Z

@vthapar - Are you preparing an ep for this epic?

github-actions · 2024-02-28T00:10:06Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further
activity occurs. Thank you for your contributions.

vthapar added enhancement New feature or request subctl Subctl related issues size:extra-large priority:medium labels Oct 3, 2023

vthapar added this to Backlog in Backlog via automation Oct 3, 2023

vthapar self-assigned this Oct 3, 2023

vthapar moved this from Backlog to Next Version Candidate in Backlog Oct 4, 2023

Jaanki added priority:high and removed priority:medium labels Oct 4, 2023

Jaanki removed this from Next Version Candidate in Backlog Oct 4, 2023

github-actions bot added the stale label Feb 28, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 6, 2024

tpantelis removed the stale label Mar 6, 2024

tpantelis reopened this Mar 6, 2024

tpantelis self-assigned this Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

subctl: Enhance diagnose to troubleshoot state of submariner #961

subctl: Enhance diagnose to troubleshoot state of submariner #961

vthapar commented Oct 3, 2023 •

edited

maayanf24 commented Oct 30, 2023

github-actions bot commented Feb 28, 2024

subctl: Enhance diagnose to troubleshoot state of submariner #961

subctl: Enhance diagnose to troubleshoot state of submariner #961

Comments

vthapar commented Oct 3, 2023 • edited

maayanf24 commented Oct 30, 2023

github-actions bot commented Feb 28, 2024

vthapar commented Oct 3, 2023 •

edited