subctl: Enhance diagnose to troubleshoot state of submariner #961

Open
vthapar opened this issue Oct 3, 2023 · 2 comments
Labels
enhancement New feature or request priority:high size:extra-large subctl Subctl related issues

Comments

@vthapar
Contributor

vthapar commented Oct 3, 2023

What would you like to be added:
Enhance subctl diagnose to do more analysis than it currently does. It currently focuses on finding out if something has gone wrong, not why it has gone wrong. Some enhancements that can be done are:

  1. For OVN-CI, make sure legacy ports and similar leftovers are not present.
  2. Make sure OVN flows, router policies, etc. use the correct IPs as per the endpoints.
  3. Check that the programmed iptables rules use the correct IPs.
  4. For Globalnet, make sure exported services use the same IPs as the GlobalIngressIPs allocated to them.
  5. Check how frequently messages are logged. Overly frequent logging can cause log overflow in long-running setups, losing crucial information. This should help catch any overzealous logging.
  6. Check if pod logs are about to roll over, so the user can back them up for future troubleshooting. Note: This should probably be an alert.
  7. Check that multicluster objects match in content across the source, broker, and destination.

Why is this needed:
Currently, subctl diagnose performs only basic diagnosis: it checks deployment and pod states, runs firewall tests, and so on. Much troubleshooting still requires the dev team to gather logs and analyze them, even though some of that manual analysis could easily be automated. The aim is to minimize the time and effort the dev team has to spend on troubleshooting.

@vthapar vthapar added enhancement New feature or request subctl Subctl related issues size:extra-large priority:medium labels Oct 3, 2023
@vthapar vthapar added this to Backlog in Backlog via automation Oct 3, 2023
@vthapar vthapar self-assigned this Oct 3, 2023
@vthapar vthapar moved this from Backlog to Next Version Candidate in Backlog Oct 4, 2023
@Jaanki Jaanki removed this from Next Version Candidate in Backlog Oct 4, 2023
@maayanf24
Contributor

@vthapar - Are you preparing an EP for this epic?

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further
activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale label Feb 28, 2024
@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 6, 2024
@tpantelis tpantelis removed the stale label Mar 6, 2024
@tpantelis tpantelis reopened this Mar 6, 2024
@tpantelis tpantelis self-assigned this Mar 14, 2024
Projects
Status: Todo
Development

No branches or pull requests

4 participants