New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create an endpoint that shows agora internals (for debugging) #1934
Comments
What I was actually thinking about is expose an UNIX socket that would allow one to directly interact with the nodes. This would also allow us to control nodes, e.g. by setting log level and co. But all things consider, going the HTTP route, with a control interface, might be much simpler to implement. |
For the moment this should be added to the admin interface. agora/source/agora/api/Admin.d Lines 30 to 42 in 2809791
|
Already part of the API: agora/source/agora/api/FullNode.d Line 246 in 2809791
Currently not very useful, but will keep it in mind for the future.
We already have
I think we're better off just logging it for now.
Yup.
I think the logger reconfigure mentioned above helps with this. |
I think it would be nice to have all that information show up as a result of 1 HTTP/REST call, so we don't have to reconfigure/scavenge the logfiles. We currently have 5 validators, logging into each one of those machines and trying to find relevant log messages will still take a lot of time versus just opening up 5 browser tabs. We might not be able to find the exact reason of the problem we are trying to debug, but we would hopefully at least know if it is SCP/Network/Something else |
For that, maybe a EKL stack would be better suited ? |
Currently it is very hard to debug complex problems like: #1552 in our production environment.
It would be helpful to have a http endpoint that exposes the most important internals of the Agora instance to help debugging. This includes:
Some of these information is already available through other endpoints, but it would be good to see them all on the same endpoint.
Some information like know preimages are also known, if we ssh'd into the machine and read the sqlite db, but that is very cumbersome to do.
Some information like SCP messages are only available in the logs, if we set the level to Trace.
Some information like currently banned nodes are in its own file.
The text was updated successfully, but these errors were encountered: