
improved logging in nats to nats connections #622

Closed
ripienaar opened this issue Feb 19, 2018 · 7 comments

@ripienaar
Contributor

  • Defect
  • Feature Request or Change Proposal

Feature Requests

Use Case:

improved operability of clusters

Proposed Change:

We need better logging in clusters. For example, I found slow consumer messages in my logs; I believe these come from the cluster node <-> cluster node connections, but it's hard to say.

Elevating these lines to info would help:

https://github.com/nats-io/gnatsd/blob/ee7b97e6ee3068900d39f1fe4ae7b75f358416ab/server/route.go#L142

https://github.com/nats-io/gnatsd/blob/ee7b97e6ee3068900d39f1fe4ae7b75f358416ab/server/route.go#L737

Elevating this to error would help:

https://github.com/nats-io/gnatsd/blob/ee7b97e6ee3068900d39f1fe4ae7b75f358416ab/server/route.go#L168

An additional change would be to better log, on the side that initiated the connection, when cluster connections drop, so we know this happens.
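
To make the intent concrete, here is a minimal, hypothetical Go sketch (not gnatsd's actual logger or the code at the linked lines): route lifecycle events move from the debug level, which is hidden by default, to info/error so they always reach the log. The logger type, method names, and messages below are illustrative assumptions only.

```go
// Hypothetical sketch of the proposed change; not gnatsd code.
package main

import "log"

// logger mimics a leveled logger with a debug switch, similar in
// spirit to running the server with or without -D.
type logger struct {
	debug bool
}

func (l *logger) Debugf(format string, v ...interface{}) {
	if l.debug {
		log.Printf("[DBG] "+format, v...)
	}
}

func (l *logger) Noticef(format string, v ...interface{}) {
	log.Printf("[INF] "+format, v...)
}

func (l *logger) Errorf(format string, v ...interface{}) {
	log.Printf("[ERR] "+format, v...)
}

func main() {
	l := &logger{debug: false} // typical production setting: debug off

	// Before: route lifecycle events logged at debug level are invisible
	// unless the server runs with debug enabled.
	l.Debugf("Route connection closed")

	// After (the proposal): lifecycle events at info, failures at error,
	// so they always reach the log and can be reviewed after an incident.
	l.Noticef("Route connection to %s closed", "nats-2:6222")
	l.Errorf("Error trying to connect to route %s: %v", "nats-2:6222", "connection refused")
}
```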

Who Benefits From The Change(s)?

cluster operators

Alternative Approaches

n/a

@derekcollison
Member

I think being able to dynamically turn on tracing and debug modes without restart might be a better direction here.
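
As a rough illustration of this idea (not gnatsd's actual mechanism), a debug flag could be flipped at runtime, for example via a signal, without restarting the server. The signal choice, flag name, and log format below are assumptions for the sketch only.

```go
// Hypothetical sketch of toggling debug logging at runtime; not gnatsd code.
package main

import (
	"log"
	"os"
	"os/signal"
	"sync/atomic"
	"syscall"
	"time"
)

// debugEnabled can be flipped while the process keeps running.
var debugEnabled atomic.Bool

func debugf(format string, v ...interface{}) {
	if debugEnabled.Load() {
		log.Printf("[DBG] "+format, v...)
	}
}

func main() {
	// SIGUSR1 toggles debug logging on and off without a restart.
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGUSR1)
	go func() {
		for range sigs {
			debugEnabled.Store(!debugEnabled.Load())
			log.Printf("[INF] debug logging enabled: %v", debugEnabled.Load())
		}
	}()

	for {
		debugf("verbose route activity details")
		time.Sleep(time.Second)
	}
}
```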

@ripienaar
Contributor Author

Debugging would instantly overwhelm my /var partition :)

It might be useful, but basic error logging and info for major events will go very far.

@ripienaar
Contributor Author

ripienaar commented Feb 20, 2018

Just to expand on that: I generally never want to see debug output in a production setting. With very large numbers of nodes connected, sending many things, and clients coming and going, debug would create a storm of noise and you'd miss what's going on. Further, it's not retrospective; you cannot run in debug all the time.

The lines I highlighted, though, I always want to see; they are critical for the operability of the software imo. I want to be able to go back and review logs after an incident and know this happened, and it should be safe to always expose this data.

Logging appropriately is the correct action here - this is not debug information.

Not to say being able to enable debug/trace dynamically would not be good - but it would not solve this problem.

@kozlovic
Member

In general, I kind of agree that these could be elevated, but I have a concern about the one trying to establish the route. If you have a static route but the remote server is not running, this notice/error would then be printed every 2 seconds. With config reload you should now be able to remove the route, though.

I agree that the dynamic nature of enabling logging may not help once the event you are interested in has already happened.
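
One possible way to keep the elevated messages without flooding the log when a static route keeps failing every couple of seconds is to rate-limit the repeated error line. The sketch below is hypothetical and not gnatsd code; the type name, interval, and message format are chosen only for illustration.

```go
// Hypothetical sketch of rate-limited route error logging; not gnatsd code.
package main

import (
	"errors"
	"log"
	"time"
)

// routeLogger logs repeated connect failures for a solicited route at most
// once per interval, instead of on every retry.
type routeLogger struct {
	lastLogged time.Time
	interval   time.Duration
}

func (r *routeLogger) connectFailed(remote string, err error) {
	if time.Since(r.lastLogged) >= r.interval {
		log.Printf("[ERR] Error trying to connect to route %q: %v (suppressing repeats for %s)",
			remote, err, r.interval)
		r.lastLogged = time.Now()
	}
}

func main() {
	rl := &routeLogger{interval: time.Minute}
	errDown := errors.New("connection refused")

	// Simulate a static route retrying every 2 seconds against a stopped
	// remote server: only the first failure per minute reaches the log.
	for i := 0; i < 5; i++ {
		rl.connectFailed("nats-2:6222", errDown)
		time.Sleep(2 * time.Second)
	}
}
```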

@ripienaar
Contributor Author

@kozlovic good point about the remote server messages, and this will be made worse by the business of announced cluster members never expiring when a node is down, which is in itself probably something worth knowing about.

In general, though, I think it's normal and expected that those messages would appear; admins are used to that kind of thing.

@ripienaar
Contributor Author

ripienaar commented Mar 29, 2018

@derekcollison
Member

I believe we have addressed this in #692
