feat: add debug logging when NodeFilters fail #69

IvanVergiliev · 2022-10-04T11:05:32Z

Description

High-level notes:

The filters are pretty verbose now. This is because each boolean that would cause the filter to fail is checked separately, so that it can be reported accurately in the logs. The verbosity is fairly annoying, but it makes it very easy to check and verify the routing logic simply by looking at the logs. We can revert back to a more succinct version when we're more used to working with these and we're okay with dropping the logging.
The IsHealthy filter is broken down into separate Peers and Syncing filters. When implemented in a single function, it was unnecessarily complicated to properly handle all the failure cases and add logging accordingly. Breaking it down makes each filter fairly simple on its own, and it's also easy to combine them because we can reuse the AndFilter 🙂
The NodeFilters access some *Check internals. For example, the HasEnoughPeers filter needs to access the err field of the PeerCheck object. This seems tolerable for now, but I'm totally open to changing it.
The IsPassing methods of the various Checkers are gradually becoming redundant. As I move the corresponding logic to NodeFilters, I don't think any production code uses the IsPassing methods anymore. I didn't remove them here because it would've required moving some more code around to get the tests to work and I didn't want to make the PR too huge.

Type of change

🐛 Bug fix (non-breaking change which fixes an issue)
😎 New feature (non-breaking change which adds functionality)
⁉️ Breaking change (fix or feature that would cause existing functionality to not work as expected)
⚒️ Refactor (no functional changes)
📖 Documentation (updating or adding docs)

How Has This Been Tested?

Deployed to staging and made sure I see the logging messages I expect for various <request, upstream> combinations.

brianluong · 2022-10-04T17:36:33Z

internal/route/node_filter.go


-	checkIsHealthy := upstreamStatus.BlockHeightCheck.GetError() == nil
-	isClose := upstreamStatus.BlockHeightCheck.GetBlockHeight()+f.maxBlocksBehind >= maxHeight
+	zap.L().Debug("Upstream too far behind global max height!", zap.Uint64("UpstreamHeight", upstreamHeight), zap.Uint64("MaxHeight", maxHeight))


Should we add upstreamConfig.ID to these debug logs? To correlate which upstream the filter is acting upon.

Yep! Was indeed already planning to do it as I browsed through the logs a bunch of times and had a bit of a hard time figuring out which message was about which upstream, especially when there were concurrent requests in-flight.

IvanVergiliev added 7 commits October 4, 2022 12:51

feat: add pass/fail logging for NodeFilters

0c8bd01

refactor: revert to a single return value + logging

029a2da

style: make linter happy

b6ace25

style: inline some vars to simplify test

2c8e49c

fix: only log at top-level ANDs

7db94f5

fix: remove extraneous log message part

b13a5b5

fix: add logging for SimpleIsStatePresent filter

d90bcd5

IvanVergiliev requested a review from brianluong October 4, 2022 11:05

brianluong approved these changes Oct 4, 2022

View reviewed changes

fix: add UpstreamID to NodeFilter logs

2ea8f16

IvanVergiliev merged commit d53f9c2 into main Oct 5, 2022

IvanVergiliev deleted the productionize-stuff branch October 5, 2022 07:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add debug logging when NodeFilters fail #69

feat: add debug logging when NodeFilters fail #69

IvanVergiliev commented Oct 4, 2022

brianluong Oct 4, 2022

IvanVergiliev Oct 5, 2022

feat: add debug logging when NodeFilters fail #69

feat: add debug logging when NodeFilters fail #69

Conversation

IvanVergiliev commented Oct 4, 2022

Description

Type of change

How Has This Been Tested?

brianluong Oct 4, 2022

Choose a reason for hiding this comment

IvanVergiliev Oct 5, 2022

Choose a reason for hiding this comment