Consumergroup tests and refactoring #53

MrTrustworthy · 2019-09-18T13:33:16Z

No description provided.

MrTrustworthy · 2019-09-18T13:36:14Z

I need some input regarding the output of esque describe consumergroup x. I was thinking about having a "simple summary" description per default, and only providing -o yaml etc outputs in case you want to get the full info. Not sure if that's a good idea, but in the spirit of "Kafka Ops for humans" I didn't want to just dump out all info per default.

Short summary currently looks like this:

hfjn · 2019-09-18T14:00:46Z

@MrTrustworthy I love to make the describe commands more approachable. But we should agree on a level of verbosity that is consistent for all our commands.

MrTrustworthy · 2019-09-18T14:05:07Z

@hfjn exactly why I posted that here :D

Consider the screenshot as my proposal for a level of verbosity for describe. Higher verbosity is provided when you specify -o yaml, which will give you all details.

hfjn · 2019-09-18T14:14:07Z

@MrTrustworthy I feel like it doesn't need to be as "speaking" as in your example. I think that might be a pain to maintain for all commands.

MrTrustworthy · 2019-09-19T07:45:57Z

Proposal:

ConsumerGroup pia-mabaya-unit
        active members: 0
        topics: ['shop-product_export_unit'] 
        partitions: 17
        offsets:
               min: 894
               avg: 1908.7
               max: 4052
               total: 32436
        total lag: 90920

swenzel · 2019-09-19T08:48:15Z

I'm not sure if min/max/avg over offsets is a useful metric. You'll have to know the watermarks of the corresponding topics for those metrics to be meaningful. I'd propose to use the current relative position in percent between low and high watermark. I.e ((offset-low_water)/(high_water-low_water))*100. This way you can immediately see where the consumer is within the topic. You can then calculate the min/max/avg over those percentages.
Apart from that I also like the less verbose yaml like output 👍

MrTrustworthy · 2019-09-19T08:55:51Z

I really like that idea! Not sure how to call that property yet, but something like Consumergroup Progress: 99.7% seems like a good, compact piece of information.

Still, a relative measure will have to be interpreted differently depending on the total size of the topic. 98% on a topic with 100 messages isn't cause for worry, but on a topic with 100 billion entries over multiple years, it represents a considerable amount of lag. So I'd say we have to pair at least one absolute metric in addition to the progress. Either total lag or topic size (aka sum(high_watermarks)) would be a good choice, or both.

swenzel · 2019-09-19T08:59:01Z

Agreed. I'd also keep the lag. Maybe do min/max/avg on that, too. Then you can see if it's only for a few partitions or for all of them.
Could also make sense to do the statistics per topic, since message counts may vary significantly.

MrTrustworthy · 2019-09-19T09:04:47Z

Allright, I'll play around a bit and try to find a combination that makes the most sense when viewed by a user.

One note about the progress calculation (offset-low_water)/(high_water-low_water))*100: I'm not sure how I feel about the inclusion of low_water in the calculation. On one hand, it gives a more precise piece of info because progress is always based on alive messages in the topic. On the other hand, everytime the topic gets retented/compacted, it will cause your progress to move back, which is kinda counter intuitive.

swenzel · 2019-09-19T09:33:12Z

I see what you mean...
Not including the low_watermark, however, will lead to very misleading results on very busy topics with retention. Consider low_watermark = 100k and high_watermark=101k.

Maybe progress is the wrong description then... how about relative_offset_location?

We could also invert the calculation (high_water-offset)/(high_water-low_water))*100 and call it relative_lag.
Color coding would also be nice: >=100% (critical, data loss), >=95% (warning, loss imminent), <95% (okay).

esque/cli/output.py

swenzel

Looks good, waiting for an update to the pretty_consumergroup_simple_overview, then everything is fine 👍

…-and-refactoring

hfjn · 2019-09-27T07:15:32Z

Added new output formatting and color-coded output. Anyone up for a review? @MrTrustworthy @swenzel @garrettthomaskth

hfjn · 2019-09-27T07:16:16Z

Ah that screenshot has a faked 100.00% lag to show of the color. :P

hfjn · 2020-02-15T10:05:15Z

I will close this for now since there haven’t been any updates in months. 🙂

MrTrustworthy added 6 commits September 13, 2019 17:20

consumergroups simplified, tests working, commands not yet

78f6207

consumergroups simplified, tests working, commands not yet

b1306b8

ok, so the first version kinda works now

89f8688

test fixes

f6ac0aa

actual integration tests for non-verbose output

873d497

output now better for simple output

b36ab70

MrTrustworthy requested review from hfjn and swenzel September 18, 2019 13:39

hfjn mentioned this pull request Sep 18, 2019

Add more tests #54

Merged

Merge branch 'master' into consumergroup-tests-and-refactoring

6946ec7

Bibob7 reviewed Sep 23, 2019

View reviewed changes

esque/cli/output.py Show resolved Hide resolved

swenzel suggested changes Sep 23, 2019

View reviewed changes

swenzel mentioned this pull request Sep 23, 2019

32 describe consumergroup show topic #35

Closed

Bibob7 and others added 6 commits September 24, 2019 11:30

Add missing import

5f04ed3

Merge remote-tracking branch 'origin/master' into consumergroup-tests…

380b69e

…-and-refactoring

Fix import

d581375

Fix imports

25ee729

Fix tests & update confluent

4868080

Adds color coded output

0796230

Introduce default colors

c1b0673

ognjen-j approved these changes Sep 27, 2019

View reviewed changes

hfjn closed this Feb 15, 2020

swenzel deleted the consumergroup-tests-and-refactoring branch April 7, 2020 11:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consumergroup tests and refactoring #53

Consumergroup tests and refactoring #53

MrTrustworthy commented Sep 18, 2019

MrTrustworthy commented Sep 18, 2019

hfjn commented Sep 18, 2019

MrTrustworthy commented Sep 18, 2019

hfjn commented Sep 18, 2019

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019 •

edited

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019 •

edited

swenzel left a comment

hfjn commented Sep 27, 2019

hfjn commented Sep 27, 2019

hfjn commented Feb 15, 2020

Consumergroup tests and refactoring #53

Consumergroup tests and refactoring #53

Conversation

MrTrustworthy commented Sep 18, 2019

MrTrustworthy commented Sep 18, 2019

hfjn commented Sep 18, 2019

MrTrustworthy commented Sep 18, 2019

hfjn commented Sep 18, 2019

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019 • edited

MrTrustworthy commented Sep 19, 2019

swenzel commented Sep 19, 2019 • edited

swenzel left a comment

Choose a reason for hiding this comment

hfjn commented Sep 27, 2019

hfjn commented Sep 27, 2019

hfjn commented Feb 15, 2020

swenzel commented Sep 19, 2019 •

edited

swenzel commented Sep 19, 2019 •

edited