This repository was archived by the owner on Feb 16, 2024. It is now read-only.
[Merged by Bors] - Improve water-level demo #126
Closed
Conversation
Force-pushed from d746d6b to 12e94ce
sbernauer commented Sep 28, 2022
Force-pushed from 6739f9f to 6acc900
maltesander approved these changes Sep 29, 2022
LGTM!
bors r+
bors bot pushed a commit that referenced this pull request on Sep 30, 2022
Pull request successfully merged into main. Build succeeded.
Description
Run with:
`stackablectl --additional-demos-file demos/demos-v1.yaml --additional-stacks-file stacks/stacks-v1.yaml demo install nifi-kafka-druid-water-level-data`
Tested the demo with 2,500,000,000 records.
Hi all, here is a short summary of the observations from the water-level demo:
NiFi uses the content-repo PVC but keeps it at ~50% usage => should be fine forever.
Actions:
* Increase the content-repo PVC from 5 GB to 10 GB, better safe than sorry. I was able to crash it by using large queues and stalling processors (see the sketch below).
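A minimal sketch of what that size bump could look like, assuming the demo's NifiCluster exposes per-repository storage under `resources.storage` (the `contentRepo` key and the surrounding structure are assumptions, not taken from this PR):

```yaml
# Hypothetical excerpt of the demo's NifiCluster manifest; key names are assumptions.
apiVersion: nifi.stackable.tech/v1alpha1
kind: NifiCluster
metadata:
  name: nifi
spec:
  nodes:
    config:
      resources:
        storage:
          contentRepo:
            capacity: 10Gi  # bumped from 5Gi so large queues and stalled processors cannot fill the repo
    roleGroups:
      default:
        replicas: 1
```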
Kafka uses a PVC (currently 15 GB) => should work fine for ~1 week.
Actions:
* Look into retention settings (low priority, as it should work for ~1 week) so that it works forever (see the sketch below).
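One way that retention cap might be expressed, assuming the KafkaCluster CR accepts `configOverrides` for `server.properties` at the broker role level (the override placement and the concrete byte values are assumptions; the property names are standard Kafka broker settings):

```yaml
# Hypothetical excerpt of the demo's KafkaCluster manifest; values are illustrative only.
apiVersion: kafka.stackable.tech/v1alpha1
kind: KafkaCluster
metadata:
  name: kafka
spec:
  brokers:
    configOverrides:
      server.properties:
        log.retention.hours: "168"         # drop records older than one week
        log.retention.bytes: "2147483648"  # ~2 GiB per partition, to stay well under the 15 GB PVC
        log.retention.check.interval.ms: "300000"
    roleGroups:
      default:
        replicas: 1
```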
Druid uses S3 for deep storage (S3 has 15 GB). But currently it also caches *everything* locally on the historical, because we set
`druid.segmentCache.locations=[{"path"\:"/stackable/var/druid/segment-cache","maxSize"\:"300g"}]`
(hardcoded in https://github.com/stackabletech/druid-operator/blob/45525033f5f3f52e0997a9b4d79ebe9090e9e0a0/deploy/config-spec/properties.yaml#L725). This does *not* really affect the demo, as 100,000,000 records (let's call it data of ~1 week) take up ~400 MB.
I think the main problem with the demo is that queries take > 5 minutes to complete and Superset shows timeouts.
The historical pod suspiciously uses exactly one core of CPU, and the queries are really slow for a "big data" system IMHO.
This could be because Druid is only using a single core, or because we don't set any resources (yet!) and the node does not have more cores available. Going to research that.
Actions:
* Created stackabletech/druid-operator#306
* In the meantime, configure an override in the demo: `druid.segmentCache.locations=[{"path"\:"/stackable/var/druid/segment-cache","maxSize"\:"3g","freeSpacePercent":"5.0"}]` (see the sketch below)
* Research the slow query performance
* Have a look at the queries the Superset dashboard executes and optimize them
* Maybe we should bump the druid-operator version in the demo (e.g. create a release 22.09-druid, which is basically 22.09 with a newer druid-operator version). That way we get stable resources.
* Enable Druid auto compaction to reduce the number of segments
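A sketch of how that interim override might be wired into the demo, assuming the DruidCluster CR supports `configOverrides` for `runtime.properties` on the historicals role (the placement is an assumption; the value is the one quoted above, including its properties-file colon escaping):

```yaml
# Hypothetical excerpt of the demo's DruidCluster manifest; override placement is an assumption.
apiVersion: druid.stackable.tech/v1alpha1
kind: DruidCluster
metadata:
  name: druid
spec:
  historicals:
    configOverrides:
      runtime.properties:
        # Cap the local segment cache at 3g instead of the hardcoded 300g default
        druid.segmentCache.locations: '[{"path"\:"/stackable/var/druid/segment-cache","maxSize"\:"3g","freeSpacePercent":"5.0"}]'
    roleGroups:
      default:
        replicas: 1
```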
Review Checklist
Once the review is done, comment `bors r+` (or `bors merge`) to merge. Further information