Faster parents #16127

ktsaou · 2023-10-04T22:33:29Z

Current master on parent receiver:

This PR on parent receiver:

To understand the difference, compare the width of quoted_strings_splitter() in the two charts.
In the second chart this function (which hasn't changed in this PR) is many times wider than in the first.
Since its own cost is unchanged, its larger share means the rest of the code is now many times faster.

Simple optimizations to increase the efficiency of busy parents.

Cache the dbengine context, so that calling mrg all the time is avoided.
Cache rd and rd->id (as const char *) together with rda, to speed up dimension lookup by pluginsd.
Use 2 collected flags inside RRDDIM and RRDSET, to avoid calling rrdcontexts to update the collected status on every data collection.
mrg is now lockless for all operations.
Updated the streaming protocol with a new capability called SLOTS. The new protocol requires the sender to uniquely number all the RRDSET and RRDDIM it sends. The receiver uses these numbers to quickly find the RRDSET and RRDDIM pointers.
Fixed a bug where the EXPOSED flag in dimensions was allocated as a collector option (non-atomic), while it was also used by replication to check the status and by the sender to clear it on disconnect.
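The idea behind SLOTS can be illustrated with a minimal sketch: if the sender assigns each chart a small unique number, the receiver can keep an array of pointers indexed by that number and skip any name-based lookup on the hot path. The types and function names below (chart_t, slot_table_t, slot_table_set, slot_table_get) are hypothetical and only illustrate the technique; they are not the structures used in this PR.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a chart object; NOT Netdata's real RRDSET. */
typedef struct chart {
    const char *id;
} chart_t;

/* Receiver-side table: index (slot - 1) holds the cached chart pointer. */
typedef struct slot_table {
    chart_t **entries;
    size_t size;
} slot_table_t;

/* Arbitrary upper bound for this sketch, so a bogus slot cannot force a huge allocation. */
#define SLOT_MAX ((size_t)1 << 24)

/* Store a chart under the sender-assigned slot (1-based, 0 means "no slot").
 * Returns 0 on success, -1 on an invalid slot or allocation failure. */
static int slot_table_set(slot_table_t *t, size_t slot, chart_t *c) {
    assert(t != NULL);
    if (slot == 0 || slot > SLOT_MAX)
        return -1;

    if (slot > t->size) {
        size_t new_size = t->size ? t->size : 16;
        while (new_size < slot)
            new_size *= 2;

        chart_t **grown = realloc(t->entries, new_size * sizeof(*grown));
        if (!grown)
            return -1;

        /* Newly added slots start empty. */
        for (size_t i = t->size; i < new_size; i++)
            grown[i] = NULL;

        t->entries = grown;
        t->size = new_size;
    }

    t->entries[slot - 1] = c;
    return 0;
}

/* O(1) lookup by slot; returns NULL when the slot is unknown, so the
 * caller can fall back to a slower lookup by id. */
static chart_t *slot_table_get(const slot_table_t *t, size_t slot) {
    assert(t != NULL);
    if (slot == 0 || slot > t->size)
        return NULL;
    return t->entries[slot - 1];
}

/* Release the table (the charts themselves are owned elsewhere). */
static void slot_table_free(slot_table_t *t) {
    free(t->entries);
    t->entries = NULL;
    t->size = 0;
}
```

Returning NULL for an unknown slot leaves room for a fallback path, which matters when the sender does not support the capability or when slots are renumbered after a child restart.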

Comparison: 2.7 million metrics per second, Netdata vs Prometheus

In this setup, both Netdata and Prometheus are configured to collect the same 2.5 million metrics per second from 500 Netdata children. To test similar functionality, we disabled ML and Health in netdata.conf on the Netdata parent.

CPU utilization

Netdata needs about 2 CPU cores per million metrics
Prometheus needs about 3 CPU cores per million metrics, with frequent spikes at 14+ CPU cores.

Prometheus has a huge spike every 2 minutes, utilizing almost all CPU cores available on the system (both VMs have 24 cores available).

Memory consumption

As far as memory consumption is concerned:

Netdata uses 40GiB (after we added 16GiB main cache and 8GiB extent cache)
Prometheus uses 30GiB

Disk footprint

Application	Tier	On disk	Retention
Netdata	tier 0	625GiB	7 days
Netdata	tier 1	285GiB	14 days
Netdata	tier 2	114GiB	90 days
Prometheus	-	3TiB	7 days

Netdata total, including data and metadata is 1 TiB.
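As a quick check of the figures above, the following sketch sums the three Netdata tiers and compares Prometheus with Netdata tier 0, since both hold 7 days of data. It only uses the numbers from the table; the function names are made up for illustration, and the 3TiB Prometheus figure is treated as exactly 3 x 1024 GiB, although it is probably rounded.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Netdata tier sizes in GiB, copied from the table above. */
static const double netdata_tier_gib[] = { 625.0, 285.0, 114.0 };

/* Prometheus footprint in GiB; assumes "3TiB" means exactly 3 * 1024 GiB. */
static const double prometheus_gib = 3.0 * 1024.0;

/* Sum of all Netdata tiers, in GiB. */
static double netdata_total_gib(void) {
    double total = 0.0;
    size_t n = sizeof(netdata_tier_gib) / sizeof(netdata_tier_gib[0]);
    for (size_t i = 0; i < n; i++)
        total += netdata_tier_gib[i];
    return total;
}

/* How many times larger Prometheus is than Netdata tier 0 (both 7 days). */
static double prometheus_vs_tier0_ratio(void) {
    return prometheus_gib / netdata_tier_gib[0];
}

/* Print both derived figures. */
static void print_disk_summary(void) {
    printf("Netdata total: %.0f GiB\n", netdata_total_gib());
    printf("Prometheus / Netdata tier 0: %.2fx\n", prometheus_vs_tier0_ratio());
}
```

The tiers add up to 1024 GiB, which matches the 1 TiB total, and for the same 7 days of retention Prometheus uses a little under 5 times the disk space of Netdata tier 0.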

Disk I/O

Each of the VMs has its own physical disk (so that we can measure the disk I/O of each VM). In the following screenshot, Prometheus is using sdd and Netdata is using zd16:

As you can see, Prometheus is really stressing the disks at this scale, possibly due to its WAL. Netdata achieves the same safety against data loss by re-streaming its metrics to another Netdata Parent (when configured to do so).

Network bandwidth

Netdata reception is 380Mbps.
Prometheus reception is 240Mbps.

Netdata uses LZ4 compression on a much more compact protocol, while Prometheus uses gzip/deflate on a chattier one. However, the compression efficiency of gzip is considerably higher than that of LZ4.

In PR #16268 we add ZSTD streaming support in Netdata, to see how its bandwidth changes.

ktsaou · 2023-10-08T22:00:04Z

@stelfrag this is ready for merge. If you can't find an issue, let's merge it.

@ilyam8 I have installed it on lab-parent2. Please install it on all its children and stop lab-parent3 to see the difference in action. It should be way faster than before.

ktsaou requested review from thiagoftsm and vkalintiris as code owners October 4, 2023 22:33

github-actions bot added area/collectors Everything related to data collection area/database collectors/plugins.d area/daemon area/tests area/streaming labels Oct 4, 2023

ktsaou force-pushed the faster-parents branch from 0d97a1d to a0b3a1d Compare October 23, 2023 08:52

github-actions bot added collectors/cgroups collectors/diskspace collectors/proc area/exporting labels Oct 23, 2023

ktsaou added 16 commits October 25, 2023 20:51

cache ctx in collection handle

9985552

cache rd together with rda

9687d16

do not repeatedy call rrdcontexts - cached collection status; optimize pluginsd_acquire_dimension()

76f11e3

fix unit tests

e6bc91a

do the absolutely minimum while updating timestamps, ensure validity during reading them

4a9a275

when the stream is INTERPOLATED, buffer outstanding data for up to 50ms if the buffer contains DATA only.

88c36d6

remove the spinlock from mrg

f885fd1

remove the metric flags that are not used any more

c8c6f85

mrg writers can be different threads

0f8f618

update first time when latest clean is also updated

61eada1

cleanup

169d6e7

set hot page with a simple atomic operation

7903a4d

sender sets chart slot for every chart

4a927c6

work on senders without SLOT

e6da6eb

enable SLOT capability

a7aed33

send slot at BEGIN when SLOT is enabled

7312b2e

ktsaou added 23 commits October 25, 2023 20:54

we need the dimension slot at the DIMENSION keyword

9a57866

more debug info in case of dimension mismatch

f646fa3

ensure the RRDDIM EXPOSED flag is multi-threaded and set it after the sender buffer has been committed, so that replication will not send dimensions prematurely

ab1779c

fix renumbering on child restart

97a457d

reset rda caching when receiving a chart definition

49b5f60

optimize pluginsd_end_v2()

e60060d

do not do zero sized allocations

153b4eb

trust the chart slot id of the child

e0d4e1c

cleanup charts on pluginsd thread exit

3e07c86

better cleanup

f914710

find the chart and put it in the slot, if it not already there

1181731

move slots array to host

6005b21

initialize pluginsd slots properly

4eaa42c

add slots to replay begin; do not cleanup slots that dont belong to a chart

5321ad1

cleanup on obsolete

336c473

cleanup slots on obsoletions

14f5959

cleanup and renames about obsoletion

93a3ffc

rewrite obsolation service code to remove race conditions

d4c726a

better service obsoletion log

87723b3

added debugging

22210b3

more debug

a9b0e99

exposed flag now compares versions

ee422da

removed debugging messages

ddb1fda

ktsaou force-pushed the faster-parents branch from 1ad4751 to ddb1fda Compare October 25, 2023 17:54

ilyam8 mentioned this pull request Oct 27, 2023

[Feat]: add zstd to packaging #16283

Closed

4 tasks

ktsaou added 3 commits October 27, 2023 17:41

merged to master

8f8e598

respolve conflicts

0d63196

fix replication check for unsent dimensions

f08bbd0

stelfrag approved these changes Oct 27, 2023

ktsaou merged commit 2175104 into netdata:master Oct 27, 2023
148 of 149 checks passed
