netdata for IoT

Costa Tsaousis edited this page Feb 27, 2018 · 42 revisions

image1

New to netdata? Check its demo: http://my-netdata.io/

User Base Monitored Servers Sessions Served

New Users Today New Machines Today Sessions Today


netdata is a very efficient server performance monitoring solution. When running in server hardware, it can collect thousands of system and application metrics per second with just 1% CPU utilization of a single core. Its web server responds to most data requests in about half a millisecond making its web dashboards spontaneous, amazingly fast!

netdata can also be a very efficient real-time monitoring solution for IoT devices (RPIs, routers, media players, wifi access points, industrial controllers and sensors of all kinds). Netdata will generally run everywhere a Linux kernel runs (and it is glibc and musl-libc friendly).

You can use it as both a data collection agent (where you pull data using its API), for embedding its charts on other web pages / consoles, but also for accessing it directly with your browser to view its dashboard.

The netdata web API already provides reduce functions allowing it to report average and max for any timeframe. It can also respond in many formats including JSON, JSONP, CSV, HTML. Its API is also a google charts provider so it can directly be used by google sheets, google charts, google widgets.

sensors

Although netdata has been significantly optimized to lower the CPU and RAM resources it consumes, the plethora of data collection plugins may be inappropriate for weak IoT devices.

keep in mind that netdata on RPi 2 and 3 does not require any tuning. The default settings will be good. The following tunables apply only when running netdata on RPi 1 or other very weak IoT devices.

Here are a few tricks to control the resources consumed by netdata:

1. Disable External plugins

External plugins can consume more system resources than the netdata server. Disable the ones you don't need.

Edit /etc/netdata/netdata.conf, find the [plugins] section:

[plugins]
	proc = yes

	tc = no
	idlejitter = no
	cgroups = no
	checks = no
	apps = no
	charts.d = no
	node.d = no

	plugins directory = /usr/libexec/netdata/plugins.d
	enable running new plugins = no
	check for new plugins every = 60

In detail:

plugin description
proc the internal plugin used to monitor the system. Normally, you don't want to disable this. You can disable individual functions of it at the next section.
tc monitoring network interfaces QoS (tc classes)
idlejitter internal plugin (written in C) that attempts show if the systems starved for CPU. Disabling it will eliminate a thread.
cgroups monitoring linux containers. Most probably you are not going to need it. This will also eliminate another thread.
checks a debugging plugin, which is disabled by default.
apps a plugin that monitors system processes. It is very complex and heavy (heavier than the netdata daemon), so if you don't need to monitor the process tree, you can disable it.
charts.d BASH plugins (squid, nginx, mysql, etc). This is again a heavy plugin.
node.d node.js plugin, currently used for SNMP data collection and monitoring named (the name server).

For most IoT devices, you can disable all plugins except proc. For proc there is another section that controls which functions of it you need. Check the next section.


2. Disable internal plugins

In this section you can select which modules of the proc plugin you need. All these are run in a single thread, one after another. Still, each one needs some RAM and consumes some CPU cycles.

[plugin:proc]
	# /proc/net/dev = yes                       # network interfaces
	# /proc/diskstats = yes                     # disks
	# /proc/net/snmp = yes                      # generic IPv4
	# /proc/net/snmp6 = yes                     # generic IPv6
	# /proc/net/netstat = yes                   # TCP and UDP
	# /proc/net/stat/conntrack = yes            # firewall
	# /proc/net/ip_vs/stats = yes               # IP load balancer
	# /proc/net/stat/synproxy = yes             # Anti-DDoS
	# /proc/stat = yes                          # CPU, context switches
	# /proc/meminfo = yes                       # Memory
	# /proc/vmstat = yes                        # Memory operations
	# /proc/net/rpc/nfsd = yes                  # NFS Server
	# /proc/sys/kernel/random/entropy_avail = yes # Cryptography
	# /proc/interrupts = yes                    # Interrupts
	# /proc/softirqs = yes                      # SoftIRQs
	# /proc/loadavg = yes                       # Load Average
	# /sys/kernel/mm/ksm = yes                  # Memory deduper
	# netdata server resources = yes            # netdata charts

3. Disable logs

Normally, you will not need them. To disable them, set:

[global]
	debug log = none
	error log = none
	access log = none

4. Set memory mode to RAM

Setting the memory mode to ram will disable loading and saving the round robin database. This will not affect anything while running netdata, but it might be required if you have very limited storage available.

[global]
	memory mode = ram

5. CPU utilization

If after disabling the plugins you don't need, netdata still uses a lot of CPU without any clients accessing the dashboard, try lowering its data collection frequency. Going from "once per second" to "once every two seconds" will not have a significant difference on the user experience, but it will cut the CPU resources required in half.

To set the update frequency, edit /etc/netdata/netdata.conf and set:

[global]
      update every = 2

You may have to increase this to 5 or 10 if the CPU of the device is weak.

Keep in mind this will also force dashboard chart refreshes to happen at the same rate. So increasing this number actually lowers data collection frequency but also lowers dashboard chart refreshes frequency.

This is a dashboard on a device with [global].update every = 5 (this device is a media player and is now playing a movie):

pi1


6. Lower memory requirements

You can set the default size of the round robin database for all charts, using:

[global]
      history = 600

The units for history is [global].update every seconds. So if [global].update every = 6 and [global].history = 600, you will have an hour of data ( 6 x 600 = 3.600 ), which will store 600 points per dimension, one every 6 seconds.

Check also Memory Requirements for directions on calculating the size of the round robin database.


7. Disable gzip compression of responses

Gzip compression of the web responses is using more CPU that the rest of netdata. You can lower the compression level or disable gzip compression completely. You can disable it, like this:

[web]
	enable gzip compression = no

To lower the compression level, do this:

[web]
	enable gzip compression = yes
	gzip compression level = 1

Finally, if no web server is installed on your device, you can use port tcp/80 for netdata:

[global]
	port = 80

8. Monitoring RPi temperature

The python version of the sensors plugin uses lm-sensors. Unfortunately the temperature reading of RPi are not supported by lm-sensors.

netdata also has a bash version of the sensors plugin that can read RPi temperatures. It is disabled by default to avoid the conflicts with the python version.

To enable it, edit /etc/netdata/charts.d.conf and uncomment this line:

sensors=force

Then restart netdata. You will get this:

image

General


Running Netdata

Special Uses

Notes on memory management


Database Replication and Mirroring


Backends
archiving netdata collected metrics to a time-series database


Health monitoring - Alarms
alarms and alarm notifications in netdata


Netdata Registry


Monitoring Info


Netdata Badges


Data Collection

Binary Modules

Python Modules

Node.js Modules

BASH Modules

Active BASH Modules

Obsolete BASH Modules

JAVA Modules


API Documentation


Web Dashboards


Running behind another web server


Package Maintainers


Donations


Blog


Other monitoring tools

Clone this wiki locally
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.
Press h to open a hovercard with more details.