
scylla does not start when kernel inotify limits are exceeded #7700

Closed
avikivity opened this issue Nov 25, 2020 · 7 comments

@avikivity
Member

Each TLS instance consumes an inotify instance (limited by the fs.inotify.max_user_instances sysctl), and there can be multiple TLS instances per shard. A large machine can run out and will then fail to start. The default limit is 128, which covers only 64 shards when both RPC and CQL are encrypted.

@avikivity changed the title from "scylla can exhaust inotify kernel limits" to "scylla does not start when kernel inotify limits are exceeded" on Nov 25, 2020
avikivity added a commit to avikivity/scylladb that referenced this issue Nov 25, 2020
Since f3bcd4d ("Merge 'Support SSL Certificate Hot
Reloading' from Calle"), we reload certificates as they are
modified on disk. This uses inotify, which is limited by a
sysctl fs.inotify.max_user_instances, with a default of 128.

This is enough for 64 shards only, if both rpc and cql are
encrypted; above that startup fails.

Increase to 1200, which is enough for 6 instances * 200 shards.

Fixes scylladb#7700.
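
For reference, the kernel limit involved can be inspected and raised by hand; a minimal sketch, assuming a systemd-style /etc/sysctl.d drop-in (the file name below is illustrative, not necessarily the one Scylla's packaging writes):

```sh
# Show the current per-user limit on inotify instances (kernel default: 128)
sysctl fs.inotify.max_user_instances

# Raise it immediately, using the value chosen in the commit above
sudo sysctl -w fs.inotify.max_user_instances=1200

# Persist it across reboots; the drop-in file name here is illustrative
echo 'fs.inotify.max_user_instances = 1200' | sudo tee /etc/sysctl.d/99-inotify-instances.conf
sudo sysctl --system
```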
@psarna
Contributor

psarna commented Nov 26, 2020

Maybe this error should also not be fatal, and instead print an error message in the logs? I guess it depends on whether the mechanisms that rely on inotify also allow other ways of reloading the observed files (e.g. via a signal, REST, or similar).

@avikivity
Member Author

@psarna it threw an exception, but the exception was converted to an assert() when a sharded<> instance was destroyed incorrectly. @elcallio promised to fix that.

@avikivity
Member Author

Backported to 4.1, 4.2, 4.3.

avikivity added a commit that referenced this issue Nov 29, 2020
Since f3bcd4d ("Merge 'Support SSL Certificate Hot
Reloading' from Calle"), we reload certificates as they are
modified on disk. This uses inotify, which is limited by a
sysctl fs.inotify.max_user_instances, with a default of 128.

This is enough for 64 shards only, if both rpc and cql are
encrypted; above that startup fails.

Increase to 1200, which is enough for 6 instances * 200 shards.

Fixes #7700.

Closes #7701

(cherry picked from commit 390e07d)
avikivity added two more commits that referenced this issue on Nov 29, 2020, with the same commit message (the remaining backports, also cherry picked from commit 390e07d).
@elcallio
Contributor

Question: should we try to address this at the seastar level? While the basic problem of shard multiplication cannot be solved, we could maybe mitigate it for the usage pattern of a shard-shared credentials builder generating a reloadable credentials object per shard.
With some (terrible) juggling of foreign pointers it should be possible to make only a single shard actually use inotify, and have the rest cross-shard subscribe to the originating shard's notifications:
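
A minimal sketch of that shape, assuming Seastar's smp::invoke_on_all for the cross-shard fan-out; the on_certs_reloaded hook and the simulated trigger are hypothetical placeholders, not the actual seastar::tls reloading API:

```cpp
#include <seastar/core/app-template.hh>
#include <seastar/core/future.hh>
#include <seastar/core/smp.hh>
#include <seastar/core/reactor.hh>
#include <iostream>

using namespace seastar;

// Hypothetical per-shard hook: refresh this shard's credentials object
// from the shared, already-reloaded state.
static void on_certs_reloaded() {
    std::cout << "shard " << this_shard_id() << ": refreshing credentials\n";
}

// Runs only on the shard that owns the single inotify instance (shard 0 here):
// when its watcher fires, fan the notification out to every shard.
static future<> notify_all_shards() {
    return smp::invoke_on_all([] {
        on_certs_reloaded();
    });
}

int main(int argc, char** argv) {
    app_template app;
    return app.run(argc, argv, [] {
        // Simulate: shard 0's inotify watcher just reported a modified cert file.
        return notify_all_shards();
    });
}
```

In a real version, shard 0 would own the single inotify-backed watcher (or the polling fallback mentioned below), and the per-shard hook would rebuild that shard's reloadable credentials from the shared builder state.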

There will be a lot of cross-shard calls when stuff changes, but...

We can also add a fallback option for the originating shard's reloader to use polling iff inotify is not available.

@avikivity
Member Author

We could, but it's a huge amount of work compared to writing to a sysctl file.

@elcallio
Contributor

I take that as a down-prioritization of the idea.

@avikivity
Member Author

Yes.
