
Limit in-memory use for inodes #58

Merged: 6 commits into containers:main on Aug 23, 2022
Conversation

alexlarsson (Collaborator)

This limits the xattr data size to 4k and splits the dentry data into chunks of at most 4k each, which we read on demand. This drops the in-memory requirement for inodes a lot.

@rhvgoyal does this take care of your issues wrt memory use?

Note: For memory use I will also look at re-using xattr chunks in memory between inodes as they are already shared on disk. Should be rather easy.

This matches what ext4 does and saves some space.

Signed-off-by: Alexander Larsson <alexl@redhat.com>
This avoids the module being unloaded while the fs is mounted.

Signed-off-by: Alexander Larsson <alexl@redhat.com>
@alexlarsson (Collaborator, Author)

Wth is up with armv7:

  dump.c:254:24: error: format '%ld' expects argument of type 'long int', but argument 11 has type 'long long unsigned int' [-Werror=format=]
    254 |                 printf("name:%.*s|ino:%" PRIu64
        |                        ^~~~~~~~~~~~~~~~~
  ......
    259 |                        ino.st_size, (uint64_t)0,
        |                                     ~~~~~~~~~~~
        |                                     |
        |                                     long long unsigned int

Surely the printf format macro PRIu64 (defined in the system headers) should match uint64_t (also defined in the system headers).
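
On 32-bit armv7, long is 32 bits while uint64_t is unsigned long long, so a %ld directive that happens to line up with a 64-bit value compiles cleanly on x86_64 (where long is 64-bit) but trips -Werror=format there. A minimal sketch of the usual portable pattern, not the actual dump.c code: route every 64-bit argument through an explicit cast so it always matches the corresponding PRI* macro.

  /* Illustration only: print a stat size portably across 32- and 64-bit
   * targets by casting to uint64_t and formatting with PRIu64. */
  #include <inttypes.h>
  #include <stdio.h>
  #include <sys/stat.h>

  static void print_size(const struct stat *st)
  {
      /* The explicit cast keeps the argument type in sync with PRIu64
       * even where off_t is "long long" (e.g. 32-bit armv7). */
      printf("size:%" PRIu64 "\n", (uint64_t)st->st_size);
  }

  int main(void)
  {
      struct stat st;
      if (stat(".", &st) == 0)
          print_size(&st);
      return 0;
  }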

This changes the way a directory inode stores its dentry list.
Instead of a single list of all dentries followed by all the
filenames, we store a list of chunks, where each chunk holds at most
4k of dentries and names. Additionally there is a header at the start
with the size and dentry count of these chunks.

On the reader side we don't read (and allocate) all the dentries along
with the inode; instead we read the chunks as needed, meaning we use a
lot less memory per in-memory inode.

We preload the chunk headers for typically sized dirs (at most 4
chunks) as a cheap way to avoid having to re-read them. For larger
dirs we read the table as needed.

Since we don't read all dentries when creating the inode, we now also
use the st_nlink from the file rather than computing it from the
dentries.

Signed-off-by: Alexander Larsson <alexl@redhat.com>
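
A hypothetical sketch of the chunked layout described in the commit message above; the struct and field names are illustrative, not the actual composefs definitions.

  #include <stdint.h>

  #define DIR_CHUNK_MAX_SIZE 4096 /* each chunk holds at most 4k of dentries + names */

  struct dir_chunk {
      uint16_t n_dentries;   /* dentries stored in this chunk */
      uint16_t chunk_size;   /* bytes used, <= DIR_CHUNK_MAX_SIZE */
      uint32_t chunk_offset; /* where this chunk's dentries and names live */
  };

  struct dir_header {
      uint32_t n_chunks;         /* number of chunk descriptors that follow */
      struct dir_chunk chunks[]; /* preloaded when <= 4 chunks, read on demand otherwise */
  };

With a layout along these lines, the reader only needs the descriptor table plus the single chunk covering the directory position being iterated, rather than the whole dentry list.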
This means we're no longer using potentially unbounded kernel memory
for an inode's xattrs. In practice I don't think we'll see such large
xattrs anyway; they are mainly used to store things like ACLs, file
caps, or SELinux contexts.

Signed-off-by: Alexander Larsson <alexl@redhat.com>
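
A hypothetical illustration of the 4k cap described above; the constant name and error code are assumptions, not the exact code in this PR.

  #include <errno.h>
  #include <stddef.h>

  #define MAX_XATTRS_SIZE 4096

  /* Reject per-inode xattr data larger than the cap so a malformed image
   * cannot pin unbounded kernel memory for a single inode. */
  static int check_xattrs_size(size_t xattrs_len)
  {
      if (xattrs_len > MAX_XATTRS_SIZE)
          return -EINVAL;
      return 0;
  }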
@giuseppe (Member) left a comment:


LGTM

@alexlarsson merged commit 361f373 into containers:main on Aug 23, 2022