Add metadata search index for more responsive FUSE #182

dnnr · 2015-01-22T19:51:45Z

This adds the construction of a metadata index during archive creation,
which can be used to narrow down the location of particular entries
within the items list. The FUSE mount uses this index to fetch only
those chunks that are relevant to the specific operation instead of
fetching the metadata of the entire archive.

As a result, using FUSE mounts of large archives is consirably more
reponsive. And more importantly, the performance isn't indirectly
proportional to the the archive size anymore. Any bulk operations that
require the full metadata tree anyways (such as running "find" on the
entire archive) are not negatively impacted.

For this to work, the filesystem traversal order had to be changed from
depth-first to breadth-first, which introduces the new metadata version
number 2. Any otherwise unrelated parts of the code and tests that
relied on the previous behavior are adjusted accordingly.

dnnr · 2015-01-22T20:58:21Z

This could be improved even further by adding a read-cache into attic. As far as I could tell, the Cache class is currently used for write access only, right? As of yet, my patch still fetches chunks repeatedly if (and only if) multiple archives are loaded that have intersecting entries their metadata['items'] list... which isn't unlikely for real datasets.

I wasn't sure though if this should be just slapped into the Cache class and used in do_mount, so I left that out for now.

This adds the construction of a metadata index during archive creation, which can be used to narrow down the location of particular entries within the items list. The FUSE mount uses this index to fetch only those chunks that are relevant to the specific operation instead of fetching the metadata of the entire archive. As a result, using FUSE mounts of large archives is consirably more reponsive. And more importantly, the performance isn't indirectly proportional to the the archive size anymore. Any bulk operations that require the full metadata tree anyways (such as running "find" on the entire archive) are not negatively impacted. For this to work, the filesystem traversal order had to be changed from depth-first to breadth-first, which introduces the new metadata version number 2. Any otherwise unrelated parts of the code and tests that relied on the previous behavior are adjusted accordingly.

dnnr force-pushed the metadata-index branch from ad3d3f4 to a8533b5 Compare March 6, 2015 23:28

dnnr force-pushed the metadata-index branch from a8533b5 to 6b64fc9 Compare March 6, 2015 23:29

maltefiala mentioned this pull request May 14, 2015

Dealing with attic issues borgbackup/borg#5

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add metadata search index for more responsive FUSE #182

Add metadata search index for more responsive FUSE #182

dnnr commented Jan 22, 2015

dnnr commented Jan 22, 2015

Add metadata search index for more responsive FUSE #182

Are you sure you want to change the base?

Add metadata search index for more responsive FUSE #182

Conversation

dnnr commented Jan 22, 2015

dnnr commented Jan 22, 2015