Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache #7067

fm3 · 2023-05-10T13:03:35Z

The cache for remote dataset array contents can now have a configured size in bytes. New config option datastore.cache.imageArrayChunks.maxSizeBytes. Default is 2 GB, consider increasing for production.
For this, one cache object is shared between the dataset arrays
Switches from akka cache to Caffeine directly (akka uses caffeine internally as well), allowing to add cache item weighing (weight is now set to bytes per chunk for array chunks)
Unified some caching usages / naming (no more distinction between AlfuCache and AlfuFoxCache)

TODO

use shared cache also in Precomputed + Neuroglacner classes
include vault path in cache key
how to clear cache for individual datasets?
null pointer exception for loading volume annotation buckets

URL of deployed dev instance (used for testing):

https://cacheweight.webknossos.xyz

Steps to test:

Load some datasets, should still show
annotate some, should still show
use reload button and look for logging (cache chunks should be removed, but only for the reloaded layer)

Issues:

fixes Share chunkContents cache across multiple arrays #6630
contributes to Unify backend-internal caching #6969

Updated changelog
Considered common edge cases
Needs datastore update after deployment

…ective cache clear

frcroth

Maybe also change PR name?

CHANGELOG.unreleased.md

util/src/main/scala/com/scalableminds/util/cache/LRUConcurrentCache.scala

...nossos-datastore/app/com/scalableminds/webknossos/datastore/services/BinaryDataService.scala

frcroth · 2023-06-12T15:17:54Z

...nossos-datastore/app/com/scalableminds/webknossos/datastore/services/BinaryDataService.scala

+
+    bucketProviderCache.clear(bucketProviderPredicate)
+
+    def chunkContentsPredicate(key: String): Boolean =


Not totally happy with this being placed here, since the definition of the key is somewhere else. Maybe move this predicate to where the key is defined and only use it here?

Hmm, yes I see what you mean. However, the same would be the case for the other two predicate functions here. So we could either extract each of those so that there is only one place for each cache, where cache key building happens. This would lead to less distributed knowledge on how the keys work. However, I also like all of these methods being together here, since it is less distributed knowledge about the fact that these caches should be cleared dataset-wise or layer-wise. So it’s some fragmentation either way. If you’re ok with it, I’d leave it as it is for the moment.

…esign-right-sidebar * 'master' of github.com:scalableminds/webknossos: added Youtube videos to docs Log dataset uploads (with no conversion) to slack (#7157) Added "Automation Tutorial" to docs (#7160) fix logo image in README.md Second try for “Async IO for HttpsDataVault, Fox Error Handling” (#7155) Revert "Async IO for HttpsDataVault, Fox Error Handling (#7137)" (#7154) Async IO for HttpsDataVault, Fox Error Handling (#7137) Fix vault path for precomputed datasets (#7151) Add extended keyboard shortcut mode via ctrl + k for tool shortcuts (#7112) Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache (#7067)

CacheWeight for AlfuCache

a621236

fm3 self-assigned this May 10, 2023

fm3 added 3 commits May 10, 2023 16:29

removal callback function, cleanup

35911b2

WIP: share chunk contents cache

742574d

pass shared cache around

dec7efb

fm3 added enhancement backend refactoring labels May 12, 2023

fm3 added 5 commits May 31, 2023 14:38

merge master into cache-weight

7341927

Merge branch 'master' into cache-weight

1f10d75

pass datasource id and layer name all the way to DatasetArray for eff…

ef8d7e9

…ective cache clear

make cache size configurable

58bff3d

null check

9c1c156

fm3 marked this pull request as ready for review June 12, 2023 14:34

fm3 requested a review from frcroth June 12, 2023 14:45

frcroth reviewed Jun 12, 2023

View reviewed changes

fm3 added 2 commits June 13, 2023 10:06

Merge branch 'master' into cache-weight

3fb1a6d

PR feedback

d50c75b

fm3 requested a review from frcroth June 13, 2023 08:22

frcroth approved these changes Jun 13, 2023

View reviewed changes

fm3 changed the title ~~CacheWeight for AlfuCache~~ Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache Jun 14, 2023

Merge branch 'master' into cache-weight

5ce1207

fm3 mentioned this pull request Jun 14, 2023

Async IO for HttpsDataVault, Fox Error Handling #7137

Merged

15 tasks

fm3 merged commit b91b15f into master Jun 14, 2023
2 checks passed

fm3 deleted the cache-weight branch June 14, 2023 11:07

fm3 mentioned this pull request Jul 17, 2023

Unify backend-internal caching #6969

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache #7067

Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache #7067

fm3 commented May 10, 2023 •

edited

frcroth left a comment

frcroth Jun 12, 2023

fm3 Jun 13, 2023


		bucketProviderCache.clear(bucketProviderPredicate)

		def chunkContentsPredicate(key: String): Boolean =

Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache #7067

Shared Chunk Cache for all DatasetArrays, CacheWeight for AlfuCache #7067

Conversation

fm3 commented May 10, 2023 • edited

TODO

URL of deployed dev instance (used for testing):

Steps to test:

Issues:

frcroth left a comment

Choose a reason for hiding this comment

frcroth Jun 12, 2023

Choose a reason for hiding this comment

fm3 Jun 13, 2023

Choose a reason for hiding this comment

fm3 commented May 10, 2023 •

edited