OAK-10494 - Add cache to reduce number of remote blobstore calls. #1155

Open · wants to merge 20 commits into trunk

Conversation

ahanikel (Contributor)

The cache size should be made configurable and I should probably add some tests, too, but we can still start the discussion.

ahanikel marked this pull request as ready for review on October 17, 2023, 08:30
ahanikel (Contributor, Author)

@amit-jain Could you have a look when you have time? Thanks a lot! The test is kinda stupid but I couldn't think of a better one...

amit-jain (Contributor)

@ahanikel the PR looks good but I am not sure if the invalidation also has to happen somewhere.
The tests are in CachingDataStoreTest for this class.

Overall, I think the call need not go to the backend in case the corresponding file is available in the cache.

ahanikel (Contributor, Author) commented Oct 18, 2023

@amit-jain I think the least recently used entries are automatically evicted from the cache when maxSize is reached, but I've still added an expiration of 15 minutes, so we don't hold on to memory unnecessarily.
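
For reference, a minimal sketch of a Guava cache configured along these lines (the field name recordCache follows the PR; the maximum size of 500 is a placeholder, since the size still has to be made configurable):

import java.util.concurrent.TimeUnit;
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;
import org.apache.jackrabbit.core.data.DataRecord;

// Size-bounded cache that evicts the least recently used entries once
// maximumSize is reached, plus a 15-minute idle expiry to free memory.
Cache<String, DataRecord> recordCache = CacheBuilder.newBuilder()
        .maximumSize(500)                        // placeholder; to be made configurable
        .expireAfterAccess(15, TimeUnit.MINUTES) // drop entries idle for 15 minutes
        .build();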

I've added my test to the tests in CachingDataStoreTest. I've also tried to use Mockito but it does not seem to work in this case (modifying the class variable seems to be ignored somehow in the mocking process).

Overall, I think the call need not go to the backend in case the corresponding file is available in the cache.

Yes, but my understanding is that we try to avoid loading the blob if we don't need it, so if we only call backend.getRecord() the record never ends up in the existing cache, and getRecordIfStored() therefore always falls back to backend.getRecord(). If my understanding is wrong, then there must be something else at play, because the recordCache has a measurable impact on performance (the details are unfortunately on my internal git.corp account; the link is in GRANITE-47685).
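
For illustration, the lookup pattern being described is roughly the following (a hypothetical sketch, not the actual code on the branch; the real getRecordIfStored also consults the local file cache, and types come from org.apache.jackrabbit.core.data):

// Check the in-memory record cache first; only fall back to the remote
// backend on a miss, and remember the result for later calls.
public DataRecord getRecordIfStored(DataIdentifier identifier) throws DataStoreException {
    String id = identifier.toString();
    DataRecord cached = recordCache.getIfPresent(id);
    if (cached != null) {
        return cached;                                 // no remote blobstore call
    }
    DataRecord record = backend.getRecord(identifier); // remote call on cache miss
    if (record != null) {
        recordCache.put(id, record);
    }
    return record;
}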

@@ -638,4 +642,165 @@ private void waitFinish() {
e.printStackTrace();
}
}
private AbstractSharedCachingDataStore createDataStore() throws Exception {
amit-jain (Contributor)

Can we not use the existing dummy objects to construct the datastore? For the variables, we can create instance variables instead of static variables to make it easier to inject.

ahanikel (Contributor, Author)

@amit-jain I don't know what I was thinking when I used static variables for the recordCache configuration, sorry about that. I'm using the existing test datastore now, but I had to adapt it a bit to simulate a delay on (remote) backend requests. Could you have another look? Thank you very much!

LOG.trace("" + dataStore.getRecordIfStored(di)); // LOG.trace to avoid the call being optimised away
}
long timeCached = System.nanoTime() - start;

amit-jain (Contributor)

Ah! Now I understand why you are adding a delay. Assertions based on timing can be quite fragile. Can't we just assert that the cache object is not empty and that the record with the identifier is present?

ahanikel (Contributor, Author)

Yes, I wanted to "prove" the effectiveness of the cache. Although I only checked for a 5x improvement where the difference should be at least 100x for this test case, I agree it is still fragile and should be avoided.

I've changed the test to only check that the record is in the cache after the first access, and to ensure that it is loaded from the cache when accessed a second time. Is that what you meant?

amit-jain (Contributor) commented Nov 2, 2023

Yes. If you need to add test performance numbers, maybe you can also add them to oak-benchmarks.

assertNotNull("Record with id " + id + " should be in the recordCache now",
dataStore.recordCache.get().getIfPresent(id));
// make sure the record is loaded from the cache
backend.deleteRecord(di);
amit-jain (Contributor)

I think it's better to go through AbstractSharedCachingDataStore#deleteRecord. That way we can also assert that the record is not available in the cache. The backend's deleteRecord wouldn't be called directly, imo.

ahanikel (Contributor, Author)

@amit-jain The thing is that I'm deleting the record from the cache in AbstractSharedCachingDataStore#deleteRecord as well (https://github.com/ahanikel/jackrabbit-oak/blob/issues/OAK-10494/oak-blob-plugins/src/main/java/org/apache/jackrabbit/oak/plugins/blob/AbstractSharedCachingDataStore.java#L331), so the following assert would fail.
The reason I'm deleting from the backend here is just to prove that the record is actually read from the cache.

ahanikel (Contributor, Author)

Ah, you meant to make sure the record is not read from the existing cache either? I've added an invalidate for that in the commit below.

amit-jain (Contributor)

What I meant was: if you call AbstractSharedCachingDataStore#deleteRecord, then invalidate is called implicitly, and the test then also covers that change.
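
In other words, something like the following shape (a sketch of the delete path only; the actual implementation is the one linked above on the PR branch and may differ in detail):

// Deleting through the datastore invalidates the record cache implicitly,
// so a test going through this method also covers the invalidation.
public void deleteRecord(DataIdentifier identifier) throws DataStoreException {
    recordCache.invalidate(identifier.toString()); // drop cached record metadata first
    backend.deleteRecord(identifier);              // then delete from the remote backend
}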

dataStore.cache.invalidate(id);
assertNull("Record with id " + id + " should not be in the backend anymore",
backend.getRecord(di));
assertNotNull("The record could not be loaded from the cache",
amit-jain (Contributor)

This has to be changed to assertNull, because deleteRecord (and with it the cache invalidation) would have been called.

ahanikel (Contributor, Author)

Ah, you want to make sure that the record is no longer in the cache after deletion, right? I'll add a check for that in the next commit. The check above is there to ensure that the record is loaded from the cache, without touching the backend, when accessed a second time. That's why I'm deleting it from the backend first; that way I can be sure it comes from the cache.
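
To make the intended flow explicit, a rough outline of the test being discussed (di is the record's DataIdentifier and id its string form, as in the snippets above; the exact setup lives in CachingDataStoreTest):

// 1. First access populates the record cache.
dataStore.getRecordIfStored(di);
assertNotNull(dataStore.recordCache.get().getIfPresent(id));
// 2. Delete from the backend only; a second access must still succeed,
//    which proves the record is served from the record cache.
backend.deleteRecord(di);
dataStore.cache.invalidate(id); // rule out the existing file cache as well
assertNotNull(dataStore.getRecordIfStored(di));
// 3. Delete through the datastore; the cache entry must be gone too.
dataStore.deleteRecord(di);
assertNull(dataStore.recordCache.get().getIfPresent(id));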
