
feat: transaction caching #252

Merged
merged 32 commits into hyperledger:main on Feb 6, 2024

Conversation

wadeking98
Contributor

@wadeking98 wadeking98 commented Jan 18, 2024

Added functionality to cache read requests to the ledger.
I've added functionality to store the cache in memory or on disk, in either a TTL or an LRU style cache. I've left the implementation extensible so users can easily implement storing the cache in a db, Redis, or whatever.

Demo:

The cache treats values as valid for 24 hours. Add the --use-cache flag to enable the caching functionality. If the cache path isn't supplied, an in-memory cache is used instead of the file system; if the cache size is not specified, a cache of size 1000 is used:
./target/debug/indy-vdr-proxy -g ./genesis.txn --use-cache --cache-size 1024 --cache-path /dev/shm/vdr-cache -p 3000

Cache:

[screenshot: cache]

No Cache:

[screenshot: no-cache]

With caching enabled, a request to look up a schema takes under 32 milliseconds and is sometimes as fast as 7 milliseconds. With no cache enabled, the same request takes over 135 milliseconds and up to almost a second.

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@swcurran
Member

For all but RevRegEntries, expiring the cache is not needed since the transactions are immutable. Would it make sense (is it possible?) to have a fixed (maximum) sized cache with an LRU expiration policy?

Just as a matter of interest, what is the approach on RevRegEntries for finding cache hits? It seems unlikely there would be cache hits for RevRegEntry transactions, since the combination of from, to intervals would have to be the same.

@wadeking98
Contributor Author

wadeking98 commented Jan 18, 2024

> For all but RevRegEntries, expiring the cache is not needed since the transactions are immutable. Would it make sense (is it possible?) to have a fixed (maximum) sized cache with an LRU expiration policy?
>
> Just as a matter of interest, what is the approach on RevRegEntries for finding cache hits? It seems unlikely there would be cache hits for RevRegEntry transactions, since the combination of from, to intervals would have to be the same.

I like the idea of not having a time-based expiration. The only thing with the LRU approach is that cache get requests will have to mutate the cache as well (to mark an entry as recently used). I don't really have a problem with this, but it might look a little weird to end users if get requests on the cache mutate things. Still, I think it's a good idea.

In terms of the revocation registry, I think it's still a good idea to cache. For example, in the wallet the user may open a proof request multiple times, or select an alternate credential for the proof, which would fetch the revocation registry entry again. I think the wallet would benefit from having RevRegEntry transactions cached.

@andrewwhitehead
Member

andrewwhitehead commented Jan 18, 2024

You could add a TTL and maximum number of items, then prune the cache (removing expired items first) when adding new items. Objects like credential definitions are treated as immutable, but in reality can potentially be updated on the ledger (depending on the access controls).

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@wadeking98
Contributor Author

I could add support for both and let the end user decide which one to use, or do you think LRU should be avoided, @andrewwhitehead?

@andrewwhitehead
Member

I think the only issue with straight LRU is that commonly used entries might never expire, which means they would never be re-fetched. Also, the get/insert/remove methods of the cache implementation don't necessarily need to take &mut self, because locking the Mutex gives you a mutable reference anyway.

@swcurran
Member

I think the fetching patterns vary by object type. Is it worth worrying about that?

Specifically:

* DIDs, ATTRIBs can be “updated” (e.g. a new instance written, and on read the latest is returned) so they do need a “moderate” TTL on them.
  * That said, for DIDs at least, we rarely need them, so reads are likely relatively rare.
* CredDefs and RevRegs don’t ever expire, so can have a very long TTL
* RevRegStatus requests (they aren’t ledger objects) must be cached using their query parameters (from, to) and as such should have a short TTL, as getting them out of the cache fairly quickly is probably a good idea. Every time a new presentation is generated, the query is likely to be different.

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@wadeking98
Contributor Author

> I think the only issue with straight LRU is that commonly used entries might never expire, which means they would never be re-fetched. Also, the get/insert/remove methods of the cache implementation don't necessarily need to take &mut self, because locking the Mutex gives you a mutable reference anyway.

Got it, I've left the LRU cache there just in case, and I've implemented the TTL caching functionality as well. I think I need to keep insert and remove mutable because I've replaced the mutex on the cache with a RwLock; I only had it as a mutex when the LRU cache was the only implementation we had.

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@wadeking98
Contributor Author

wadeking98 commented Jan 19, 2024

> I think the fetching patterns vary by object type. Is it worth worrying about that?
>
> Specifically:
>
> * DIDs, ATTRIBs can be “updated” (e.g. a new instance written, and on read the latest is returned) so they do need a “moderate” TTL on them.
>   * That said, for DIDs at least, we rarely need them, so reads are likely relatively rare.
> * CredDefs and RevRegs don’t ever expire, so can have a very long TTL
> * RevRegStatus requests (they aren’t ledger objects) must be cached using their query parameters (from, to) and as such should have a short TTL, as getting them out of the cache fairly quickly is probably a good idea. Every time a new presentation is generated, the query is likely to be different.

I think this is a good idea, but it may be complex enough to deserve its own PR.

@andrewwhitehead
Member

> I think that I need to keep insert and remove mutable because I've replaced the mutex on the cache with a RwLock

The same is true for RwLock. The contents of an Arc are not generally writable, except in cases like this where a write lock is obtained.

If the TTL varies by transaction type then it might be best for the caller to specify what that is in each request? Although we could also add some defaults.

@andrewwhitehead
Member

One important case to test would be a GET_TXN for an undefined (not yet published) sequence number. This can return a response with a 'null' value which we do not want to cache.

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@wadeking98
Contributor Author

> > I think that I need to keep insert and remove mutable because I've replaced the mutex on the cache with a RwLock
>
> The same is true for RwLock. The contents of an Arc are not generally writable, except in cases like this where a write lock is obtained.
>
> If the TTL varies by transaction type then it might be best for the caller to specify what that is in each request? Although we could also add some defaults.

I was thinking the TTL storage type could handle that: maybe in the new() function the user passes in a default TTL along with a list of enums matching the request types, each with an associated u64 (or similar) that determines the TTL for that request type.

@wadeking98
Contributor Author

wadeking98 commented Jan 29, 2024

@andrewwhitehead I've updated the cache functionality. I tried your idea of having one cache per pool, but we still run into the issue of potential conflicts between ledgers that have the same genesis transactions. Additionally, I realized that it would be difficult to clean up the fs cache in this scenario: say a ledger updates its genesis file, the pool would then have a new ID and a new cache with a different fs location, and the old cache would stay on the file system without being cleaned up.

I decided to use a global cache strategy and storage, and then initialize a new cache instance with the pool genesis transactions as a key prefix. This way we don't have to worry about cache conflicts between different ledgers (so long as they have different genesis transactions), and we also don't have to worry about managing multiple locations on the file system.

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@andrewwhitehead
Member

Getting pretty close now, I think. The solution for handling multiple ledgers should work; it's not the end of the world if the cache gets expired when the pool transactions change.

We may want to swap out sled later, as it seems to be unmaintained at the moment, but it's fine for now. fjall could be a good option? I'd like to find a key-value DB with random element access (or add that feature) so that this cache eviction strategy could be implemented: https://danluu.com/2choices-eviction/

Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
libindy_vdr/include/libindy_vdr.h (review comment, resolved)
@@ -130,6 +130,12 @@ impl From<zmq::Error> for VdrError {
}
}

impl From<sled::Error> for VdrError {

Can leave this as-is for now, but I think we can use #[from(sled::Error)] from thiserror and it will derive it properly.

libindy_vdr/src/pool/cache/mod.rs (review comments, resolved)
wrappers/javascript/indy-vdr-nodejs/src/NodeJSIndyVdr.ts (review comment, resolved)
wrappers/javascript/indy-vdr-react-native/cpp/indyVdr.cpp (review comment, resolved)
Signed-off-by: wadeking98 <wkingnumber2@gmail.com>
@wadeking98
Contributor Author

I'll need @berendsliedrecht or @andrewwhitehead to merge, since I don't have write access to the repo.

@andrewwhitehead andrewwhitehead merged commit f9f9752 into hyperledger:main Feb 6, 2024
25 checks passed