#114 WIP collection cache #285

joepio · 2022-01-15T21:51:21Z

TODO:

Build the cache
Get rid of .unwrap() in update_member closures
Load from cache when possible
Update CollectionWatcher dynamically

PR Checklist:

Link to related issue Collection caching #114
Add changelog entry linking to issue
Add tests
Add tests for authorization
Add tests for changing data, updating index (e.g. remove stuff)
More tests?

joepio · 2022-01-15T21:54:44Z

So the thing I'm currently struggling with, is performing the cached query. Currently, Collection does not depend onDb, which means that I can't access Db specific methods from Collection. So should I add collection building methods to Storelike?

Edit: ended up making the Query abstraction!

joepio · 2022-01-17T17:17:47Z

Having some trouble with conditionally reversing the sled iterator, depending on the sort_desc setting of the user.

    let iter = if q.sort_desc {
        store.members_index.range(start_key..end_key)
    } else {
        store.members_index.range(start_key..end_key).rev()
    };

Fails, because

`if` and `else` have incompatible types
expected type `sled::Iter`
 found struct `std::iter::Rev<sled::Iter>`

So I read about using Box to prevent this, but that also doesn't work:

    let iter = if q.sort_desc {
        Box::new(store.members_index.range(start_key..end_key))
    } else {
        Box::new(store.members_index.range(start_key..end_key).rev())
    };

Fails:

`if` and `else` have incompatible types
expected type `std::boxed::Box<sled::Iter>`
 found struct `std::boxed::Box<std::iter::Rev<sled::Iter>>`rustcE0308

joepio · 2022-01-19T08:32:00Z

The changing data test is failing, because the cache is not properly invalidated.

We have this logic, that checks an update atom according to query filters that are being watched:

            let should_update = match (&q_filter.property, &q_filter.value) {
                (Some(prop), Some(val)) => prop == &atom.property && val == &atom.value,
                (Some(prop), None) => prop == &atom.property,
                (None, Some(val)) => val == &atom.value,
                // We should not create indexes for Collections that iterate over _all_ resources.
                _ => false,
            };

But... That goes wrong when the sorted value differs from the filtered value. So let's say we have a QueryFilter that filters by is_a human, and sorts by firstname. Let's say we update the firstname. In this case we want the old Atom to be removed from the index. So in this case the should_update function is called for the new firstname atom. This atom does not contain the is_a property or the human value. We want to update the existing indexed value, but we can't, because we can't perform the is_a human check. We need all the properties of the Resource, or we need a pre-confirmed check which collections the resource matches.

We should first check if the sort_property matches, then we should check if the resource has the query, I think. This check should be memoized, probably.

Maybe the problem is more fundamental. The should_update function cannot decide if a value needs to be removed, knowing only the Atom and the FilterQuery.

Flip it

Say we want to check if an atom will match things. We could first check if the resource matches the query filter. If that is the case, we can continue, and perform the existing checks.

joepio · 2022-01-22T14:21:12Z

So the previous cache invalidation issue has now been solved by passing the old resource to the cache invalidation logic. It is needed to match the QueryFilters in query_index.

Things are working fine if a commit is only applied once, but for the /commit endpoint, I'm actually applying a Commit twice.

joepio · 2022-01-24T09:45:09Z

I think atoms for the QueryIndex and the ReferenceIndex might need to be a bit different.
The difficult part is ResourceArrays.

Let's consider I want add a Paragraph to a Document.
This means I need to add one Atom to the Index.

ReferenceIndex

Needs to update that the Paragraph now has an incoming link from the Document
The old paragraph also still needs a reference
It is entirely possible that in this Atom, both paragraphs are modified (and not just one new appended item). Just to make sure, we will first have to delete all references from the old atom, and then create the new references from the new atom.
This will be an unnecessarily costly operation if we only append an item to an array

QueryIndex

If someone has a Query that sorts document by their number of Paragraphs, their query might need to be updated.
Note that in this case, the IndexAtom will not have any references - it should have a count

Solution

I think we need to have two value fields in a QueryAtom: both a Filtervalue and a Sortvalue. Or we should accept that we can't sort by count. Which is also acceptable.

joepio · 2022-01-24T19:51:25Z

I think I finally know what is causing my last failing test: remove_atom_from_index and add_atom_to_index should get passed a different resource, not the same one. Should update this in the apply method of commits

joepio · 2022-01-24T21:26:59Z

Tests are green! I think I'll merge this soon. But first, I'll look for some extra things to break, test and fix.

#114 WIP collection cache

c93c84f

joepio added 5 commits January 16, 2022 20:10

#114 WIP collection cache working but slow

e0beb90

#114 try different approach

f60f810

WIP tests passing, but sorting not working

8972413

WIP

52cac01

Sorting one way works...

ca45b40

joepio added 7 commits January 17, 2022 18:39

Fix sorting

b4caac2

mostly working

02845b2

Move db tests to file

209ff4f

Move some utility functions

a597ce3

Cleanup

a1d2b78

authorization tests

e164f88

Add authorization tests, get them green

00608e7

joepio added 2 commits January 19, 2022 22:01

Cache invalidation test passing

89e5e18

Add test for delting, fix temp path gitignore

0b9c464

Refactor commit opts

8c0b802

Fix query index

7bcec1a

joepio added 2 commits January 24, 2022 20:53

Change TPF, fix test

9c09a76

Tests passing

a0608ca

joepio marked this pull request as ready for review January 24, 2022 21:27

joepio added 2 commits January 25, 2022 16:13

Improve sorting

1725fb1

Bump to v0.31.0

bf77145

joepio merged commit d9b9fb6 into master Jan 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

#114 WIP collection cache #285

#114 WIP collection cache #285

Uh oh!

joepio commented Jan 15, 2022 •

edited

Loading

Uh oh!

joepio commented Jan 15, 2022 •

edited

Loading

Uh oh!

joepio commented Jan 17, 2022

Uh oh!

joepio commented Jan 19, 2022

Uh oh!

joepio commented Jan 22, 2022

Uh oh!

joepio commented Jan 24, 2022

Uh oh!

joepio commented Jan 24, 2022

Uh oh!

joepio commented Jan 24, 2022

Uh oh!

Uh oh!

#114 WIP collection cache #285

#114 WIP collection cache #285

Uh oh!

Conversation

joepio commented Jan 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

joepio commented Jan 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

joepio commented Jan 17, 2022

Uh oh!

joepio commented Jan 19, 2022

Flip it

Uh oh!

joepio commented Jan 22, 2022

Uh oh!

joepio commented Jan 24, 2022

ReferenceIndex

QueryIndex

Solution

Uh oh!

joepio commented Jan 24, 2022

Uh oh!

joepio commented Jan 24, 2022

Uh oh!

Uh oh!

joepio commented Jan 15, 2022 •

edited

Loading

joepio commented Jan 15, 2022 •

edited

Loading