kvs: convert missing_refs_list to hash or deal with duplicates #1751

Closed · chu11 opened this issue Oct 23, 2018 · 2 comments

chu11 commented Oct 23, 2018

While investigating #1747, it occurred to me that the missing_refs_list that is returned in kvstxn could have duplicate references. Perhaps it'd be better to internally represent it as a hash, so that duplicate missing references aren't looked up twice.
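To illustrate the idea, here is a minimal hypothetical sketch (not flux-core's actual code), assuming czmq's zhashx_t with its default string-key handling; the helper name and the blobref strings are made up for illustration:

#include <assert.h>
#include <czmq.h>

/* Hypothetical helper: record a missing reference, collapsing duplicates.
 * zhashx_insert() returns -1 if the key is already present, so each unique
 * blobref is stored (and would later be looked up) exactly once. */
static void missing_ref_add (zhashx_t *missing_refs, const char *ref)
{
    (void)zhashx_insert (missing_refs, ref, "placeholder");
}

int main (void)
{
    zhashx_t *missing_refs = zhashx_new ();
    missing_ref_add (missing_refs, "sha1-aaaa");
    missing_ref_add (missing_refs, "sha1-aaaa");   /* duplicate, collapsed */
    missing_ref_add (missing_refs, "sha1-bbbb");
    assert (zhashx_size (missing_refs) == 2);      /* only unique refs remain */
    zhashx_destroy (&missing_refs);
    return 0;
}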

chu11 changed the title from "kvs: convert missing_refs_list to hash" to "kvs: convert missing_refs_list to hash or deal with duplicates" on Oct 24, 2018

chu11 commented Oct 24, 2018

I actually began to make this change, but wasn't sure if it was a net win. Basically, if a user were to do a kvs transaction with many writes to the same directory, such as

txn = flux_kvs_txn_create ();
flux_kvs_txn_put (txn, 0, "dir.a", "a");
flux_kvs_txn_put (txn, 0, "dir.b", "b");
...
flux_kvs_txn_put (txn, 0, "dir.y", "y");
flux_kvs_txn_put (txn, 0, "dir.z", "z");
flux_kvs_commit (h, 0, txn);

and the reference for directory "dir" happens not to be loaded yet, this would put 26 copies of the same missing reference on the list and generate 26 lookups. So ...

A) Is this common? I don't think so, because in the above scenario I imagine most of the time it's more like:

txn = flux_kvs_txn_create ();
flux_kvs_txn_mkdir (txn, 0, "dir");
flux_kvs_txn_put (txn, 0, "dir.a", "a");
flux_kvs_txn_put (txn, 0, "dir.b", "b");
...
flux_kvs_txn_put (txn, 0, "dir.y", "y");
flux_kvs_txn_put (txn, 0, "dir.z", "z");
flux_kvs_commit (h, 0, txn);

so the missing reference for "dir" isn't an issue.

B) If it is something to consider, is detecting duplicates in the references list a net win, regardless of how it's done?


chu11 commented Oct 25, 2018

I was talking to @garlick about this, and he reminded me that while the above is true, higher-level code in the cache / wait data structures probably handles this. @garlick's memory is far better than mine and he's right, although the code is a tad non-optimal.

Basically, the core missing-references loop is:

for all missing references {
    entry = cache_lookup (ref);
    if (!entry) {
        /* not cached yet: create an entry and send one load rpc */
        entry = cache_create_entry (ref);
        content_store_load (ref);
    }
    if (!cache_entry_has_valid_data (entry)) {
        /* data not back from the content store yet: wait on this entry */
        cache_add_me_to_waitlist (entry, wait);
        stall;
    }
}

So the worst case, sending multiple RPCs to load the same data from the content store, is avoided.

However, one small non-optimal detail remains: every duplicate missing reference is still added to the waitlist on the same cache entry. So in my worst-case example above, the waitlist queue on the cache entry for "dir" would be 26 entries long.
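As a purely illustrative toy model (not flux-core code) of how those counts fall out in the loop above:

#include <stdio.h>

/* Toy model: 26 puts all stall on the same missing "dir" reference, so
 * only one load RPC is sent, but the single cache entry still
 * accumulates 26 waiters. */
int main (void)
{
    int rpcs_sent = 0, waiters = 0;
    int entry_exists = 0;              /* cache entry for "dir" */

    for (int i = 0; i < 26; i++) {     /* one pass per missing reference */
        if (!entry_exists) {
            entry_exists = 1;          /* stands in for cache_create_entry () */
            rpcs_sent++;               /* stands in for content_store_load () */
        }
        waiters++;                     /* stands in for cache_add_me_to_waitlist () */
    }
    printf ("rpcs=%d waiters=%d\n", rpcs_sent, waiters);  /* rpcs=1 waiters=26 */
    return 0;
}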

At a minimum, some clarifying comments should be added to the code to explain this, so I'll leave this open until I add them.
