
Fix Issue 11644 - EvictingStrategy.LRU for std.functional.memoize #5826

Open
wants to merge 2 commits into master

Conversation

jercaianu
Contributor

I implemented the O(1) algorithm for "get" and "set" on the cache, similar to what is described here [1].

Thanks,
Alex
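
(For context, a minimal sketch of the classic O(1) LRU scheme the PR is describing: an associative array for lookup plus a doubly linked list ordered by recency. The names below are illustrative, not the PR's actual code.)

```d
// Sketch of an O(1) LRU cache: the associative array gives O(1) lookup,
// and the doubly linked list ordered by recency gives O(1) promotion and
// eviction. Illustrative only; not the PR's code.
struct LRUCache(K, V)
{
    private static struct Node
    {
        K key;
        V value;
        Node* prev, next;
    }

    private Node*[K] map;   // key -> list node
    private Node* head;     // most recently used
    private Node* tail;     // least recently used
    private size_t capacity;

    this(size_t capacity) { this.capacity = capacity; }

    // O(1) "get": find the node and move it to the front.
    bool get(K key, out V value)
    {
        auto p = key in map;
        if (p is null)
            return false;
        moveToFront(*p);
        value = (*p).value;
        return true;
    }

    // O(1) "set": insert or update; evict the tail when full.
    void put(K key, V value)
    {
        if (auto p = key in map)
        {
            (*p).value = value;
            moveToFront(*p);
            return;
        }
        if (map.length >= capacity && tail !is null)
        {
            map.remove(tail.key);   // drop the least recently used entry
            unlink(tail);
        }
        auto n = new Node(key, value);
        pushFront(n);
        map[key] = n;
    }

    private void moveToFront(Node* n)
    {
        if (n is head) return;
        unlink(n);
        pushFront(n);
    }

    private void unlink(Node* n)
    {
        if (n.prev) n.prev.next = n.next; else head = n.next;
        if (n.next) n.next.prev = n.prev; else tail = n.prev;
        n.prev = n.next = null;
    }

    private void pushFront(Node* n)
    {
        n.prev = null;
        n.next = head;
        if (head) head.prev = n;
        head = n;
        if (tail is null) tail = n;
    }
}
```

Usage would be along the lines of `auto cache = LRUCache!(string, int)(256); cache.put("answer", 42);`.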

@dlang-bot
Contributor

Thanks for your pull request, @jercaianu! We are looking forward to reviewing it, and you should be hearing from a maintainer soon.

Some tips to help speed things up:

  • smaller, focused PRs are easier to review than big ones

  • try not to mix up refactoring or style changes with bug fixes or feature enhancements

  • provide helpful commit messages explaining the rationale behind each change

Bear in mind that large or tricky changes may require multiple rounds of review and revision.

Please see CONTRIBUTING.md for more information.

Bugzilla references

11644: EvictingStrategy.LRU for std.functional.memoize

@MartinNowak
Member

MartinNowak commented Oct 31, 2017

Thanks for your PR, looks interesting.

The usage of a doubly linked list and classes bothers me a little from a performance perspective.
Could you use indices and a lazily pre-allocated array instead?

Also, most of the issue has already been addressed by the random replacement in the cuckoo hashing. Do you have an example that has severe performance issues with that approach?
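
(To make the suggestion concrete, a hypothetical sketch of that layout: all nodes live in one lazily allocated contiguous array and link to each other by index rather than by pointer, so there is no per-entry class allocation. Names and structure are assumptions, not a proposed Phobos API.)

```d
// Hypothetical index-based node storage: one contiguous array, allocated
// lazily on first use; prev/next are uint indices instead of pointers.
struct IndexedNodes(K, V, uint maxSize)
{
    enum uint nil = uint.max;   // sentinel "null" index

    static struct Node
    {
        K key;
        V value;
        uint prev = nil;        // index into `nodes`, not a pointer
        uint next = nil;
    }

    Node[] nodes;               // allocated once, on first insertion
    uint[K] lookup;             // key -> slot index
    uint head = nil;
    uint tail = nil;
    uint used;

    // Hand out the next free slot, allocating the whole array lazily.
    uint acquireSlot()
    {
        if (nodes.length == 0)
            nodes = new Node[](maxSize);
        assert(used < maxSize, "when full, the caller recycles the tail slot");
        return used++;
    }
}
```

Linking and unlinking would then manipulate `uint` indices exactly as the pointer version manipulates `Node*`, but the whole cache stays in a single allocation.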

@andralex
Member

Where is the link to [1]?

@andralex left a comment (Member)

This is a great instance of "academia vs. real world" clash :)

```diff
@@ -1068,6 +1068,146 @@ template memoize(alias fun, uint maxSize)
 }
 }
+
+/// ditto
+template memoize(alias fun, uint maxSize, bool lru = true)
```

I searched this page and found no use of lru.

```d
import std.traits : ReturnType;
import std.typecons : Tuple;

class Node
```

A class with dynamic allocation, linked into a doubly linked list, is textbook-correct but certainly out of character for Phobos and for high-performance applications. Per @MartinNowak, we'd be looking at a properly managed contiguous array.


```d
this(Tuple!(Parameters!fun) args = Tuple!(Parameters!fun).init,
     ReturnType!fun res = ReturnType!fun.init)
{
```

Hmmm, the defaults here are surprising - most functions will NOT return the default result for the default parameter values. Where are the defaults used?

```d
static Node head;
static Node tail;
static Node[Tuple!Args] cache;
static int capacity = maxSize;
```

size_t probably

@andralex
Member

By the way, LRU sux; we should use LIRS: https://en.wikipedia.org/wiki/LIRS_caching_algorithm

@MartinNowak
Member

MartinNowak commented Nov 1, 2017

Reviewing and properly benchmarking the current approach (#2591) would be very welcome.
If we find this to be inadequate for certain usages, we could look at different replacement policies.

I'm not too certain we really need a fancy eviction policy here: since we have a uniformly distributed hash (MD5), random replacement should be fairly OKish, and just using a slightly bigger cache will fix performance problems.
Of course a better eviction policy can further optimize for cache size vs. hit ratio.

If we go that route we should replace the current cuckoo hash implementation.

Hopefully we can find a general purpose replacement policy. Making it selectable via a template parameter does increase the complexity of memoize even further. An adaptive cache size would be nice as well (but hard), as it's often difficult to come up with a good number ahead of time.
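
(For concreteness, a rough sketch of the random-replacement idea under a uniform hash: a fixed-size table where a new entry simply overwrites whatever occupies its slot. This is a simplified single-probe variant for illustration; the actual Phobos implementation referenced above uses a cuckoo-style layout, and the names here are not Phobos' code.)

```d
import std.traits : Parameters, ReturnType;
import std.typecons : Tuple, tuple;

// Simplified memoize with implicit random-ish replacement: with a uniformly
// distributed hash, overwriting the slot a key hashes to evicts what is,
// statistically, a near-random entry. Sketch only; not Phobos' actual code.
ReturnType!fun memoizeOverwrite(alias fun, uint maxSize)(Parameters!fun args)
{
    alias Key = Tuple!(Parameters!fun);
    static struct Slot { Key key; ReturnType!fun value; bool filled; }
    static Slot[maxSize] table;

    auto key = tuple(args);
    immutable idx = hashOf(key) % maxSize;
    if (table[idx].filled && table[idx].key == key)
        return table[idx].value;            // hit

    auto result = fun(args);
    table[idx] = Slot(key, result, true);   // miss: overwrite the occupant
    return result;
}
```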

@ben-manes

LIRS is very complicated to implement correctly. Most implementations miss key details that degrade the hit rate, leak memory because the paper neglects to bound the ghost entries (which the authors' C code does bound), and may exhibit jitter due to slow stack pruning if bounded inefficiently. However, it does have an excellent hit rate across many workloads and can be speedy.

After doing a deep dive into approaches for my caching library, I chose TinyLFU with some minor adaptations. It is very easy to implement, time/space efficient, and beats LIRS in many real-world workloads. As an admission policy, it can be paired with any eviction policy (such as an array-based approach rather than linked lists, e.g. sampled LRU).
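
(For readers unfamiliar with TinyLFU, a minimal sketch of the admission idea, loosely based on the TinyLFU paper rather than on Caffeine's code: a count-min sketch keeps approximate access frequencies, and on eviction the incoming candidate only displaces the victim if it has been seen more often. Sizes and hash mixing below are illustrative assumptions.)

```d
import std.algorithm.comparison : min;

// Count-min sketch for approximate access frequencies (illustrative sizes).
struct CountMinSketch
{
    enum rows = 4;
    enum width = 1024;              // power of two for cheap masking
    ubyte[width][rows] counts;

    // Derive a per-row slot from one hash; the mixing here is illustrative.
    private size_t slot(size_t hash, size_t row) const
    {
        return (hash ^ (hash >> (8 * row + 1))) & (width - 1);
    }

    // Record one access: bump the counter in every row (saturating).
    void record(size_t hash)
    {
        foreach (row; 0 .. rows)
        {
            immutable i = slot(hash, row);
            if (counts[row][i] < ubyte.max)
                ++counts[row][i];
        }
    }

    // Estimate frequency as the minimum across rows (classic count-min).
    uint estimate(size_t hash) const
    {
        uint best = uint.max;
        foreach (row; 0 .. rows)
            best = min(best, counts[row][slot(hash, row)]);
        return best;
    }
}

// TinyLFU admission: keep the victim unless the candidate is more popular.
bool admit(ref const CountMinSketch sketch, size_t candidateHash, size_t victimHash)
{
    return sketch.estimate(candidateHash) > sketch.estimate(victimHash);
}
```

A full implementation also periodically halves all counters so the frequency estimates age over time, which this sketch omits.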

@jercaianu
Contributor Author

@ben-manes @MartinNowak
Thanks for the comments!
I have left this issue on the backburner, since I've been working more on allocators.
I will definitely take a look at TinyLFU.

Alex
