
[New] Size limited LRU cache #58

Closed
ZhymabekRoman opened this issue May 27, 2022 · 11 comments · Fixed by #76
Comments

@ZhymabekRoman
Contributor

I made a small working prototype of the cache implementation on top of OrderedDict. Please let me know what can be improved.

https://gist.github.com/ZhymabekRoman/a9c7a25c155dfdea52277cc74f28fa65
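Roughly, the general idea is something like the sketch below. This is illustrative only, assuming a plain OrderedDict with a byte budget; the class and method names are not the ones from the gist, and the shallow sys.getsizeof measurement is exactly the part discussed further down.

# Minimal sketch of a size-limited LRU cache built on OrderedDict.
# Illustrative only; the real prototype is in the linked gist.
import sys
from collections import OrderedDict


class SizeLimitedLRUCache:
    def __init__(self, max_size: int = 10 * 1024 * 1024):
        self.max_size = max_size      # size budget in bytes
        self._data = OrderedDict()

    def __setitem__(self, key, value):
        self._data[key] = value
        self._data.move_to_end(key)   # mark as most recently used
        # Evict least recently used entries until we fit in the budget,
        # but always keep at least the entry that was just inserted.
        while self._current_size() > self.max_size and len(self._data) > 1:
            self._data.popitem(last=False)

    def __getitem__(self, key):
        value = self._data[key]
        self._data.move_to_end(key)
        return value

    def _current_size(self) -> int:
        # Shallow sizes only; containers are not measured recursively.
        return sum(sys.getsizeof(v) for v in self._data.values())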

@ZhymabekRoman
Contributor Author

There is a small problem: it seems to run a little slow, and I have no idea yet how to speed it up.

@Animenosekai
Owner

I took a look at the different implementations and I think that it could be possible to adapt pympler's asizeof implementation for Python >=3.2.

Here is my slightly modified implementation:
https://gist.github.com/Animenosekai/4e5a3a980e7ed2e542003a58a54ede96
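For context, the difference between the shallow and the recursive approach shows up on any nested object. A small example, assuming pympler is installed (exact numbers vary by platform and Python version):

# sys.getsizeof is shallow: it does not follow references inside containers.
# pympler's asizeof recurses into the whole object graph, so it is more
# accurate but also more expensive.
import sys
from pympler import asizeof

nested = {"payload": [b"x" * 1024 for _ in range(100)]}

print(sys.getsizeof(nested))     # only the dict header: a few hundred bytes
print(asizeof.asizeof(nested))   # roughly the full ~100 KB of contents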

@Animenosekai
Owner

I just tested the different implementations, and yes, our implementation is quite slow; pympler's is a bit faster, but sys.getsizeof is definitely the fastest (which is expected, since it does not recurse into the object).
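A rough way to reproduce this kind of comparison; deep_sizeof here is only a stand-in for the recursive implementations from the gists above, not the project's actual code:

# Rough benchmark sketch: shallow vs. recursive size measurement.
import sys
import timeit

def deep_sizeof(obj, seen=None):
    # Simplified recursive sizing; tracks seen ids to avoid double counting.
    seen = set() if seen is None else seen
    if id(obj) in seen:
        return 0
    seen.add(id(obj))
    size = sys.getsizeof(obj)
    if isinstance(obj, dict):
        size += sum(deep_sizeof(k, seen) + deep_sizeof(v, seen) for k, v in obj.items())
    elif isinstance(obj, (list, tuple, set, frozenset)):
        size += sum(deep_sizeof(i, seen) for i in obj)
    return size

data = {i: list(range(100)) for i in range(1000)}
print(timeit.timeit(lambda: sys.getsizeof(data), number=1000))  # shallow, fast
print(timeit.timeit(lambda: deep_sizeof(data), number=10))      # recursive, much slower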

@Animenosekai
Owner

Or maybe we could check whether the object is a native type, in which case we would measure it with sys.getsizeof, and use the other implementations for more complex objects.
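A minimal sketch of that idea, assuming deep_sizeof is whichever recursive implementation ends up being chosen:

# Hybrid sizing sketch: cheap shallow measurement for flat built-in types,
# a recursive fallback for containers and custom objects.
import sys

FLAT_NATIVE_TYPES = (int, float, bool, complex, str, bytes, bytearray, type(None))

def object_size(obj, deep_sizeof):
    if isinstance(obj, FLAT_NATIVE_TYPES):
        # No references to follow, so the shallow size is already exact.
        return sys.getsizeof(obj)
    # Anything that can hold references needs the recursive measurement.
    return deep_sizeof(obj)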

@ZhymabekRoman
Contributor Author

Maybe implement asynchronous checking and clearing of the cache using a ThreadPoolExecutor? This should speed up the caching process.
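A rough sketch of what offloading the size check and eviction could look like; names are illustrative and locking is kept minimal on purpose:

# Sketch of moving the size check and eviction off the caller's thread.
import threading
from collections import OrderedDict
from concurrent.futures import ThreadPoolExecutor

class BackgroundTrimmedCache:
    def __init__(self, max_size, measure):
        self.max_size = max_size          # size budget in bytes
        self.measure = measure            # callable: cache dict -> size in bytes
        self._data = OrderedDict()
        self._lock = threading.Lock()
        self._executor = ThreadPoolExecutor(max_workers=1)

    def __setitem__(self, key, value):
        with self._lock:
            self._data[key] = value
            self._data.move_to_end(key)
        # The expensive size check and eviction run on the worker thread,
        # so the caller returns immediately.
        self._executor.submit(self._trim)

    def _trim(self):
        with self._lock:
            while len(self._data) > 1 and self.measure(self._data) > self.max_size:
                self._data.popitem(last=False)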

@Animenosekai
Owner

Animenosekai commented May 28, 2022

Maybe implement asynchronous checking and clearing of the cache using a ThreadPoolExecutor? This should speed up the caching process.

Well, I tried to implement it too in 58521f1, but it seems to be slow as well.

I'll need to figure out why, because it seems odd...

@ZhymabekRoman
Contributor Author

So I tried to optimise the LRU cache, and it seems I've managed it. Some numbers:

Raw dictionary implementation code:
import os

if __name__ == "__main__":
    a = {}
    for i in range(1_000):
        print(i)
        a[i] = os.urandom(2_000_000)  # ~2 MB per entry

Also, in the test I changed the old LRU implementation to store os.urandom data instead of just numbers.


Just raw dictionary time:

python3 just_dict.py  0.03s user 8.59s system 99% cpu **8.643 total**

Old LRU cache implementation time:

python3 lru_old.py  48.94s user 8.59s system 86% cpu **1:06.65 total**

New LRU cache implementation time:

python3 lru.py  0.50s user 7.64s system 99% cpu **8.201 total**

time shows that the new LRU implementation has essentially zero overhead. I think that's a good result. Instead of recomputing the cache size every time, the new implementation stores each object's size in a separate class attribute and uses these stored values to free up the cache later.

New LRU implementation code:
https://gist.github.com/ZhymabekRoman/43ee515959de024416e29b8dd97e4d96
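As I understand the description above, the core trick is roughly the following bookkeeping: measure each value once on insertion, keep a running total, and evict using the stored sizes instead of re-measuring the whole cache. A sketch with illustrative names, not the gist's actual code:

# Sketch of the size-tracking approach described above.
import sys
from collections import OrderedDict

class TrackedLRUCache:
    def __init__(self, max_size: int):
        self.max_size = max_size
        self._data = OrderedDict()
        self._sizes = {}          # per-key size, measured once on insertion
        self._total = 0           # running total of all stored sizes

    def __setitem__(self, key, value):
        if key in self._data:
            self._total -= self._sizes[key]
        size = sys.getsizeof(value)
        self._data[key] = value
        self._data.move_to_end(key)
        self._sizes[key] = size
        self._total += size
        # Evict least recently used entries using the stored sizes only.
        while self._total > self.max_size and len(self._data) > 1:
            old_key, _ = self._data.popitem(last=False)
            self._total -= self._sizes.pop(old_key)

    def __getitem__(self, key):
        self._data.move_to_end(key)
        return self._data[key]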

@Animenosekai
Owner

Ohhh, I understand how you did that. Really cool numbers!

Did you try it on Python <3.7 to see if it works as well?

@ZhymabekRoman
Contributor Author

Did you try it on Python <3.7 to see if it works as well?

Hmm, no. I'll try to install Debian Stretch on my machine and test it there with Python 3.5.

@ZhymabekRoman
Contributor Author

Yes, it works great! Some debugging shows that all the logic works as expected.

@ZhymabekRoman
Contributor Author

I think this issue can be closed, since it has been merged into the next branch.
