
Cache entrypoints in group #3622

Merged: 2 commits into aiidateam:develop from cache-entrypoints-in-group (Dec 10, 2019)

Conversation

@ltalirz (Member) commented Dec 9, 2019

Entry point loading is already cached at the reentry level, but getting
all entry points within a group can still take a significant amount of
time.
This commit introduces a simple cache at the AiiDA level. An alternative
would be to add a cache at the reentry level.

To provide some context: The timings on a query for 300 Dict nodes are
as follows:

  • No cache: ~110 ms
  • Cache: 67 ms
  • Cache at load_node_class level: 58 ms
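
For reference, a rough sketch of how one might reproduce this kind of timing in an IPython shell (the profile contents are an assumption; it needs at least 300 stored Dict nodes):

    # Hedged sketch: timing a query for 300 Dict nodes, as in the numbers above.
    from aiida.orm import QueryBuilder, Dict

    qb = QueryBuilder().append(Dict, project='*').limit(300)
    %timeit qb.all()  # IPython magic, as used later in this thread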

@muhrin I would say the savings are significant and probably worth reaping. Of course, one could also try to do this on the reentry side...
Let me know. If you think we should keep the cache here, I'll add the proper docstrings etc.

@@ -50,6 +50,28 @@ class EntryPointFormat(enum.Enum):
MINIMAL = 3


class EntryPointCache():
@greschd (Member) commented:
I think this might be equivalent to adding the @functools.lru_cache(maxsize=None) decorator to the get_entry_points function.
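
For concreteness, a minimal sketch of that suggestion (the function body is an assumed wrapper around the reentry manager, not a copy of the actual AiiDA source):

    # Sketch: memoize get_entry_points per group with functools.lru_cache.
    from functools import lru_cache

    @lru_cache(maxsize=None)
    def get_entry_points(group):
        """Return all entry points registered under the given group."""
        # ENTRYPOINT_MANAGER is assumed to be reentry's entry-point manager.
        return list(ENTRYPOINT_MANAGER.iter_entry_points(group=group))

Note that the cached list is shared between callers, so mutating the return value would be unsafe; that caveat applies equally to a hand-rolled cache.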

@muhrin (Contributor) commented:

Good point @greschd; yes, this would be a better way to do it.

@ltalirz (Member, Author) commented:

Thanks, that seems much more elegant.

@ltalirz force-pushed the cache-entrypoints-in-group branch 2 times, most recently from 60c9114 to 3f0a43b on December 9, 2019, 20:48
@ltalirz (Member, Author) commented Dec 9, 2019

The lru_cache seems to be a bit slower for some reason:

In [4]: %timeit qb2.all()  # with project='*'
10 loops, best of 5: 82 ms per loop

There is a C implementation of the cache: https://pypi.org/project/fastcache/
Anyhow, I don't think it really matters that much here: we get some speedup from the lru_cache, which adds only two lines of code. I'd be fine with that.
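
If the decorator overhead ever became a concern, the C implementation mentioned above is advertised as a drop-in replacement (hedged sketch; same assumed function and ENTRYPOINT_MANAGER as before):

    # Sketch: fastcache's clru_cache mimics functools.lru_cache's interface.
    from fastcache import clru_cache

    @clru_cache(maxsize=None)
    def get_entry_points(group):
        return list(ENTRYPOINT_MANAGER.iter_entry_points(group=group))

Either variant exposes .cache_info() and .cache_clear(), which is handy for inspecting hit rates or invalidating the cache if the set of installed plugins changes during a session.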

@muhrin (Contributor) commented Dec 9, 2019

So you want to go with the lru_cache? If not, there is a small change I would make to the PR as it stands.

@muhrin (Contributor) left a review comment:

Looks great

@ltalirz merged commit fcfbf3a into aiidateam:develop on Dec 10, 2019