Add word cache reset function #152

eliotwrobson · 2023-06-17T05:24:47Z

See title, adds a word cache reset function and precomputes the digraph used by the DFA. Resolves #148

In looking at this, I thought about using a cached property for some of the methods that return fixed information about the given DFAs. However, switching these to cached properties (with the decorator) requires using a slightly different API. @caleb531 what are your thoughts about changing this? Is there a way to provide this caching without changing the API? It would be great if there were a way to do this cleanly.

coveralls · 2023-06-17T05:26:02Z

coverage: 99.883%. remained the same when pulling 5313990 on eliotwrobson:caching into 2aa2dda on caleb531:develop.

docs/fa/class-dfa.md

tests/test_dfa.py

automata/fa/dfa.py

caleb531

@eliotwrobson Overall looks good, but left you a few comments with some requested changes (mostly the name of the method itself).

Regarding preserving the API, I believe the @functools.lru_cache() decorator is the stdlib way to cache the output of methods. The only thing to note is that the method returns the cached output based on the arguments you pass to the function at call time. So foo(first='Eliot') and foo(last='Robson') will each be cache misses on the first call. But I think this is the behavior we want in most cases, anyway.

eliotwrobson · 2023-06-17T18:13:28Z

@caleb531 sounds good! As you pointed out, I made a mistake in the docs with the name in the first place 😅 so changing it is no problem.

Regarding caching, using lru cache here is actually a mistake: https://youtu.be/sVjtp6tGo0g

The use here would be for caching something like isempty, where there's no sense in recomputing the result each time for an immutable DFA. The canonical way of doing this as of 3.8 is with this decorator: https://docs.python.org/3/library/functools.html#functools.cached_property

However, this changes the method into a property (so it is accessed by the client code differently). This might be more natural since users may expect cached properties to behave this way, but I also don't know if changing the API this way is the right move.

caleb531 · 2023-06-17T18:51:37Z

@eliotwrobson Ah yes, I forgot that lru_cache is problematic for methods (ironically, this is something I myself have mentioned in a previous issue, yet have apparently forgotten 😅). However, it seems that there are at least two workarounds:

Implement the weakref-based decorator for caching methods (if we go this approach, I would call this decorator cached_method instead of memoized_method to achieve parity with the below package, plus I just like the name better)
Require the cached_method package as an additional dependency

The package doesn't seem like much overhead, but given how small this decorator is, we could just as easily include the implementation directly in the code. I could go either way.

So this whole post of mine obviously implies that I'd still prefer members like isempty to be methods, not properties. For me, it isn't just about preserving API backwards-compatibility, but about the semantics of methods vs. properties and the flexibility that methods provide.

Methods can have additional arguments in a way that's effectively future-proof with keyword arguments; properties can't no matter what you do. And as for the semantic argument: I tend to use properties to represent nouns/things, but when I have some check or operation to perform, I use a method.

I recognize that Automaton.input_parameters is the only exception to my rule of thumb because it represents a thing (or things), not an operation. Plus, the code is pretty simple and hardly expensive.

Okay, thank you for coming to my TED talk. Please let me know your thoughts!

eliotwrobson · 2023-06-17T18:57:45Z

I tend to agree, having these be methods allows more flexibility and conveys these are something to be computed. In the interest of writing less code, I'm inclined to go with the adding the optional dependency. So I will do that along with the renames described.

automata/fa/dfa.py

automata/base/automaton.py

caleb531 · 2023-06-17T21:30:00Z

@eliotwrobson Just want to say please feel free to continue to make changes, and just convert this PR out of Draft mode when it's ready for my re-review.

automata/base/automaton.py

eliotwrobson · 2023-06-17T21:41:22Z

@caleb531 looking over the changes I think this is ready to go! Flipping to open.

caleb531

@eliotwrobson LGTM! 👍

caleb531 · 2023-06-17T21:47:23Z

Actually, one quick afterthought:

How will all this additional caching behave if a user enables mutable automata? I guess they can just clear the caches manually as they make changes, correct?

eliotwrobson · 2023-06-17T21:52:09Z

How will all this additional caching behave if a user enables mutable automata? I guess they can just clear the caches manually as they make changes, correct?

@caleb531 yes, they can manually clear cached methods by emptying the __dict__. I don't think people will run into issues (as the API still assumes immutability even though the data structures themselves don't prevent this behavior).

eliotwrobson added 3 commits June 16, 2023 23:39

Cache digraph for DFA

353a3a2

Added tests and docs

aedcc5d

Fixed test

68f3a37

eliotwrobson added 2 commits June 17, 2023 00:27

mypy

5f06cad

lint

cb99eed

caleb531 reviewed Jun 17, 2023

View reviewed changes

docs/fa/class-dfa.md Outdated Show resolved Hide resolved

caleb531 reviewed Jun 17, 2023

View reviewed changes

tests/test_dfa.py Outdated Show resolved Hide resolved

caleb531 reviewed Jun 17, 2023

View reviewed changes

automata/fa/dfa.py Outdated Show resolved Hide resolved

caleb531 requested changes Jun 17, 2023

View reviewed changes

eliotwrobson added 5 commits June 17, 2023 15:36

Added first cached method

da62895

Moved function

931286d

Added more caching

03da781

Renamed cache clear function

61e16a6

Update test_dfa.py

8ba372f

caleb531 reviewed Jun 17, 2023

View reviewed changes

automata/fa/dfa.py Show resolved Hide resolved

eliotwrobson added 2 commits June 17, 2023 16:24

Added contains

9df5970

Update automaton.py

54295b4

caleb531 reviewed Jun 17, 2023

View reviewed changes

automata/base/automaton.py Outdated Show resolved Hide resolved

caleb531 marked this pull request as draft June 17, 2023 21:29

Minor type changes

5313990

eliotwrobson commented Jun 17, 2023

View reviewed changes

automata/base/automaton.py Show resolved Hide resolved

eliotwrobson marked this pull request as ready for review June 17, 2023 21:41

caleb531 self-requested a review June 17, 2023 21:42

caleb531 approved these changes Jun 17, 2023

View reviewed changes

eliotwrobson merged commit 766090f into caleb531:develop Jun 17, 2023
5 checks passed

eliotwrobson mentioned this pull request Jun 17, 2023

v8 Caching Behavior #148

Closed

eliotwrobson mentioned this pull request Jun 17, 2023

Proper support for allow_partial #147

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add word cache reset function #152

Add word cache reset function #152

eliotwrobson commented Jun 17, 2023

coveralls commented Jun 17, 2023 •

edited

Loading

caleb531 left a comment

eliotwrobson commented Jun 17, 2023

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

caleb531 left a comment

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

Add word cache reset function #152

Add word cache reset function #152

Conversation

eliotwrobson commented Jun 17, 2023

coveralls commented Jun 17, 2023 • edited Loading

caleb531 left a comment

Choose a reason for hiding this comment

eliotwrobson commented Jun 17, 2023

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

caleb531 left a comment

Choose a reason for hiding this comment

caleb531 commented Jun 17, 2023

eliotwrobson commented Jun 17, 2023

coveralls commented Jun 17, 2023 •

edited

Loading