
Conversation

@jn1z (Contributor) commented Feb 9, 2024

Add trim and reverse methods for NFAs. (Reverse could be useful in theory for DFAs to NFAs, as well.)

I tried to make these generic, but got stuck in basically the same place as before.

@mtf90 (Member) commented Feb 9, 2024

Just for a better understanding: Does the reversed NFA essentially represent the complement language? And is trimming essentially a minimization of the NFA?

@jn1z (Contributor Author) commented Feb 9, 2024

Does the reversed NFA essentially represent the complement language?

Good question!
The complement language \bar{L} is different from the reversed language L^R.
\bar{L} is recognized by flipping the final and non-final states (of a complete DFA; that trick doesn't work directly on an NFA).
L^R requires swapping the final and initial states and reversing all transitions.

So, for example,
L = {0, 01, 100}
L^R = {0, 10, 001}
\bar{L} = everything else over {0, 1}, e.g., {1, 10, 011, ...}
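
To be precise (this is just the standard textbook construction, sketched): for an NFA A = (Q, \Sigma, \Delta, I, F), the reverse is A^R = (Q, \Sigma, \Delta^R, F, I) with (p, a, q) \in \Delta^R iff (q, a, p) \in \Delta, and then L(A^R) = L(A)^R.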

I'll look for a reference to add to the comments.

is trimming essentially a minimization of the NFA?

Yes, it's an O(m) minimization of the NFA that removes "dead states".
Right-trim removes those that aren't reachable from the initial state(s) -- these are often removed during Subset Construction.
Left-trim removes those that don't reach the final state(s).
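
In code terms it's just two reachability passes -- here's a rough sketch over a plain adjacency map (not the actual PR code and not AutomataLib types), just to show what I mean:

```java
import java.util.*;

// Rough sketch of "trim" over a plain adjacency map (state -> successor states),
// ignoring input symbols for brevity: keep exactly the states that are reachable
// from some initial state AND can reach some final state.
final class TrimSketch {

    // Forward BFS from a set of source states.
    static Set<Integer> reachable(Map<Integer, Set<Integer>> edges, Set<Integer> sources) {
        Set<Integer> seen = new HashSet<>(sources);
        Deque<Integer> queue = new ArrayDeque<>(sources);
        while (!queue.isEmpty()) {
            for (int succ : edges.getOrDefault(queue.poll(), Set.of())) {
                if (seen.add(succ)) {
                    queue.add(succ);
                }
            }
        }
        return seen;
    }

    // Reverse every edge; co-accessibility then becomes plain reachability.
    static Map<Integer, Set<Integer>> reverse(Map<Integer, Set<Integer>> edges) {
        Map<Integer, Set<Integer>> rev = new HashMap<>();
        edges.forEach((src, succs) -> succs.forEach(
                dst -> rev.computeIfAbsent(dst, k -> new HashSet<>()).add(src)));
        return rev;
    }

    // States surviving the trim: accessible (right-trim) and co-accessible (left-trim).
    static Set<Integer> trim(Map<Integer, Set<Integer>> edges, Set<Integer> initial, Set<Integer> finals) {
        Set<Integer> keep = reachable(edges, initial);
        keep.retainAll(reachable(reverse(edges), finals));
        return keep;
    }
}
```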

The "trim" terminology is mentioned here, for example: https://cs.stackexchange.com/questions/159828/

I'll look for a reference to add to the comments.

No rush on this PR, especially if you've got large changes happening. BTW, I'll email you about my goals here; they might be of interest to you.

@mtf90 (Member) commented Feb 9, 2024

Don't worry, there is always stuff to do :D. I'll manage to find some time for your contributions. I'm excited to hear about the broader picture.

The reversal part makes sense so far. I'll have to think a little bit about where to put it best (because you could also reverse a DFA for example, maybe even MealyMachines?).

As for the trimming, maybe we could use some of the existing code. The Minimizer works on arbitrary UniversalGraphs, which NFAs can be transformed to. Maybe something along the lines of Automata#minimize? Or does trim have some other important properties?

@jn1z (Contributor Author) commented Feb 9, 2024

Cool!

As for the trimming, maybe we could use some of the existing code.

Yes, absolutely. trim is not sophisticated. Half of it just applies a quotient, which is already done by the Minimizer. So there could be re-use there.

The reversal part makes sense so far. I'll have to think a little bit about where to put it best (because you could also reverse a DFA for example, maybe even MealyMachines?).

I don't know much about Mealy machines. But according to this, you can't reverse a Mealy machine in general: https://cs.stackexchange.com/questions/83383/reverse-the-input-and-output-of-a-mealy-machine

@mtf90 (Member) commented Feb 9, 2024

I don't know much about Mealy machines. But according to this, you can't reverse a Mealy machine in general: https://cs.stackexchange.com/questions/83383/reverse-the-input-and-output-of-a-mealy-machine

Yeah, reversal may introduce non-determinism. For FSAs this is not a problem, but AutomataLib currently does not support non-deterministic transducers.

@jn1z (Contributor Author) commented Feb 11, 2024

The Minimizer works on arbitrary UniversalGraphs which NFAs can be transformed to.

Oh... does that mean this might already support NFA reduction via "multiple-relation Paige-Tarjan"?!
e.g., [Ilie-Yu], https://www.csd.uwo.ca/~ilie/TisF04.pdf

That would be wonderful if so -- I've had difficulty finding an implementation of that anywhere, but it should be only subtly different from Paige-Tarjan and bisimulation.
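
For reference, my rough understanding of the multiple-relation variant (the standard formulation, nothing library-specific): given a state set Q, an initial partition separating final from non-final states, and one relation E_a per input symbol a, find the coarsest refinement P of the initial partition that is stable with respect to every E_a -- i.e., for all blocks B, S of P and every symbol a, either B \subseteq E_a^{-1}(S) or B \cap E_a^{-1}(S) = \emptyset. The classic Paige-Tarjan problem is the special case of a single relation.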

@mtf90 (Member) commented Feb 12, 2024

I took a closer look at the referenced paper of the Minimizer class. Unfortunately, the abstract states that

We design an algorithm that minimizes irreducible deterministic local automata ...

While the implementation seems to be able to deal with non-deterministic automata (or rather graphs) as well, I observed another problem. Since the implementation works on graphs, it has no knowledge about the automaton-specific semantics and only compares structural properties of states. For example, an accepting state with no outgoing transitions and an accepting state with only rejecting sink-successors would be treated as in-equivalent. There are other cases where this behavior causes confusion. Maybe we can use your work as a kick-off to overhaul / clean up some of the minimization interfaces and be more precise about when to use which method.

As for the trimming / minimization of NFAs: the PaigeTarjanInitializers class has specific methods for minimizeDFA and minimizeMealy. Maybe it is possible to hook into existing code with a special minimizeNFA initialization? Unfortunately, I have never looked into the specific algorithms in detail, so it might take me some time to get into it. If you have any ideas, feel free to go ahead.

@jn1z (Contributor Author) commented Feb 12, 2024

We design an algorithm that minimizes irreducible deterministic local automata ...

I'm hopeful that the reason it doesn't mention NFAs is that, technically, the algorithm can't "minimize" NFAs (that's NP-hard), but only reduce them somewhat.

Also, it mentions that it solves the problem "for a sequence of partial functions", i.e., something similar to a sequence of relations.

For example, an accepting state with no outgoing transitions and an accepting state with only rejecting sink-successors would be treated as in-equivalent.

I believe this is actually what's desired for Ilie-Yu NFA reduction! (In either state, any further transition is rejecting, so they're equivalent. Additionally, all sink states are equivalent to each other, i.e., at most one sink state when quotiented.)

As for the trimming / minimization of NFAs: the PaigeTarjanInitializers class has specific methods for minimizeDFA and minimizeMealy. Maybe it is possible to hook into existing code with a special minimizeNFA initialization?

I actually attempted that: https://github.com/jn1z/NFA-experiments/blob/main/NFAMinimize.java
This initializes PaigeTarjan with the expected data for an NFA. But testing appeared to show this didn't work. I fear this is due to the subtle difference between partition refinement and "multiple-relation partition refinement".

This paper says it adapted the multiple-relation version to Aldebaran (which was replaced with the somewhat-closed-source CADP): https://www.sciencedirect.com/science/article/pii/016764239090071K
"...an adapted version... computing the coarsest partition problem with respect to the family of binary relation instead of one binary relation"

I tried understanding the difference, but there's a subtlety I'm missing.

@jn1z (Contributor Author) commented Feb 12, 2024

As an aside, the Beal-Crochemore paper mentions that:

"We assume that the automaton is trim, i.e., each state is co-accessible: there is at least one successful path starting from this state. This assumption is important to guarantee the correctness of the algorithm."

...which means that the current Minimizer implementation does a trim somewhere?

@jn1z (Contributor Author) commented Feb 12, 2024

would be treated as in-equivalent.

Oh, I misread that as "equivalent". You're right, that's not what we want.

@mtf90 (Member) commented Feb 14, 2024

I skimmed over the Beal/Crochemore paper but find it very hard to understand because it requires a lot of background knowledge about specific sub-classes of automata. I think the code is not a direct implementation of the algorithm but does some additional steps to make it more general. For example, the paper states that

In general, it is not true that a deterministic non-minimal automaton has mergeable states (see the automaton of Figure 2).

While I understand that the automaton in Fig. 2 is a non-AFT automaton, it sounds like the algorithm presented in the paper cannot minimize it. Yet, the implementation can. Furthermore, your remark

...which means that the current Minimizer implementation does a trim somewhere?

is included in the initialize method (which filters all unreachable states via a breadth-first traversal), which supports the idea of additional tweaks.

It may be reasonable to try to extend it to non-deterministic automata, but I would first like to understand what your goal with this PR is. I also read the Ilie/Navarro/Yu paper and personally find the equivalence-based reduction (using PaigeTarjan like in your draft) rather elegant. However, both approaches (left-/right-equivalence) only seem to reduce but not necessarily minimize the NFA (?). Any particular reason why you are interested in this approach but not actual minimization (even though it is PSPACE-complete)?

@jn1z (Contributor Author) commented Feb 14, 2024

it sounds like the algorithm presented in the paper cannot minimize it. Yet, the implementation can.

Strange! Actually, originally I was reading a 2008 version of the Beal-Crochemore paper:
https://hal.science/hal-00620274/document

Which appears clearer to me anyway. That's the one that mentions "trim" (the other one mentions irreducible, i.e., strongly connected, which seems like an even stronger assumption).

(which filters all unreachable states via a breadth-first traversal)

Does that handle the co-accessible case? I think that's required by the paper (then again, the implementation may be doing something different than the algorithm).

I would first like to understand what your goal with this PR is.

I just wanted to add some functionality that I'm using locally. "reverse", in particular, seems like a generic use case.

The PR can be dropped/closed though, that's fine.

I also read the Ilie/Navarro/Yu paper and personally find the equivalence-based reduction (using PaigeTarjan like in your draft) rather elegant.

Me too -- I like that it reduces to Hopcroft in the DFA case.

I have a naive version of the NFA reduction that I can add to my repo in a bit -- it's supposedly O(mn) rather than O(m log n) and as such is too slow for some use cases.
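
For concreteness, the naive version is basically fixed-point partition refinement; roughly something like this sketch (plain Java collections, not the AutomataLib API, and not the exact code from my repo):

```java
import java.util.*;

// Naive right-equivalence (bisimulation-style) refinement for an NFA, roughly O(m*n).
// Representation: trans.get(state).get(symbol) = set of successors; an absent symbol
// means "no successors" (no empty successor sets are stored).
// Two states end up in the same class iff they agree on acceptance and, for every
// symbol, their successors hit the same set of classes.
final class NaiveNfaReduction {

    static int[] rightEquivalenceClasses(int numStates,
                                         List<Map<Character, Set<Integer>>> trans,
                                         Set<Integer> accepting) {
        int[] cls = new int[numStates];
        for (int q = 0; q < numStates; q++) {
            cls[q] = accepting.contains(q) ? 1 : 0; // initial split: accepting vs. non-accepting
        }
        boolean changed = true;
        while (changed) {
            // Signature of a state: its current class plus, per symbol (in canonical
            // order), the set of classes of its successors.
            Map<List<Object>, Integer> signatureToClass = new HashMap<>();
            int[] next = new int[numStates];
            for (int q = 0; q < numStates; q++) {
                List<Object> sig = new ArrayList<>();
                sig.add(cls[q]);
                for (Map.Entry<Character, Set<Integer>> e : new TreeMap<>(trans.get(q)).entrySet()) {
                    Set<Integer> succClasses = new TreeSet<>();
                    for (int succ : e.getValue()) {
                        succClasses.add(cls[succ]);
                    }
                    sig.add(e.getKey());
                    sig.add(succClasses);
                }
                Integer id = signatureToClass.get(sig);
                if (id == null) {
                    id = signatureToClass.size();
                    signatureToClass.put(sig, id);
                }
                next[q] = id;
            }
            changed = !Arrays.equals(cls, next);
            cls = next;
        }
        return cls; // quotient the NFA by these classes to get the reduced NFA
    }
}
```

Each pass is linear in the number of transitions, but there can be up to n passes, which is where the O(mn) comes from (vs. O(m log n) for the multiple-relation Paige-Tarjan approach).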

Any particular reason why you are interested in this approach

I'm interested in making expected-case determinization faster, and in certain cases, these make determinization much faster at a small cost (think of them as "preconditioners").

For a description of the overall goal, I emailed your tu-dortmund.de account a few days ago from my gmail account -- it may have gone to spam ?

@mtf90 (Member) commented Feb 14, 2024

Strange! Actually, originally I was reading a 2008 version of the Beal-Crochemore paper: https://hal.science/hal-00620274/document

Oh wow, the DOI link in the documentation actually links to a different paper than the link name suggests. Your version seems much more digestible. I'll read your version and update the documentation when time allows.

I just wanted to add some functionality that I'm using locally. "reverse", in particular, seems like a generic use case.
...
I have a naive version of the NFA reduction that I can add to my repo in a bit -- it's supposedly O(mn) rather than O(m log n) and as such is too slow for some use cases.

I think keeping the PR for now is fine, even if it is just for experimentation. Maybe converting it to a draft would be cleaner, though, at least until things are finalized.

I'm interested in making expected-case determinization faster, and in certain cases, these make determinization much faster at a small cost (think of them as "preconditioners").

For a description of the overall goal, I emailed your tu-dortmund.de account a few days ago from my gmail account -- it may have gone to spam ?

Unfortunately, I haven't received any mail. Maybe try re-sending it; Gmail addresses shouldn't be blocked, as far as I know. Otherwise, if you are comfortable with having this discussion publicly, feel free to create a thread on the discussion page.

@jn1z jn1z marked this pull request as draft February 14, 2024 20:58
@jn1z (Contributor Author) commented Feb 14, 2024

Thanks! I'll create a discussion thread about NFA reductions.

Some of the remainder of the goal is original research (and, also, not proven to be correct yet ;-) ), so not ready for a public forum.

@mtf90 mtf90 marked this pull request as ready for review May 26, 2024 21:12
@mtf90 mtf90 merged commit 1994a6e into LearnLib:develop May 26, 2024