Left recursion fix #75

Victorious3 · 2018-08-14T13:33:08Z

Fixes #57, fixes #69; see the discussion there for context.
Now fixes #27 as well.

This resolves left recursion when the grammar is created, instead of figuring it out on the fly. To do that, it runs code in leftrec.py that detects left-recursive cycles and marks one rule as left recursive to break the cycle. For code generation, this information is saved with a new annotation, @leftrec, that can be used together with @tatsumasu.
Furthermore, the annotation @nomemo is used to block memoization on rules that are part of a left-recursive cycle.
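
The detection step described above can be pictured as a depth-first search over "leftmost call" edges. The following is only a minimal sketch under stated assumptions, not the code in leftrec.py: the function name `find_leftrec_rules` and the precomputed `leftmost` mapping are hypothetical, and a real grammar would also need a nullability analysis to build that mapping.

```python
from typing import Dict, List, Set


def find_leftrec_rules(leftmost: Dict[str, List[str]]) -> Set[str]:
    """Return the rules at which a left-recursive cycle closes.

    `leftmost` is a hypothetical, precomputed relation: it maps each rule
    name to the rules it can invoke before consuming any input.
    """
    marked: Set[str] = set()
    done: Set[str] = set()   # rules whose outgoing edges are fully explored
    stack: List[str] = []    # rules on the current depth-first path

    def visit(rule: str) -> None:
        if rule in done:
            return
        if rule in stack:
            # Back edge: the path returned to `rule` without consuming input,
            # so mark it as the rule that breaks this cycle.
            marked.add(rule)
            return
        stack.append(rule)
        for callee in leftmost.get(rule, []):
            visit(callee)
        stack.pop()
        done.add(rule)

    for name in leftmost:
        visit(name)
    return marked


# expr -> sub -> expr is a cycle; only `expr` gets marked.
grammar = {"start": ["expr"], "expr": ["sub", "num"], "sub": ["expr"], "num": []}
print(find_leftrec_rules(grammar))  # {'expr'}
```

Marking only the rule where the cycle closes keeps the other rules in the cycle as ordinary rules. Interlocking cycles can lead to more than one marked rule in this sketch.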

TODO:

This doesn't account for memoization yet, mainly because I don't have a failing test for that.
Interlocking cycles? See https://github.com/PhilippeSigaud/Pegged/wiki/Left-Recursion - Not the subject of this PR
I don't like how it deals with RuleRef at all, ideally those should be resolved before the analysis is run, but that would cause a cascade of changes.
Could write some sort of walker class for pretties
Figure out how SkipTo (->) and Cut (~) interact with left recursion
Now that "Parser drops part of input #27" is fixed, I should check whether the simpler left recursion detection was actually sufficient, which would make everything simpler - Not in the way it was before

This still doesn't cover all cases, but we are getting closer. The PR adds new failing test cases (skipped for now).

apalala · 2018-08-14T17:38:17Z

Wow! I'll take a look at this ASAP!

Victorious3 · 2018-08-14T18:13:10Z

Travis is still complaining but I don't know how to get it to shut up about those warnings, so I'll leave it like this for now

apalala · 2018-08-14T20:46:37Z

Hi @Victorious3. This branch has the modifications required to let type checking and unit tests pass:

https://github.com/neogeny/TatSu/tree/Victorious3-left_recursion_fix

The two skipped left-recursion tests are for valid grammars, and they point to defects in either the implementation of left-recursion, or the algorithm.

There are other left recursive grammars in the TatSu issues that also fail, and should probably be part of the unit tests.

I'll take another look at all this in the next few days.

Victorious3 · 2018-08-18T00:11:44Z

test/grammar/left_recursion_test.py

@@ -310,8 +307,8 @@ def test_left_recursion_bug(self, trace=False):
            start = expression $ ;

            expression =
-                | paren_expression


It doesn't consider minus_expression a valid alternative after it successfully parsed a paren_expression,
therefore (3 - 2) - 1 was correctly failing.
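
The behaviour described here follows from PEG ordered choice: once an alternative succeeds, later alternatives at that position are not tried, even if the rest of the parse then fails. Below is a minimal sketch with hand-written combinators (all names are hypothetical, none of this is TatSu code, and the rules are simplified to avoid left recursion) showing how the order of alternatives decides whether (3 - 2) - 1 parses.

```python
import re
from typing import Callable, Optional

# A parser takes (text, position) and returns the new position, or None.
Parser = Callable[[str, int], Optional[int]]


def token(pattern: str) -> Parser:
    regex = re.compile(r"\s*" + pattern)  # skip leading whitespace

    def parse(text: str, pos: int) -> Optional[int]:
        m = regex.match(text, pos)
        return m.end() if m else None
    return parse


def seq(*parts: Parser) -> Parser:
    def parse(text: str, pos: int) -> Optional[int]:
        for part in parts:
            pos = part(text, pos)
            if pos is None:
                return None
        return pos
    return parse


def choice(*alternatives: Parser) -> Parser:
    """PEG ordered choice: commit to the first alternative that succeeds."""
    def parse(text: str, pos: int) -> Optional[int]:
        for alt in alternatives:
            end = alt(text, pos)
            if end is not None:
                return end  # later alternatives are never considered
        return None
    return parse


def parses_fully(rule: Parser, text: str) -> bool:
    end = rule(text, 0)
    return end is not None and text[end:].strip() == ""


number = token(r"\d+")
paren = seq(token(r"\("), number, token("-"), number, token(r"\)"))
minus = seq(paren, token("-"), number)

paren_first = choice(paren, minus)  # stops after "(3 - 2)"
minus_first = choice(minus, paren)  # tries the longer form first

print(parses_fully(paren_first, "(3 - 2) - 1"))  # False
print(parses_fully(minus_first, "(3 - 2) - 1"))  # True
```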

Victorious3 · 2018-08-18T00:25:13Z

Those tests literally took forever without memoization, I better get to fixing that ^^

Victorious3 · 2018-08-18T08:40:30Z

@apalala This is pretty much done apart from cosmetic changes, and hopefully good enough to let me continue my project.

Victorious3 · 2018-08-18T18:10:06Z

I couldn't come up with a test case for -> that is also left recursive at the same time, it sorta goes against the point of it. However, since -> is supposed to advance the input in all cases, and this is disabled inside left recursive propagation, there might be some weird edge cases. ~ seems to do alright and is covered by an already existing test case.

While doing this I also discovered that @@nameguard :: False isn't respected, but I'll make a separate issue about that.

leftrec.py could use some work to make it more pretty but not doing it in this PR, I'll leave the TODOs open for that.

So yea, that's it from my side unless you have anything I should change to get it merged.

Victorious3 · 2018-09-02T13:12:47Z

The bootstrap test doesn't include anything related to left recursion, it would be a good idea to make sure that the tests are run twice, once directly and once using the generated parser

Victorious3 · 2018-09-02T14:45:16Z

It's still pretty conservative at doing memos during left recursion but it should be solid at least. I ran my project using it and went down from 37s for parsing 1000 lines of 1 + 1 to about 20s, no failures. The expression grammar relies entirely on left recursive rules with multiple precedence levels.

Victorious3 · 2019-04-02T16:21:23Z

I didn't have any problems with this over the course of several months; from my perspective it's ready to merge?

apalala · 2019-04-02T19:36:48Z

I'm sorry I haven't had the time to review this in detail, @Victorious3 . You're in charge now. If you think it's ready, then it is. I'd just check that all the non-leftrec unit tests pass. I see some merge conflicts, but I'm sure you're aware of them. Do the docs need any updates?

Victorious3 · 2019-04-03T14:32:29Z

There's one issue with how the unit tests run currently. There are significant differences between how the parser behaves when running from the generated code and when running in immediate. TatSu's self test doesn't cover all the features that have been added, so for completeness all the other unit tests should run twice.

The docs don't need changes, it's running "as advertised" now ^^

Victorious3 added 10 commits August 7, 2018 22:49

Remove NodePreOrderWalker

cfc3978

It doesn't seem to be used anywhere, and it's broken (walk_node vs walk_Node). PreOrderWalker does the same thing

Add calculator example for testing

827cd09

Groundwork for Nullability

ad02bfe

Nullability check

9fd895e

Didn't test it properly but it seems to work. I'm not exactly happy with some of the code (mainly how it deals with RuleRef).

Add VSCode to gitignore

f5c76f4

Refactor unnecessary parameters

7d2e03b

Somewhat workable, commit before I fix handling of RuleRef

8d5c9fb

Some cleanup

e8013e0

The real solution would be to resolve RuleRef beforehand and replace them with the correct Rule instance, but that would mess with code generation

Add codegen, tests are passing

ee9216b

Make flake happy

a37868b

Fix for 2.7

0b2007a

apalala added 7 commits August 14, 2018 16:02

[leftrec] allow mypy tests to pass

4f53059

https://scrapinghub.atlassian.net/browse/left_recursion_fix

[test] disable graph tests under Py37

563e561

https://scrapinghub.atlassian.net/browse/left_recursion_fix

[util] add constant for PY37

65fd548

https://scrapinghub.atlassian.net/browse/left_recursion_fix

allow update of generated files

6900c8e

[leftrec] guard against nodes not in rule_dict

56807e6

https://scrapinghub.atlassian.net/browse/left_recursion_fix

[examples] ignore generated files

8830f92

https://scrapinghub.atlassian.net/browse/left_recursion_fix

[leftrec] ensure Py27 compatibility

cb297c0

Victorious3 commented Aug 15, 2018

tatsu/contexts.py

Trying to fix the left recursion algorithm

f0d9e3b

The last implementation was advancing the input way too much; this resets it after every rule invocation. I can get the tests to pass by stripping all whitespace, so something's off with that.
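
Resetting the input before every attempt is also what the well-known "seed growing" approach to direct left recursion does. The sketch below is not the code in tatsu/contexts.py; it is a hypothetical, self-contained parser (class and method names are assumptions) for `expr = expr '-' num | num` that restarts at the same position on every iteration and keeps the longest result.

```python
import re
from typing import Dict, Optional, Tuple

Result = Optional[Tuple[object, int]]  # (value, next position), or None on failure


class SeedParser:
    """Hypothetical parser for: expr = expr '-' num | num (not TatSu code)."""

    def __init__(self, text: str) -> None:
        self.text = text
        self.seeds: Dict[int, Result] = {}  # start position -> best result so far

    def token(self, pattern: str, pos: int) -> Result:
        m = re.compile(r"\s*(" + pattern + ")").match(self.text, pos)
        return (m.group(1), m.end()) if m else None

    def expr(self, pos: int) -> Result:
        if pos in self.seeds:
            # Recursive call at the same position: answer with the current seed.
            return self.seeds[pos]
        self.seeds[pos] = None  # first pass: the left-recursive branch fails
        while True:
            # Every attempt restarts at `pos`; the input is reset, not advanced.
            attempt = self.expr_body(pos)
            best = self.seeds[pos]
            if attempt is None or (best is not None and attempt[1] <= best[1]):
                break  # no progress: keep the previous seed
            self.seeds[pos] = attempt
        return self.seeds.pop(pos)

    def expr_body(self, pos: int) -> Result:
        left = self.expr(pos)  # left-recursive alternative
        if left is not None:
            op = self.token("-", left[1])
            right = self.token(r"\d+", op[1]) if op else None
            if right is not None:
                return ((left[0], "-", int(right[0])), right[1])
        num = self.token(r"\d+", pos)  # fallback alternative
        return (int(num[0]), num[1]) if num else None


print(SeedParser("3 - 2 - 1").expr(0))  # (((3, '-', 2), '-', 1), 9)
```

The result nests to the left, which is the associativity a left-recursive subtraction rule is meant to produce.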

Victorious3 mentioned this pull request Aug 17, 2018

Parser drops part of input #27

Closed

Now eating whitespace correctly

edd9449

One of the grammars was actually broken (not PEG)

Victorious3 commented Aug 18, 2018

(snow-)flake to kick the tests off

4fbcf7f

Victorious3 added 2 commits August 18, 2018 09:29

Turn memoization back on and leave it off for left recursive progression

2a1d4b8

This still discards many useful memos but for now this is the best I can do

Add two failing tests

debab57

Getting these two to run needs some more changes. However, the examples are non-trivial and hard to understand, so I don't think these cases are a common occurrence.

Victorious3 added 4 commits August 30, 2018 10:17

Actually, let's make is_leftrec part of RuleRef

4845f15

I don't think @tatsumasu can accept any other parameters. It's still a bit ugly but better than before. @leftrec now needs to be applied before @tatsumasu
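
Why the order matters can be shown with plain Python decorators. This is a sketch under assumptions, not TatSu's implementation: `leftrec` and `nomemo` here are hypothetical markers that only set an attribute, `rule` is a hypothetical stand-in for a wrapper such as @tatsumasu, and the attribute names are invented for illustration.

```python
import functools


def leftrec(method):
    """Hypothetical marker: flag the rule method as left recursive."""
    method.is_leftrec = True
    return method


def nomemo(method):
    """Hypothetical marker: flag the rule method as not memoizable."""
    method.is_memoizable = False
    return method


def rule(method):
    """Hypothetical stand-in for a rule wrapper such as @tatsumasu.

    It reads the flags once, at decoration time, so the markers must
    already have been applied to `method`.
    """
    is_leftrec = getattr(method, "is_leftrec", False)
    is_memoizable = getattr(method, "is_memoizable", True)

    @functools.wraps(method)
    def wrapper(self, *args, **kwargs):
        self.calls.append((method.__name__, is_leftrec, is_memoizable))
        return method(self, *args, **kwargs)
    return wrapper


class Demo:
    def __init__(self):
        self.calls = []

    @rule      # applied last: sees the flag set below
    @leftrec   # applied first: decorators run bottom-up
    def expr(self):
        return "expr"

    @leftrec   # applied too late: the wrapper has already read the flags
    @rule
    def broken(self):
        return "broken"

    @rule
    @nomemo
    def term(self):
        return "term"


demo = Demo()
demo.expr()
demo.broken()
demo.term()
print(demo.calls)
# [('expr', True, True), ('broken', False, True), ('term', False, False)]
```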

Add @nomemo

06e3bd4

Turns memoization back on for rules that aren't part of a left recursive cycle. This should improve performance significantly, previously all memoization was turned off when any left recursion took place.
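
The effect of switching memoization off per rule, rather than globally, can be sketched with a small packrat-style cache. The class below is hypothetical (the names `MemoizingParser`, `apply` and `evaluations` are assumptions, not TatSu's API); it only caches results for rules that are not in the `nomemo` set.

```python
from typing import Callable, Dict, Optional, Set, Tuple


class MemoizingParser:
    """Hypothetical packrat-style cache that skips selected rules."""

    def __init__(self, text: str, nomemo: Set[str]) -> None:
        self.text = text
        self.nomemo = nomemo  # rules that are part of a left-recursive cycle
        self.memo: Dict[Tuple[str, int], Optional[int]] = {}
        self.evaluations: Dict[str, int] = {}  # how often each rule body ran

    def apply(self, name: str, body: Callable[[int], Optional[int]],
              pos: int) -> Optional[int]:
        key = (name, pos)
        if name not in self.nomemo and key in self.memo:
            return self.memo[key]  # cache hit: the body is not re-run
        self.evaluations[name] = self.evaluations.get(name, 0) + 1
        result = body(pos)
        if name not in self.nomemo:
            self.memo[key] = result
        return result

    def digit(self, pos: int) -> Optional[int]:
        ok = pos < len(self.text) and self.text[pos].isdigit()
        return pos + 1 if ok else None


parser = MemoizingParser("7", nomemo={"expr"})
for _ in range(3):
    parser.apply("num", parser.digit, 0)   # cached after the first call
    parser.apply("expr", parser.digit, 0)  # re-evaluated every time
print(parser.evaluations)  # {'num': 1, 'expr': 3}
```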

Flake

65643d1

Fix python 2/3 inconsistency

18bf011

It's probably a good idea to return False here

6ce98ba

Add __slots__ for faster access

b4f8a8b

Victorious3 requested a review from apalala October 12, 2018 14:38

Merge branch 'master' into left_recursion_fix

8f96ca2

Merge branch 'master' into left_recursion_fix

6c688ad

Victorious3 mentioned this pull request Apr 10, 2019

Left recursion bug when not at top level? #94

Closed

apalala merged commit 617a979 into neogeny:master Apr 21, 2019
