Left factor parser for function types #606

Gabriella439 · 2018-09-27T17:41:20Z

Fixes #108

This gives a massive (~30x) parsing performance improvement for the benchmark
code from the above issue:

benchmarking Issue #108/Text
time                 169.9 ms   (167.6 ms .. 172.8 ms)
                     1.000 R²   (0.999 R² .. 1.000 R²)
mean                 174.8 ms   (172.6 ms .. 177.3 ms)
std dev              3.525 ms   (2.008 ms .. 5.131 ms)
variance introduced by outliers: 12% (moderately inflated)

After:

time                 5.860 ms   (5.826 ms .. 5.904 ms)
                     1.000 R²   (1.000 R² .. 1.000 R²)
mean                 5.921 ms   (5.902 ms .. 5.939 ms)
std dev              56.42 μs   (46.69 μs .. 67.52 μs)

The root cause was that the parser for function types was introducing
excessive backtracking. This led to parsing performance being exponential
in the number of atomic operatorExpressions.

This change left-factors the expression parser by gutting
annotatedExpression. Specifically, this moves the logic for parsing unannotated
List/Optional/merge into expression and then consolidates the
remaining logic for parsing an ordinary Annot into expression, applying
a fixup for unannotated List/Optional/merge expressions to tag them with
their annotation.

Fixes #108 This gives a massive (~30x) parsing performance improvement for the benchmark code from the above issue: ``` benchmarking Issue #108/Text time 169.9 ms (167.6 ms .. 172.8 ms) 1.000 R² (0.999 R² .. 1.000 R²) mean 174.8 ms (172.6 ms .. 177.3 ms) std dev 3.525 ms (2.008 ms .. 5.131 ms) variance introduced by outliers: 12% (moderately inflated) time 5.860 ms (5.826 ms .. 5.904 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 5.921 ms (5.902 ms .. 5.939 ms) std dev 56.42 μs (46.69 μs .. 67.52 μs) ``` The root cause was that the parser for function types was introducing excessive backtracking. This led to parsing performance being exponential in the number of atomic `operatorExpression`s. This change left-factors the `operatorExpression` parser by gutting `annotatedExpression`. Specifically, this moves the logic for parsing annotated `List`/`Optional`/`merge` into `primitiveExpression` and then consolidating the remaining logic for parsing an ordinary `Annot` into `operatorExpression` so that it no longer has to backtrack.

Gabriella439 · 2018-09-27T18:04:19Z

Note that still needs a little cleanup before merging because it discards the Noted constructors from type annotations. I will polish this a bit more later tonight

f-f · 2018-09-27T18:17:13Z

I can confirm that this also fixes #580 🎉

Running time with this branch is ~1s, which I'd consider good enough.

Thanks for the good work @Gabriel439 and @phadej 👏

This was the pathological example that helped quickly narrow down the problem

`alternative4` now subsumes `alternative5`

…hall-Library into gabriel/left_factor

This was referenced Sep 27, 2018

Improve parser performance #108

Closed

Try parens as first primitive #605

Closed

phadej mentioned this pull request Sep 27, 2018

Wip #607

Closed

Gabriella439 added 3 commits September 27, 2018 15:43

Don't remove Note constructors

bc45335

Add benchmark for deeply-nested parentheses

3d6c087

This was the pathological example that helped quickly narrow down the problem

Merge branch 'master' into gabriel/left_factor

762c590

jneira mentioned this pull request Sep 28, 2018

Performance issues with many deeply nested imports #580

Closed

Gabriella439 added 2 commits September 28, 2018 06:40

Remove unused parsing alternative

2fb8015

`alternative4` now subsumes `alternative5`

Merge branch 'gabriel/left_factor' of github.com:Gabriel439/Haskell-D…

cbb1602

…hall-Library into gabriel/left_factor

Gabriella439 merged commit 218e90a into master Sep 28, 2018

Gabriella439 deleted the gabriel/left_factor branch September 28, 2018 13:51

Gabriella439 mentioned this pull request Mar 22, 2019

echo '[]' | dhall encode | dhall decode #862

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Left factor parser for function types #606

Left factor parser for function types #606

Gabriella439 commented Sep 27, 2018 •

edited

Loading

Gabriella439 commented Sep 27, 2018

f-f commented Sep 27, 2018

Left factor parser for function types #606

Left factor parser for function types #606

Conversation

Gabriella439 commented Sep 27, 2018 • edited Loading

Gabriella439 commented Sep 27, 2018

f-f commented Sep 27, 2018

Gabriella439 commented Sep 27, 2018 •

edited

Loading