Limit trivia by AST MainNode name. #992

nojaf · 2020-08-07T09:05:40Z

Hey @jindraivanek, in this PR I've experimented with limiting the allowed match for TriviaNode by the AST MainNode name.
I'd like to hear your opinion on this one.

So the problem was that when printing the trivia of the TypeDefnSig it also prints the trivia for the SynTypeDefnSimpleRepr.Union because they have exactly the range.

That is why as a test I further limited the allowed TriviaNodes by type name.

Potentially it can:

Speed up finding the correct trivia node as we further limit the set of options.

genTriviaFor ["TypeDefnSig"] range

Allow us to be sure that we only print the trivia once and that could lead that we don't need to clean up the already printed trivia.
This could be beneficial for the benchmark.

If you think this has some value I can flesh out this PR a bit more.
Please let me know what you think.
Many thanks.

nojaf · 2020-08-08T18:06:56Z

So I've completed the exercise out of curiosity sake and it turns out this is not good for the performance:

| Method |    Mean |   Error |   StdDev | Rank |       Gen 0 |      Gen 1 |      Gen 2 | Allocated |
|------- |--------:|--------:|---------:|-----:|------------:|-----------:|-----------:|----------:|
| Format | 9.571 s | 3.266 s | 0.1790 s |    1 | 141000.0000 | 48000.0000 | 11000.0000 |    2.4 GB |

I was actually expecting that

let private findTriviaMainNodeFromNameAndRange nodes ((mainNodesNames, range): string list * range) =
    nodes
    |> List.tryFind(fun n ->
        match n.Type with
        | MainNode(mn) ->
            List.contains mn mainNodesNames && RangeHelpers.rangeEq n.Range range
        | _ -> false)

would have sped things up.

nojaf · 2020-08-09T10:23:49Z

Splitting up the trivia main nodes by types upfront improves the situation yet nothing all that significant:

        let triviaByNodes =
            trivia
            |> List.choose (fun tn ->
                match tn.Type with
                | MainNode(mn) -> Some (mn, tn)
                | _ -> None)
            |> List.groupBy fst
            |> List.map (fun (k, g) -> k, List.map snd g)
            |> Map.ofList

| Method |    Mean |    Error |   StdDev | Rank |       Gen 0 |      Gen 1 |      Gen 2 | Allocated |
|------- |--------:|---------:|---------:|-----:|------------:|-----------:|-----------:|----------:|
| Format | 5.264 s | 0.0300 s | 0.0016 s |    1 | 103000.0000 | 36000.0000 | 11000.0000 |   2.37 GB |

sepNlnConsideringTriviaContentBefore

sepNlnConsideringTriviaContentBefore usages.

…butes

nojaf · 2020-08-09T15:14:36Z

Slowly getting there:

| Method |    Mean |    Error |   StdDev | Rank |       Gen 0 |      Gen 1 |      Gen 2 | Allocated |
|------- |--------:|---------:|---------:|-----:|------------:|-----------:|-----------:|----------:|
| Format | 4.625 s | 0.0928 s | 0.0051 s |    1 | 103000.0000 | 33000.0000 | 11000.0000 |   2.37 GB |

The more calls that can be replaced by specific main node or token calls the better it the numbers are.

nojaf · 2020-08-09T19:24:22Z

Booya 🥰

| Method |    Mean |    Error |   StdDev | Rank |       Gen 0 |      Gen 1 |      Gen 2 | Allocated |
|------- |--------:|---------:|---------:|-----:|------------:|-----------:|-----------:|----------:|
| Format | 3.913 s | 0.8803 s | 0.0483 s |    1 | 149000.0000 | 55000.0000 | 20000.0000 |   2.38 GB |

jindraivanek · 2020-08-10T06:42:35Z

I like the general idea. 👍

I think that with increased use of MainType we should replace MainType string with enum. Maybe use code from my abandoned experiment https://github.com/jindraivanek/fantomas/blob/6770151d604fe6d67bff0495dc2001ae1fe31813/src/Fantomas/FsAstTypes.fs ? Or at least use single case DU.

nojaf · 2020-08-10T07:03:55Z

Good idea, that would speed things up. We could do the same thing for the by Token Map in Context as well.

 TriviaMainNodes: Map<string, TriviaNode list>
 TriviaTokenNodes: Map<string, TriviaNode list>

One thing I haven't mentioned here is that although the current tests pass, I did not take every possible scenario into account.
From time to time I was able to remove a genTrivia call completely and all the tests still pass so we need to be careful that we only call it when we really expect it.
Or to put it simply: we will gain some speed yet lose some correct results because we didn't have a test for them.
I'm pretty ok with this in a post v4 world, it will lead to some easy bugs and is worth the gain.

nojaf added 2 commits August 7, 2020 10:39

Proof of concept to limit trivia by AST MainNode name.

dd21a72

Correct assumptions about the mainNode names.

5c121ee

nojaf requested a review from jindraivanek August 7, 2020 09:05

nojaf linked an issue Aug 7, 2020 that may be closed by this pull request

FSI formatting does the wrong thing with comments on single-case DU #965

Closed

3 tasks

nojaf added 2 commits August 8, 2020 19:40

Don't remove trivia nodes after they have been printed.

ed49927

No longer confused.

40db5eb

nojaf added 2 commits August 8, 2020 20:07

Remove commented code

cf27551

Store TriviaNodes by MainNode type.

20e8685

nojaf added 6 commits August 9, 2020 12:43

Improve hasLineCommentAfterInfix

7c29839

sepConsideringTrivia for specific main nodes.

c31b9b2

Remove unnecessary

ce6adc4

sepNlnConsideringTriviaContentBefore

Replaced some

011db99

sepNlnConsideringTriviaContentBefore usages.

Store token trivia nodes separate.

1ae11a4

Replaced some usages of sepNlnConsideringTriviaContentBeforeWithAttri…

dfa41e0

…butes

nojaf added 4 commits August 9, 2020 19:32

Remove sepNlnConsideringTriviaContentBeforeWithAttributes

096fa34

Replaced some enterNode with enterNodeFor.

d2c102f

Remove genTrivia

b779b98

Remove enterNode, remove leaveNode, remove node

b5c3e85

nojaf added 6 commits August 10, 2020 19:06

Use FsAstType as main node type.

17df746

Merged master

d59a487

Remove additional check.

831bbff

Use RangeHelpers.rangeEq instead of =

eaaa331

Introduced FsTokenType

cb2a0ed

Optimize getCharContent

abddcf4

nojaf added 10 commits August 21, 2020 11:15

Fix newline before match.

4a4b9ce

Fix function call with LPAREN_STAR_RPAREN token.

548cee7

Don't add newline before record expr.

6b1cbbc

Keep newline before tuple.

23fa551

Keep newline before do bang.

4a5f105

Keep newline before tuple in match clause.

b388b0d

Keep newline before do bang.

a558107

Don't repeat newline before SynExpr.Paren.

568bfb2

Keep newline before SynExpr.TryWith.

286b8a2

No extra newline before SynExpr.AnonRecd.

ccd8ab5

nojaf marked this pull request as ready for review August 25, 2020 20:00

nojaf added 5 commits August 27, 2020 17:38

Merge branch 'master' into fix-965

a325a1c

genTriviaFor SynModuleDecl_HashDirective.

88388cb

Fixed recursive types.

13c603c

Format AstTransformer.fs

4fdb2a5

Code clean up

4b2d33c

nojaf changed the title ~~Proof of concept to limit trivia by AST MainNode name.~~ Limit trivia by AST MainNode name. Aug 28, 2020

nojaf added 2 commits August 28, 2020 09:31

Code clean up

6eab555

Merge branch 'fix-965' of https://github.com/nojaf/fantomas into fix-965

841a96d

nojaf merged commit 7f524a3 into fsprojects:master Aug 28, 2020

nojaf deleted the fix-965 branch August 28, 2020 07:55

nojaf mentioned this pull request Sep 4, 2020

Extra new line before for loop in computation expression #1092

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limit trivia by AST MainNode name. #992

Limit trivia by AST MainNode name. #992

nojaf commented Aug 7, 2020 •

edited

nojaf commented Aug 8, 2020

nojaf commented Aug 9, 2020

nojaf commented Aug 9, 2020 •

edited

nojaf commented Aug 9, 2020

jindraivanek commented Aug 10, 2020

nojaf commented Aug 10, 2020

Limit trivia by AST MainNode name. #992

Limit trivia by AST MainNode name. #992

Conversation

nojaf commented Aug 7, 2020 • edited

nojaf commented Aug 8, 2020

nojaf commented Aug 9, 2020

nojaf commented Aug 9, 2020 • edited

nojaf commented Aug 9, 2020

jindraivanek commented Aug 10, 2020

nojaf commented Aug 10, 2020

nojaf commented Aug 7, 2020 •

edited

nojaf commented Aug 9, 2020 •

edited