Support for Reporting Multiple Compiler Errors #139

FrobtheBuilder · 2021-08-11T21:46:48Z

Make the Bebop compiler report multiple errors.

These include validation errors and parsing errors. I had to rework the parser significantly so that it can recover from invalid input. It's not perfect: it doesn't retain context well while recovering, which causes occasional spurious "not a top level definition!" errors following a syntax error, especially in unions. But it works really well other than that.
When the parser encounters an unexpected token, it attempts to skip forward to the next token that could be valid within the closest enclosing context. For example, within a definition it will attempt to skip to a token that should start a new field, or if it can't find one, it will skip to the next top level definition. In the worst case, if an exception is thrown all the way to the top level Parse method, it will skip to the end of the current file.

I also moved the test schemas around since I wanted to add bebop.json project files for both the valid and invalid schemas, and the compiler always searches recursively, so they needed to be separated into different directories.
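The recovery strategy described in this PR is a form of panic-mode recovery. Below is a minimal, self-contained sketch of the idea, not the code in this PR: `RecoverySketch`, `SkipToNearest`, and the reduced `TokenKind` enum are hypothetical names made up for illustration, and the sketch assumes the token list always ends with `EndOfFile`. It stops at the first token that belongs either to the innermost follow set or to the set of top-level definition starters.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical, reduced token kinds for illustration only.
public enum TokenKind { Identifier, Semicolon, CloseBrace, Enum, Struct, Message, Union, Garbage, EndOfFile }

public static class RecoverySketch
{
    // Token kinds that can start a new top-level definition.
    public static readonly HashSet<TokenKind> TopLevel =
        new() { TokenKind.Enum, TokenKind.Struct, TokenKind.Message, TokenKind.Union };

    // Returns the index of the first token at or after `start` whose kind is in
    // `inner` (the closest context) or in TopLevel (the fallback context).
    // If none is found, returns the last index, assumed to hold EndOfFile.
    public static int SkipToNearest(IReadOnlyList<TokenKind> tokens, int start, HashSet<TokenKind> inner)
    {
        for (var i = start; i < tokens.Count; i++)
        {
            if (inner.Contains(tokens[i]) || TopLevel.Contains(tokens[i]))
            {
                return i;
            }
        }
        return tokens.Count - 1;
    }
}
```

With the tokens `Garbage, Garbage, Identifier, Struct, EndOfFile` and an inner follow set of `{ Identifier }`, the sketch resumes at index 2 (the next possible field). With an empty inner set it resumes at index 3 (the next top-level definition).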

andrewmd5 · 2021-08-11T22:07:16Z

Compiler/Properties/launchSettings.json

+    "Oops All Failures": {
+      "commandName": "Project",
+      "commandLineArgs": "--check",
+      "workingDirectory": "C:\\Users\\Frob\\Projects\\bebop\\Laboratory\\Schemas\\ShouldFail"


These should be relative.

Yeah I would have done that, but I'm not sure exactly what it's relative TO and I couldn't find any info on it. Can you enlighten me?

FrobtheBuilder · 2021-08-12T03:46:33Z

What is the purpose of "DummyDefinition"?

Mainly a contrivance I created so I wouldn't have to make definitions nullable in a bunch of places, but I can do that instead if you'd prefer.

lynn

Looks great! I only have some small comments.

My one meta-comment is that there is a bit of this:

bool errored = false;
while (!Eat(TokenKind.CloseBrace)) {
  if (errored && unrecoverable) {
    CancelScope();
    return null;
  }
  errored = false;
  try {
    ParseSomeStuff(); // placeholder for the actual parsing steps
    if (wrong) throw new SomeError();
  } catch (SpanException e) {
    _errors.Add(e);
    errored = true;
    SkipUntil(whatever);
    continue;
  }
}

It looks like errored can be eliminated to simplify the control flow a little:

while (!Eat(TokenKind.CloseBrace)) {
  try {
    ParseSomeStuff(); // placeholder for the actual parsing steps
    if (wrong) throw new SomeError();
  } catch (SpanException e) {
    _errors.Add(e);
    SkipUntil(whatever);
    if (unrecoverable) {
      CancelScope();
      return null;
    }
    continue;
  }
}

(I would actually be happy if we could eliminate try-catch from the parsing logic entirely, because it feels weird how SpanExceptions are both "things we throw" and "things we make a careful list of", and when we do throw one, we immediately catch it and put it in the list anyway. But I see how it's useful if you have a bunch of parsing steps that could go wrong and you want to recover from them all in the same way. If only C# had monads, or something.)
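One way to read the wish for exception-free parsing is the familiar `Try...` pattern: each parsing step returns `false` and records an error itself, and the loop decides how to recover. The sketch below is an assumption-laden illustration, not what was merged: `ParseError`, `FieldListParser`, `TryParseField`, and `SkipPast` are hypothetical, and tokens are plain strings to keep it self-contained.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical error record; stands in for SpanException in this sketch.
public sealed class ParseError
{
    public string Message { get; }
    public int Position { get; }
    public ParseError(string message, int position) { Message = message; Position = position; }
}

public sealed class FieldListParser
{
    private readonly IReadOnlyList<string> _tokens;
    private int _index;
    public List<ParseError> Errors { get; } = new();

    public FieldListParser(IReadOnlyList<string> tokens) => _tokens = tokens;

    // Parses `name ;` pairs until "}" or the end of input, collecting errors
    // in Errors instead of throwing.
    public List<string> ParseFields()
    {
        var fields = new List<string>();
        while (_index < _tokens.Count && _tokens[_index] != "}")
        {
            if (TryParseField(out var name)) fields.Add(name);
            else SkipPast(";"); // recover at the start of the next field
        }
        return fields;
    }

    private bool TryParseField(out string name)
    {
        name = _tokens[_index];
        if (name.Length == 0 || !char.IsLetter(name[0]))
        {
            Errors.Add(new ParseError($"Expected a field name but found '{name}'.", _index));
            return false;
        }
        if (_index + 1 >= _tokens.Count || _tokens[_index + 1] != ";")
        {
            Errors.Add(new ParseError($"Expected ';' after '{name}'.", _index + 1));
            return false;
        }
        _index += 2;
        return true;
    }

    // Advances past the next `kind` token, stopping early at "}" or end of input.
    // Always consumes at least one token when called from ParseFields.
    private void SkipPast(string kind)
    {
        while (_index < _tokens.Count && _tokens[_index] != "}")
        {
            if (_tokens[_index++] == kind) return;
        }
    }
}
```

For the input `a ; 1 ; b ; }` this yields the fields `a` and `b` plus a single recorded error at position 2. The trade-off is that every step has to thread a success flag by hand, which is the boilerplate that try-catch (or a monad) would hide.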

lynn · 2021-08-12T18:38:01Z

Core/Exceptions/Exceptions.cs

-        public InvalidUnionBranchException(Definition definition)
-            : base($"The definition '{definition.Name}' cannot be used as a union branch. Valid union branches are messages and structs.", definition.Span, 113)
+        public InvalidUnionBranchException(Definition? definition)
+            : base($"The definition '{definition?.Name ?? "null"}' cannot be used as a union branch. Valid union branches are messages and structs.", definition?.Span ?? new Span(), 113)


I don't see a good reason for this parameter to become nullable: an error message like "The definition 'null' cannot be used as a union branch." with an empty span is not useful. And at the call site, it doesn't look like definition can be null.

You're right. There was a point where I had a call to that without an argument available but it's gone now.

lynn · 2021-08-12T18:41:01Z

Core/Parser/SchemaParser.cs

@@ -20,6 +20,8 @@ namespace Core.Parser
 {
    public class SchemaParser
    {
+        private readonly HashSet<TokenKind> _topLevelDefinitionKinds = new() { TokenKind.Enum, TokenKind.Struct, TokenKind.Message, TokenKind.Union };
+        private readonly HashSet<TokenKind> _universalFollowKinds = new() { TokenKind.Enum, TokenKind.Struct, TokenKind.Message, TokenKind.Union, TokenKind.EndOfFile };


What does "universal follow kinds" mean? Does this mean: if we get confused, we can resume parsing from any token-kind in this set? Maybe it is worthy of a small doc comment.

Yeah, it's the bottom level follow set for pretty much everything. I could add a comment.
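A doc comment along the lines discussed here might look like the sketch below. The wording of the comments is a guess at the intent based on this thread, and the `FollowSets` class and the reduced `TokenKind` enum are hypothetical stand-ins so the snippet compiles on its own.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical, reduced token kinds for illustration only.
public enum TokenKind { Enum, Struct, Message, Union, EndOfFile }

public static class FollowSets
{
    /// <summary>
    /// Token kinds that can begin a top-level definition.
    /// </summary>
    public static readonly HashSet<TokenKind> TopLevelDefinitionKinds =
        new() { TokenKind.Enum, TokenKind.Struct, TokenKind.Message, TokenKind.Union };

    /// <summary>
    /// The bottom-level follow set: token kinds at which parsing can resume
    /// after an error regardless of context. That is, the start of any
    /// top-level definition, or the end of the file.
    /// </summary>
    public static readonly HashSet<TokenKind> UniversalFollowKinds =
        new(TopLevelDefinitionKinds) { TokenKind.EndOfFile };
}
```

Building the second set from the first keeps the two from drifting apart if another top-level definition kind is added later.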

lynn · 2021-08-12T18:48:35Z

Core/Parser/SchemaParser.cs

+            additionalTokens ??= new();
+            ConsumeBlockComments();
+            if (CurrentToken.Kind != kind)
+            {
+                _errors.Add(new UnexpectedTokenException(kind, CurrentToken, hint));
+                while (_index < _tokens.Count - 1 && CurrentToken.Kind != kind && !additionalTokens.Contains(CurrentToken.Kind))
+                {
+                    _index++;
+                }
+            }


I think this could be DRY-er if we just call ExpectAndSkip(new HashSet<TokenKind>{ kind }, additionalTokens, hint), performance be damned (it only affects the very first if, anyway).

Nah, the problem there is that it will add another error to the log. I could probably just use that SkipUntil method I added though.

Wait, never mind, I thought you were looking at a different part.

lynn · 2021-08-12T18:50:16Z

Core/Parser/SchemaParser.cs

+        {
+            additionalTokens ??= new();
+            ConsumeBlockComments();
+            if (kinds.All(kind => kind != CurrentToken.Kind))


Maybe just if (kinds.Contains(CurrentToken.Kind)) return;, then un-indent the lines below.

lynn · 2021-08-12T18:53:55Z

Core/Parser/SchemaParser.cs

+        private void SkipUntil(HashSet<TokenKind> kinds)
+        {
+            // Always advance by one.
+            if (_index < _tokens.Count - 1)
+            {
+                _index++;
+            }


Then IMO this should be called SkipCurrentAndThenSkipUntil.

Maybe it should. It works like that to prevent infinite loops, since it's possible that the current token isn't even potentially valid.
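To illustrate why the unconditional advance matters, here is a small sketch under stated assumptions: tokens are plain strings, the list ends with an end-of-file sentinel, and `SkipSketch.SkipUntil` is a hypothetical stand-in that returns the new index instead of mutating parser state. If the current token is already in the follow set but is what caused the error, a version without the first step would return the same index every time, and the caller would loop forever.

```csharp
using System;
using System.Collections.Generic;

public static class SkipSketch
{
    // Returns the index to resume parsing at. Assumes the last token is an
    // end-of-file sentinel, so the index never moves past tokens.Count - 1.
    public static int SkipUntil(IReadOnlyList<string> tokens, int index, HashSet<string> kinds)
    {
        // Always advance by one, so repeated recovery attempts make progress
        // even when the current token is itself in `kinds`.
        if (index < tokens.Count - 1)
        {
            index++;
        }
        // Then skip forward until a token in `kinds` or the sentinel is reached.
        while (index < tokens.Count - 1 && !kinds.Contains(tokens[index]))
        {
            index++;
        }
        return index;
    }
}
```

Starting at index 0 of `struct x struct eof` with the follow set `{ struct }`, the sketch returns 2 rather than 0. Calling it again from 2 returns 3, the sentinel, where it then stays.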

FrobtheBuilder added 8 commits August 4, 2021 11:32

Get started on the dang multiple errors.

4a9fc92

Merge master.

51f58d2

Further work on this

694e2cd

Move union validator.

f6f50cf

Make a bunch of parser errors non-fatal.

af5b3e0

Tearing up the parser oh yeah.

3e8360f

More parser recovery logic.

e95fbd2

Gitignore, further parser improvements.

5d1d1b6

FrobtheBuilder requested review from andrewmd5 and lynn August 11, 2021 21:46

andrewmd5 changed the title ~~Multiple errors~~ Support for Reporting Multiple Compiler Errors Aug 11, 2021

andrewmd5 requested changes Aug 11, 2021

FrobtheBuilder added 2 commits August 12, 2021 11:04

Replace the null definition with nulls.

3c5bcdd

Fix tests.

1f0933e

lynn approved these changes Aug 12, 2021

Address additional feedback.

fc27aca

FrobtheBuilder requested a review from andrewmd5 August 12, 2021 20:13

FrobtheBuilder merged commit 10143cc into master Aug 12, 2021

andrewmd5 deleted the multiple-errors branch July 18, 2022 15:38
