Fix to #225, added better error reporting for internal compiler errors. #329

srijan-paul · 2020-09-16T10:48:59Z

Previously, when an incorrect table was passed into a function dealing with a certain type of table, only an "impossible" error was thrown followed by a Lua stack trace.

Now it's possible to see the tag which caused the error, and wherever appropriate, an error message following it. Calls to error("impossible") have been replaced with a call to typedecl.tag_error(tag, "(optional error message.)").

Hopefully, this fixes #225.

hugomg

Lovely! Thanks for going through this. I think you even found some cases that I would have missed if I tried doing this myself.

I do have some suggestions for changes before we can merge this, though.

The first is that if we choose a good default error message for tag_error then I think we should be able to use the default error message in more places. I think it would be good if we had a clear rule for whether we should have a special error message or not. That way, we don't need to decide for every possible if-elseif whether it should have a special error message or not. One rule of thumb we could follow is that if the if-elseif covers all the possible cases then it can use the default error message and if it only covers a subset of the possible cases then it should have a special error message.
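
To make the rule of thumb concrete, here is a minimal sketch. The tag_error below is a hypothetical stand-in (the real function lives in typedecl and its default message may be worded differently), and the "shape.Shape" tags are made up for illustration.

```lua
-- Hypothetical stand-in for typedecl.tag_error; the real signature and
-- default message in Pallene may differ.
local function tag_error(tag, message)
    message = message or "input has the wrong type or an elseif case is missing"
    error(string.format("unhandled case '%s': %s", tostring(tag), message))
end

-- Exhaustive dispatch: every tag of the made-up "shape.Shape" type is
-- handled, so reaching the else branch is a bug and the default message
-- is enough.
local function area(shape)
    local tag = shape._tag
    if     tag == "shape.Shape.Square" then return shape.side * shape.side
    elseif tag == "shape.Shape.Rect"   then return shape.w * shape.h
    else
        tag_error(tag)
    end
end

-- Partial dispatch: only a subset of the tags is valid here, so the
-- error message says which ones are expected.
local function side_length(shape)
    local tag = shape._tag
    if tag == "shape.Shape.Square" then return shape.side
    else
        tag_error(tag, "only squares have a single side length")
    end
end
```

With this split, a reader can tell from the call alone whether the if-elseif chain was meant to be exhaustive.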

Another thing that we could change is that there are some places in the code that could be updated to use the new tag_matches function. You can find them by running grep -R 'string.match(.*tag'.

pallene/typedecl.lua

pallene/types.lua

pallene/coder.lua

pallene/to_ir.lua

pallene/typedecl.lua

pallene/to_ir.lua

srijan-paul · 2020-09-17T10:44:01Z

I made the changes as asked. Please excuse the CI failures in the older commits; I made some mistakes by merging into master too early. Everything should now work as expected.

One part I couldn't figure out is this line, in coder.lua:1508:

local name = assert(string.match(cmd._tag, "^ir%.Cmd%.(.*)$"))

I had originally replaced that with this:

local name = cmd._tag
assert(typedecl.tag_matches(name, "ir.Cmd."))

however, doing so would cause the tests to fail, and I seem to be unable to diagnose why.
So I left that one line unchanged. It would be awesome if you could explain that. cheers.

hugomg · 2020-09-17T13:45:29Z

string.match(cmd._tag, "^ir%.Cmd%.(.*)$")
however, doing so would cause the tests to fail, and I seem to be unable to diagnose why.

The string.match() here is using a string capture. For example, if cmd._tag is ir.Cmd.Binop then name will be Binop.

Maybe we should change the tag_matches function to return this string instead of just true/false?
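
A small standalone sketch of the difference, runnable with a plain Lua interpreter. The has_prefix helper is hypothetical and only stands in for a boolean-style check; it is not the tag_matches from this PR.

```lua
local tag = "ir.Cmd.Binop"

-- With a capture group, string.match returns the captured text,
-- not the whole match and not a boolean.
local name = string.match(tag, "^ir%.Cmd%.(.*)$")
print(name)  --> Binop

-- A boolean-returning prefix check (hypothetical helper) throws that
-- suffix away, so code that needs "Binop" cannot use it directly.
local function has_prefix(s, prefix)
    return s:sub(1, #prefix) == prefix
end
print(has_prefix(tag, "ir.Cmd."))  --> true
```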

hugomg

Thanks. I marked all the things that you fixed as resolved. There are only a handful that were missed that are still there:

The tag_error for unknown unary operators
There is still an assert(string.match(typ._tag, "^types%.T%.")) in types.lua
There is the name = assert(string.match(cmd._tag, "^ir%.Cmd%.(.*)$")) that broke the CI

pallene/types.lua

hugomg · 2020-09-17T14:03:51Z

About the broken CI, if we really wanted we could use git rebase -i to fix the old commits so they pass the CI. However, in this case it is only a very small problem so I don't think we have to bother with it.

srijan-paul · 2020-09-17T14:59:46Z

The string.match() here is using a string capture. For example, if cmd._tag is ir.Cmd.Binop then name will be Binop.

Maybe we should change the tag_matches function to return this string instead of just true/false?

Yeah that would seem like a good idea.
I missed the capture group, kinda rusty with regex. Thanks for explaining.

I feel like the name tag_matches implies a boolean. What would you say to leaving that one line as is, instead of changing the intent of the function for just one usage?

srijan-paul · 2020-09-17T15:02:44Z

Good idea. It would be safer to always add the second ".". What do you think about adding this "." inside the tag_matches function, so the caller doesn't have to remember to add it themselves?

Since the tag_matches function does prefix matching, I personally would find it easier if it only matched against the string it receives as an argument instead of adding a ".". It could, at times, cause confusion. Thoughts?

Better internal compiler errors specifying the tag which caused the error instead of "impossible"

hugomg · 2020-09-17T17:01:23Z

I personally would find it easier if it only matched against the string it receives as an argument instead of adding a ".".

Maybe it should have a different name instead of tag_matches then? What we are trying to see is if a certain tag (such as "types.T.Boolean") has a given type (such as types.T). We aren't trying to match against an arbitrary string...

I feel like the name tag_matches implies a boolean.

If you can think of a good name that doesn't imply a boolean we could use that. If not, I think it is OK if we continue using string.match("ir%.Cmd%.(.*)") in that one place.

Updated the tag_error function further to allow for more uses across the codebase.

srijan-paul · 2020-09-17T17:42:22Z

If you can think of a good name that doesn't imply a boolean we could use that.

I went with match_tag and made the changes as asked.
The function now adds a second "." and captures and returns everything following it.

Probably not the best name, but this is loosely similar to what string.match does in Lua or JavaScript: match a pattern in a string and return the matched sequence. Except this returns the part following the ".".
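
For readers following along, the described behavior could look roughly like the sketch below. This is not the code from pallene/typedecl.lua: the body, the plain-text comparison, and the false return value on a mismatch are all assumptions made for illustration.

```lua
-- Hypothetical sketch of a match_tag-style helper.
local function match_tag(tag, prefix)
    -- Append the separating "." so callers pass "ir.Cmd", not "ir.Cmd."
    local full_prefix = prefix .. "."
    -- Compare as plain text, so "." is never treated as a pattern wildcard.
    if tag:sub(1, #full_prefix) == full_prefix then
        -- Return everything after the prefix and its trailing "."
        return tag:sub(#full_prefix + 1)
    end
    return false
end

print(match_tag("ir.Cmd.Binop", "ir.Cmd"))   --> Binop
print(match_tag("types.T.Float", "ir.Cmd"))  --> false
```

Because the result is either a non-empty string or false, it can still be used as a condition in places that only care whether the tag belongs to the type.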

Do let me know if I missed something yet again :D

hugomg

Nice! I like the new name.

I found a couple of small bugs but they should be easy to fix and after that I think this should be good for merging.

It would also be nice if we added some test cases to typedecl_spec.lua. In particular, we should test that something like typedecl.match_tag("abc.d.e", "a.c") returns false because that is a bug that already hit us before and it happened again today. Would you be able to help by writing those tests?
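
The bug in question can be reproduced with nothing but the standard string library. In the sketch below, escape_pattern is a hypothetical helper shown as one possible fix; the PR itself may have solved it differently.

```lua
-- In a Lua pattern, an unescaped "." matches any character, so the
-- prefix "a.c" wrongly matches a tag that starts with "abc".
print(string.match("abc.d.e", "^a.c%.(.*)$"))  --> d.e

-- Escaping every non-alphanumeric character with "%" makes the prefix
-- literal. The extra parentheses drop gsub's second return value.
local function escape_pattern(s)
    return (s:gsub("%W", "%%%0"))
end

local pattern = "^" .. escape_pattern("a.c") .. "%.(.*)$"
print(escape_pattern("a.c"))             --> a%.c
print(string.match("abc.d.e", pattern))  --> nil
print(string.match("a.c.d.e", pattern))  --> d.e
```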

hugomg · 2020-09-17T21:55:06Z

pallene/print_ir.lua

@@ -214,7 +215,8 @@ local function Cmd(cmd)
    elseif tag == "ir.Cmd.CallStatic" then rhs = Call(Fun(cmd.f_id),  Vals(cmd.srcs))
    elseif tag == "ir.Cmd.CallDyn"    then rhs = Call(Val(cmd.src_f), Vals(cmd.srcs))
    else
-        local tagname = assert(string.match(cmd._tag, "^ir.Cmd.(.*)"))
+        local tagname = cmd._tag
+        assert(typedecl.match_tag(tagname, "ir.Cmd"))


Should this be local tagname = assert(typedecl.match_tag(cmd._tag, "ir.Cmd"))?
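
The reason the one-line form matters is that assert returns its arguments when the first one is truthy. A minimal sketch, using a hypothetical stand-in for typedecl.match_tag and the ir.Cmd.Binop tag mentioned earlier in this thread:

```lua
-- Hypothetical stand-in for typedecl.match_tag, for illustration only.
local function match_tag(tag, prefix)
    local full_prefix = prefix .. "."
    if tag:sub(1, #full_prefix) == full_prefix then
        return tag:sub(#full_prefix + 1)
    end
    return false
end

local cmd = { _tag = "ir.Cmd.Binop" }

-- Version from the diff: the guard passes, but tagname still holds
-- the full tag, including the "ir.Cmd." prefix.
local tagname = cmd._tag
assert(match_tag(tagname, "ir.Cmd"))
print(tagname)  --> ir.Cmd.Binop

-- Suggested version: assert passes the captured suffix through.
local short = assert(match_tag(cmd._tag, "ir.Cmd"))
print(short)    --> Binop
```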

Oh yes, my bad.

pallene/typedecl.lua

srijan-paul · 2020-09-18T05:10:39Z

It would also be nice if we added some test cases to typedecl_spec.lua. In particular, we should test that something like typedecl.match_tag("abc.d.e", "a.c") returns false

Yeah, I should be able to put together some simple tests for that.

Added some test cases to test "match_tag" function in typedecl.

srijan-paul · 2020-09-18T06:22:00Z

I updated that assert guard and changed match_tag to no longer treat its arguments as regex strings.
Also added 3 basic test cases to typedecl_spec.lua.

So, do you think these test cases are good for now?

hugomg

Just some small things in the typedecl_spec and then I think this is done.

hugomg · 2020-09-18T15:24:04Z

spec/typedecl_spec.lua

+        assert.falsy(typedecl.match_tag("foo.Bar.baz", "f.o.Bar"))
+        assert.truthy(typedecl.match_tag("foo.Bar.baz", "foo.Bar"))
+        assert.falsy(typedecl.match_tag("types.T.Float", "types.T."))
+    end, "")


We could put each test case in a separate it block. That way we can give a descriptive name for each of them. For example the first test can say that it is testing that the "." is not being interpreted as a regex character.

The second test case could test that the return value is "baz" instead of only testing that it is truthy

I think we don't need to put that "") at the end
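
One way the three suggestions could come together is sketched below. The match_tag here is a hypothetical stand-in rather than the real typedecl function, the describe/it fallbacks exist only so the file also runs under a plain Lua interpreter, and plain assert calls are used instead of luassert helpers for the same reason.

```lua
-- Fallbacks so this also runs with a plain `lua` interpreter; under
-- busted the real describe/it globals are used instead.
local describe = describe or function(_, body) body() end
local it = it or function(_, body) body() end

-- Hypothetical stand-in for typedecl.match_tag.
local function match_tag(tag, prefix)
    local full_prefix = prefix .. "."
    if tag:sub(1, #full_prefix) == full_prefix then
        return tag:sub(#full_prefix + 1)
    end
    return false
end

describe("match_tag", function()
    it("does not treat '.' in the prefix as a pattern wildcard", function()
        assert(not match_tag("foo.Bar.baz", "f.o.Bar"))
    end)

    it("returns the part of the tag after the prefix", function()
        assert(match_tag("foo.Bar.baz", "foo.Bar") == "baz")
    end)

    it("does not match when the caller adds a trailing '.'", function()
        assert(not match_tag("types.T.Float", "types.T."))
    end)
end)
```

Each it block now has a name that shows up in the test log, and the second case checks the returned value instead of only its truthiness.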

True, should have gone with more descriptive test logs.
Done 👍.

Updated tests to be more specific with logs.

hugomg

Perfect! Thanks a lot for the contribution! It's much appreciated.

srijan-paul added 7 commits September 16, 2020 18:36

added tag_matches and tag_error functions to typedecl module

6a08daa

added a test file, and replaced all error("impossible") calls in type…

fd0d9ef

…s.lua

added proper error messages in ast.lua, print_ir.lua and checker.lua

b8ee216

added proper error reporting to 'to_ir.lua'

95b4d3d

fixed typo

7f74d2e

added error reporting to 'coder.lua' and 'constant_propagation.lua'

f02aaf7

updated error_tag error message

a06532b

hugomg requested changes Sep 16, 2020

srijan-paul force-pushed the master branch from 05ba979 to 706aed2 Compare September 17, 2020 08:08

srijan-paul added 4 commits September 17, 2020 15:26

removed uneccessary whitespace after doc comments. added usage exampl…

7a0729b

…e for tag_matches

removed uneccessary whitespace after doc comments. added usage exampl…

ca72031

…e for tag_matches

replaced all uses of tag_is_type(tag) with tag_matches(tag, "types.T"

2134fe5

replaced old uses of string.match with typedecl.tag_matches for check…

6910a0d

…ing prefixes

srijan-paul requested a review from hugomg September 17, 2020 10:47

srijan-paul added 2 commits September 17, 2020 16:24

removed uneccessary error messages.

b6091ac

better default error message

706aed2

srijan-paul added 2 commits September 17, 2020 19:21

updated doc comment

29c9bef

fixed incorrect require path

edd073a

hugomg requested changes Sep 17, 2020

srijan-paul added 2 commits September 17, 2020 21:32

fixed bug with assert guards, now passes all tests

e99c0dd

Merge branch 'better-errors' into master

07350ca

Better internal compiler errors specifying the tag which caused the error instead of "impossible"

srijan-paul added 2 commits September 17, 2020 23:01

updated calls to match_tag , removed the "." at end of each call argu…

81a5676

…ment.

Merge branch 'better-errors' into master

88b1f02

Updated the tag_error function further to allow for more uses across the codebase.

srijan-paul requested a review from hugomg September 17, 2020 18:47

srijan-paul added 2 commits September 18, 2020 02:21

replaced the remaining string.matches with tag_matches

51123e9

changed function tag_matches to match_tag, now returns the captured s…

1827abb

…uffix.

hugomg requested changes Sep 17, 2020

srijan-paul added 2 commits September 18, 2020 10:24

updated match_tag to not treat the 2nd argument as regex

619c89d

updated a previously missed assert guard

93d2669

srijan-paul added 4 commits September 18, 2020 11:22

added some test cases to typedecl_spec

9367f29

Merge branch 'better-errors' into master

f7f553f

Added some test cases to test "match_tag" function in typedecl.

removed uneccessary comment

e99a95b

Merge branch 'better-errors' into master

ceb5309

srijan-paul requested a review from hugomg September 18, 2020 06:22

hugomg requested changes Sep 18, 2020

srijan-paul added 2 commits September 18, 2020 22:17

updated the test cases to be more precise with logs

9189ba0

Merge branch 'better-errors' into master

170094a

Updated tests to be more specific with logs.

srijan-paul requested a review from hugomg September 18, 2020 16:54

hugomg approved these changes Sep 18, 2020

hugomg merged commit e27f114 into pallene-lang:master Sep 18, 2020
