AST source locations #16

rvanasa · 2022-08-14T18:19:04Z

Resolves #7.

Progress:

Find a good way to convert usize ranges to line/column numbers
Refactor Exp_, Type_, etc. to include source locations
Refactor the lexer to use the same source location pattern
Update usages throughout the codebase to account for these changes
Update the parser to preserve source locations from tokens

matthewhammer · 2022-08-15T14:12:52Z

src/lib/ast.rs

 use serde::{Deserialize, Serialize};

+#[derive(Debug, Clone, PartialEq, Eq, Serialize, Deserialize)]
+pub struct Loc<X>(pub X, pub Source);


Does it help in any way to absorb the Box<_> around the X into this type definition?

That way, we avoid having to think about the Box and the Loc separately, and they combine in the AST structure.

I originally included Box<_> in the definition, but I ran into edge cases where I just wanted the source information without storing everything on the heap (specifically for the lexer). This could also give us room for optimizations down the line in the interpreter.

matthewhammer · 2022-08-15T14:13:29Z

src/lib/ast.rs


 pub type Prog = Decs;

+pub type Dec_ = Loc<Box<Dec>>;


I wonder if reading Loc<Dec> is simpler, where it would imply the box?

See my earlier comment about this question, near the def for Loc.

Makes sense for the AST but adds a bunch of extra overhead to the lexer. If this is enough of a QoL issue, we could maybe add a standard type alias called LB<_> or similar.

We could also add some specialized sugar functions by adding and impl<X> for Loc<Box<X>>, which makes it seem okay to have the optional heap allocation from an abstraction standpoint.

matthewhammer · 2022-08-15T14:14:38Z

src/lib/parser_utils.rs

-            vec: vec![d],
-            has_trailing: false,
+pub fn dec_into_exp(d: Dec_) -> Exp_ {
+    Loc(


Same question, and more evidence for combining I think.

Here's a counterexample where it makes sense to use Loc<_> without Box<_>. We could also eventually use this to optimize the VM, since boxing would add a bit of extra overhead which could make an impact on the interpreter's performance. Seems worth having a "separation of concerns" similar to Rust's Arc<Mutex<_>> pattern for reusability in different situations. I'll definitely add a shorthand for Loc<Box<_>> though, since it will probably continue to be used in most cases.

matthewhammer · 2022-08-15T14:15:27Z

This is looking really good, and very tedious in terms of the changes required. Thank you for undertaking it!

rvanasa · 2022-08-15T15:18:31Z

Replaced all usages of Loc<Box<_>> with a type alias called Node<_>, which has a specialized impl with its own helper functions, etc. Let me know if you think of a better name, since I just chose Node since it seemed to cause slightly less cognitive overhead than LB.

matthewhammer · 2022-08-15T21:28:01Z

Accidentally converted from "draft" to "ready to review", so just reverted that.

rvanasa added 11 commits August 14, 2022 10:00

Add source locations to AST

5e72878

Conversion progress

75d1ed8

Convert parser_utils

4a12d85

Add location.expand(location) helper function

cfb476f

Account for unknown source locations

82a7ba4

Various proselytizations

34b8529

Convert lexer to use AST source location types

a1770a7

Refactor Located -> Loc, Location -> Source

59fd99b

Prosthelytize VM

e040407

Add structopt to format command width argument

64549b4

Merge branch 'main' into issue-7-line-col-numbers

7763e7d

rvanasa requested a review from matthewhammer August 14, 2022 18:19

rvanasa marked this pull request as draft August 14, 2022 18:19

rvanasa added 2 commits August 14, 2022 12:48

Update parser, initial pass

35ea78d

Add 'map_into' helper function for Loc

911ea8b

matthewhammer suggested changes Aug 15, 2022

View reviewed changes

Replace Loc<Box<_>> with Node<_>

7005cea

matthewhammer marked this pull request as ready for review August 15, 2022 21:27

matthewhammer marked this pull request as draft August 15, 2022 21:27

rvanasa added 8 commits August 16, 2022 13:05

Progress

a44178f

Progress

43a0e18

Add syntax tree enum

b07ab9c

Progress

ac36104

Fix formatter test file

5e4065d

Implement custom Debug + Display for Source

2124496

Replace parser_utils::Lexer with iterated Tokens

1a96a14

Filter whitespace / comments before parsing

0d9abee

rvanasa and others added 13 commits August 16, 2022 16:22

Progress

c83f104

Remove unused comment

d3fea5a

Merge branch 'main' into issue-7-line-col-numbers

cf49976

Progress

a55e506

Fix token tree filtering edge case

12d24b0

Progress

8491f5f

Preserve sugar for return statements

7340699

Rename ValueFromExpError to ValueError

d612d9a

Preserve sugar for Exp::Call type arguments

e512991

Remove unused parser rules

ed81998

Progress

208bb44

Add test cases for line and block comments

24835d1

fix sneaky parser issue with bad error message.

ed018f4

rvanasa marked this pull request as ready for review August 17, 2022 18:14

rvanasa merged commit ce7d4ed into main Aug 17, 2022

rvanasa deleted the issue-7-line-col-numbers branch August 18, 2022 00:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AST source locations #16

AST source locations #16

rvanasa commented Aug 14, 2022 •

edited

matthewhammer Aug 15, 2022

rvanasa Aug 15, 2022

matthewhammer Aug 15, 2022

rvanasa Aug 15, 2022

matthewhammer Aug 15, 2022

rvanasa Aug 15, 2022

matthewhammer commented Aug 15, 2022

rvanasa commented Aug 15, 2022 •

edited

matthewhammer commented Aug 15, 2022

AST source locations #16

AST source locations #16

Conversation

rvanasa commented Aug 14, 2022 • edited

matthewhammer Aug 15, 2022

Choose a reason for hiding this comment

rvanasa Aug 15, 2022

Choose a reason for hiding this comment

matthewhammer Aug 15, 2022

Choose a reason for hiding this comment

rvanasa Aug 15, 2022

Choose a reason for hiding this comment

matthewhammer Aug 15, 2022

Choose a reason for hiding this comment

rvanasa Aug 15, 2022

Choose a reason for hiding this comment

matthewhammer commented Aug 15, 2022

rvanasa commented Aug 15, 2022 • edited

matthewhammer commented Aug 15, 2022

rvanasa commented Aug 14, 2022 •

edited

rvanasa commented Aug 15, 2022 •

edited