Automatic creation of a Concrete Syntax Tree. #215

bd82 · 2016-07-16T00:30:13Z

Zumbala · 2016-10-02T09:51:23Z

I came across the same problem and did something like below to overcome this.

// create place holder for your ast
var ast = {};
// give a descriptive name for adding to the same level, value = null or object
// value null will clear the array
function AST(name, value) {
 if (value === null) ast[name] = [];
 if (value) ast[name].push(value);
 return ast[name];
}

// sample usage:

$.start = $.RULE("start",function () {
  return {
    schema : $.SUBRULE($.define),
    rules : (function(name) { 
              AST(name,null);
              $.MANY(function () { AST(name,$.SUBRULE($.definitions));});
              return AST(name);
             })("rules"),
    start : $.SUBRULE($.root)   
  };
});

So by using an IIFE you can create an AST.

Maybe better is to have the MANY and others have the same implementation as the OR function.
Meaning the return value can be anything coming from the consuming functions.
In this case an ARRAY should be returned as MANY suggests doing so.

$.definitions = $.RULE("definitions",function () {
   return $.OR([
    { ALT : function () { return $.SUBRULE($.validator); } },
    { ALT : function () { return $.SUBRULE($.type); } },
    { ALT : function () { return $.SUBRULE($.object); } }
  ]);

});

bd82 · 2016-10-02T10:20:20Z

Thanks @Zumbala.

I'm not sure what is actually happening in the example you provided.
By I think understand the gist of it of minimizing the AST building code so the grammar would be more "pronounced".

I've done something similar by building a very simple "ParseTree" structure and deferring the full AST building to a later stage.
https://github.com/SAP/chevrotain/blob/master/examples/language_services/src/examples/json/parser.ts

but this is just a mitigation of the problem. I'm interested in creating a completely pure grammar and defining the "actions" outside of it (listener/visitor/...). That would be a full solution.

I believe this is possible with Chevrotain by overriding CONSUME/SUBRULE/RULE.
And hope to find time to investigate in the future 😄

bd82 · 2016-10-02T10:30:18Z

Maybe better is to have the MANY and others have the same implementation as the OR function.
Meaning the return value can be anything coming from the consuming functions.
In this case an ARRAY should be returned as MANY suggests doing so.

Possibly, however:

This won't be suitable for all use cases as sometimes the "MANY** has complex contents as its not always a single "SUBRULE" invocation.
It would also conflict with "MANY_SEP" which returns an array of separators.
If you are interested in this open a different issue and we can discuss it farther, however
And it would have a negative performance impact even in situations the returned array is not used.

I'm not sure I can implement this without causing issues...

You could easily override MANY1-5 and "manyInternal"
to accomplish this, but with the disadvantage of possibly having extra work when upgrading Chevrotain versions.

tlrobinson · 2016-12-12T01:29:31Z

If this idea is what I think it is, it would be really nice. I have a grammar that already has actions, which I use to parse and compile a simple language. I'd like to re-use the same grammar for syntax highlighting. It would be neat if I could toggle a "syntax tree only" mode that ignored the actions and returned a syntax tree instead.

Awesome project, by the way!

bd82 · 2016-12-12T09:52:41Z

Awesome project, by the way!

Thanks @tlrobinson. 😄

I've renamed the issue to be clearer.
It is similar to what you described, but instead of having grammar actions that compile a simple language and having another flow which would disable those actions and build a tree.

You will have a pure grammar without any actions (example)
And two grammar listeners.

One used to parse and compile.
One used to do syntax highlights.

So there will be a stronger separation of concerns between the syntax definition
and the actions performed on the input.

You could still write grammar actions if you prefer.
But I have no way to selectively disable the user defined actions because they are
always mixed with the grammar in the same JavaScript function.
Example:

$.RULE("atomicExpression", function() {
            return $.OR([
                {ALT: function(){ return $.SUBRULE($.parenthesisExpression)}},
                // The grammar action (parseInt) can't be separated from the grammar (CONSUME)
                {ALT: function(){ return parseInt($.CONSUME(NumberLiteral).image, 10)}}
            ]);
    });

tlrobinson · 2016-12-12T19:25:00Z

Sounds good.

In the meantime, how would you recommend re-using a grammar for both purposes? Perhaps I could have an abstract class that has the grammar with actions that call abstract methods that are implemented by two different subclasses.

But I have no way to selectively disable the user defined actions because they are
always mixed with the grammar in the same JavaScript function.

Doesn't the content assist mode bypass the actions?

bd82 · 2016-12-13T13:24:58Z

moved discussion related to metabase grammar here: #327

fixes #215

bd82 · 2017-03-04T10:59:39Z

Benchmarked using a JSON grammar with a large 1,000 lines input.
When CST building is enabled the performance is about 70% of when it is not. (on V8).

This is a little slower than expected, but it does make sense as there is a quite a bit of additional overhead
required to "insert" the CST building into the flow, not to mention that the CST itself is by definition
very detailed and includes information on the entire parse tree so that is more work than only collecting
specific parts for a specific flow.

fixes #215

bd82 · 2017-03-11T15:41:46Z

The Semantic Actions on the Concrete Syntax Tree output will be handled in a separate issue
as this is already a huge change.

fixes #215

Allows writing "pure" grammars and perform semantic actions afterwards. In the (near?) future will add CST visitors to make it easy to traverse the CST structure. fixes #215

bd82 · 2017-03-12T16:41:09Z

Just missing Docs now.

It is too complicate from the perspective of and end user to try and figure out the type of the children dictionary value (mandatory/optional/collection) As it depends on (potentially) traversing multiple paths in the same grammar rule. Related to #215.

Relates to #215.

bd82 · 2017-03-16T22:30:28Z

Left overs related to docs & benchmark will be handled separately.

bd82 added the enhancement label Jul 16, 2016

bd82 added New Feature and removed enhancement labels Aug 1, 2016

bd82 changed the title ~~Investigate automatic creation of a Syntax Tree.~~ Investigate automatic creation of a Syntax Tree And Grammar Listeners Oct 25, 2016

bd82 mentioned this issue Oct 25, 2016

chevrotain as the 'generated' code from PEG #293

Closed

bd82 mentioned this issue Dec 7, 2016

More Functional Parsing DSL API. #324

Closed

bd82 changed the title ~~Investigate automatic creation of a Syntax Tree And Grammar Listeners~~ Separate the Grammar from the Grammar Actions. Dec 12, 2016

bd82 mentioned this issue Dec 13, 2016

Metabase Expressions Grammar Assistance Thread #327

Closed

bd82 added a commit that referenced this issue Feb 9, 2017

CST WIP

a4d38a1

fixes #215

bd82 mentioned this issue Feb 9, 2017

CST #365

Merged

bd82 added a commit that referenced this issue Feb 9, 2017

CST WIP

66ee549

fixes #215

bd82 added a commit that referenced this issue Feb 9, 2017

CST WIP

e57498b

fixes #215

bd82 added a commit that referenced this issue Feb 16, 2017

CST WIP

b14236a

fixes #215

bd82 added a commit that referenced this issue Feb 17, 2017

CST WIP

d77570c

fixes #215

bd82 added a commit that referenced this issue Feb 17, 2017

CST WIP

b9d328e

fixes #215

bd82 added a commit that referenced this issue Feb 19, 2017

CST WIP

8e947bd

fixes #215

bd82 added a commit that referenced this issue Feb 20, 2017

CST WIP

ff4225b

fixes #215

bd82 added a commit that referenced this issue Feb 20, 2017

CST WIP

ca40d4e

fixes #215

bd82 added a commit that referenced this issue Feb 20, 2017

CST WIP

a2c7951

fixes #215

bd82 added a commit that referenced this issue Feb 21, 2017

CST WIP

49c59b2

fixes #215

bd82 added a commit that referenced this issue Feb 21, 2017

CST WIP

870ddb4

fixes #215

bd82 added a commit that referenced this issue Feb 22, 2017

CST WIP

69975fb

fixes #215

bd82 added a commit that referenced this issue Feb 22, 2017

CST WIP

4b53129

fixes #215

bd82 added a commit that referenced this issue Mar 3, 2017

CST WIP

1c58a1b

fixes #215

bd82 added a commit that referenced this issue Mar 3, 2017

CST WIP

df29767

fixes #215

bd82 added a commit that referenced this issue Mar 3, 2017

CST WIP

9c5982b

fixes #215

bd82 added a commit that referenced this issue Mar 4, 2017

CST WIP

be83c56

fixes #215

bd82 added a commit that referenced this issue Mar 4, 2017

CST WIP

708e500

fixes #215

bd82 added a commit that referenced this issue Mar 4, 2017

CST WIP

7f6cc7a

fixes #215

bd82 added a commit that referenced this issue Mar 4, 2017

CST WIP

33a4e46

fixes #215

bd82 added a commit that referenced this issue Mar 7, 2017

CST WIP

4f94ab1

fixes #215

bd82 added a commit that referenced this issue Mar 7, 2017

CST WIP

8a4cccb

fixes #215

bd82 added a commit that referenced this issue Mar 9, 2017

CST WIP

5b839de

fixes #215

bd82 added a commit that referenced this issue Mar 9, 2017

CST WIP

6dd7c99

fixes #215

bd82 added a commit that referenced this issue Mar 11, 2017

CST WIP

cc2ed28

fixes #215

bd82 added a commit that referenced this issue Mar 11, 2017

CST WIP

003f8e2

fixes #215

bd82 mentioned this issue Mar 11, 2017

Concrete Syntax Tree / Semantic Actions Visitor. #381

Closed

6 tasks

bd82 added a commit that referenced this issue Mar 11, 2017

CST WIP

fd8db81

fixes #215

bd82 added a commit that referenced this issue Mar 12, 2017

CST WIP

15c661c

fixes #215

bd82 added a commit that referenced this issue Mar 12, 2017

CST WIP

62ed2da

fixes #215

bd82 added a commit that referenced this issue Mar 12, 2017

Automatic create of Concrete Syntax Tree.

42bc321

Allows writing "pure" grammars and perform semantic actions afterwards. In the (near?) future will add CST visitors to make it easy to traverse the CST structure. fixes #215

bd82 added a commit that referenced this issue Mar 12, 2017

Automatic create of Concrete Syntax Tree.

424168f

Allows writing "pure" grammars and perform semantic actions afterwards. In the (near?) future will add CST visitors to make it easy to traverse the CST structure. fixes #215

bd82 closed this as completed in #365 Mar 12, 2017

bd82 reopened this Mar 12, 2017

bd82 pushed a commit that referenced this issue Mar 16, 2017

CST Docs.

c472d39

Relates to #215.

bd82 mentioned this issue Mar 16, 2017

CST Docs. #399

Merged

bd82 changed the title ~~Separate the Grammar from the Grammar Actions.~~ Automatic creation of a Concrete Syntax Tree. Mar 16, 2017

bd82 closed this as completed Mar 16, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatic creation of a Concrete Syntax Tree. #215

Automatic creation of a Concrete Syntax Tree. #215

bd82 commented Jul 16, 2016 •

edited

Loading

Zumbala commented Oct 2, 2016 •

edited

Loading

bd82 commented Oct 2, 2016

bd82 commented Oct 2, 2016

tlrobinson commented Dec 12, 2016 •

edited

Loading

bd82 commented Dec 12, 2016 •

edited

Loading

tlrobinson commented Dec 12, 2016 •

edited

Loading

bd82 commented Dec 13, 2016 •

edited

Loading

bd82 commented Mar 4, 2017

bd82 commented Mar 11, 2017

bd82 commented Mar 12, 2017

bd82 commented Mar 16, 2017

Automatic creation of a Concrete Syntax Tree. #215

Automatic creation of a Concrete Syntax Tree. #215

Comments

bd82 commented Jul 16, 2016 • edited Loading

Zumbala commented Oct 2, 2016 • edited Loading

bd82 commented Oct 2, 2016

bd82 commented Oct 2, 2016

tlrobinson commented Dec 12, 2016 • edited Loading

bd82 commented Dec 12, 2016 • edited Loading

tlrobinson commented Dec 12, 2016 • edited Loading

bd82 commented Dec 13, 2016 • edited Loading

bd82 commented Mar 4, 2017

bd82 commented Mar 11, 2017

bd82 commented Mar 12, 2017

bd82 commented Mar 16, 2017

bd82 commented Jul 16, 2016 •

edited

Loading

Zumbala commented Oct 2, 2016 •

edited

Loading

tlrobinson commented Dec 12, 2016 •

edited

Loading

bd82 commented Dec 12, 2016 •

edited

Loading

tlrobinson commented Dec 12, 2016 •

edited

Loading

bd82 commented Dec 13, 2016 •

edited

Loading