Implement a rule versioning mechanism #35

sharwell · 2012-03-01T08:15:14Z

In my IDE work, support for any given language frequently includes hundreds up to several thousand references to context objects in the parse tree. Regardless of whether manual walking methods (iterating over or instanceof on the result of getChild(n) etc) or implicit labels are used, this code can be extremely sensitive to minor changes in the grammar and is a frequent source of regression bugs. A "versioning mechanism" for rules could be extremely helpful in tracking down interface problems early which arise when the grammar changes.

Rule annotations (current experimental implementation)

By adding annotations named @RuleVersion and @RuleDependency, a balance between the above items could be achieved. The generated code for a rule method would be marked with an @RuleVersion(n) annotation with the rule version. Code which depends on rules could be marked with one or more @RuleDependency(RULE_expr, 1) annotations declaring the dependency. Multiple dependencies can be wrapped in a @RuleDependencies annotation.

Benefits of this method include:

Compile time dependency checking ~~can be provided~~ is provided by an annotation processor, but would not force a failure to compile following a version change. Runtime checking could be provided by a utility method which could check all dependency for a class and/or package at once.
No overhead when -ea is specified (runtime assertion checking).
Clean, declarative syntax.
User control over when the version of a rule is incremented. This allows cross-rule changes to be reflected in versioning, such as incrementing the versions of rules a, b, and c when rule a changes from a : b; to a : c;.

A possible syntax for this which is reasonably clean, minimizes changes to the tool, and would keep the grammar target language agnostic could be:

foo
@version{1}
   : ...
   ;

Inline actions and runtime assertions (alternative 1, not in use)

This method does not introduce an code generation changes, so it can be used with the current versions of ANTLR 4. While better than unversioned rules, it does have a number of drawbacks. If the following code is added to an @members{} block:

private static int[] ruleVersions;

{
if (ruleVersions == null) {
    ruleVersions = new int[_ATN.ruleToStartState.length];
}
}

public static int getRuleVersion(int rule) {
    return ruleVersions[rule];
}

public static int getRuleVersion(ParserRuleContext<?> context) {
    return ruleVersions[context.ruleIndex];
}

private static void setRuleVersion(ParserRuleContext<?> context, int version) {
    ruleVersions[context.ruleIndex] = version;
}

Then the following @init{} action can be used to mark a rule version that can be incremented when the rule changes:

@init{setRuleVersion($ctx, 1);}

When a block/statement of code depends on a particular form of the rule, a statement like the following will allow quicker detection of potential problems.

// for code in a listener, or where a typed context object is used
ExprContext ctx = ...;
assert MyParser.getRuleVersion(ctx) == 1;

// for general dependencies
assert MyParser.getRuleVersion(MyParser.RULE_expr) == 1;

CRC constants and compile-time assertions (alternative 2, not in use)

If a block of code like the following could be automatically generated based on a CRC calculation of each rule's syntax (ignoring whitespace, actions, and unnecessary parentheses):

public static final boolean HASH_expr_2bc29fa4=true,
    HASH_stmt_56ed0cbb=true, ...;

Then an assertion like the following will actually produce a compile-time error until dependent code is updated following a change to a rule:

assert HASH_expr_2bc29fa4;

While the fields would be hard to keep track of, any modern editor will allow updating the assertion after code verification by simply typing HASH_expr_ followed by a complete word action (Ctrl+Space in many IDEs).

The text was updated successfully, but these errors were encountered:

parrt · 2012-03-04T01:44:42Z

I like concept. I might prefer a @check("SUPERHASHCODE") or something autogenerated.

Fix syntax of the generated source code

Without these tests, the demo crashes

Build cleanup from HashEdgeMap implementation

sharwell mentioned this issue Mar 1, 2012

Rule versioning parrt/antlr4#30

Closed

sharwell mentioned this issue Mar 12, 2014

Rule versioning artifact #490

Closed

parrt added type:feature and removed type:feature:2 labels Nov 16, 2014

parrt pushed a commit that referenced this issue Jun 30, 2015

Merge pull request #35 from jcbrinfo/patch-1

22fcd94

Fix syntax of the generated source code

ericvergnaud mentioned this issue Sep 24, 2016

TestParseTrees is ok #1290

Closed

parrt pushed a commit that referenced this issue Nov 7, 2016

Merge pull request #35 from nburles/fix-prediction-context

78a1216

Without these tests, the demo crashes

ericvergnaud mentioned this issue Nov 8, 2016

Merge cpp #1346

Merged

adarre mentioned this issue Nov 21, 2017

c++ Stack use after scope bug reported by ASAN #2131

Open

sharwell added a commit to sharwell/antlr4 that referenced this issue Dec 23, 2018

Merge pull request antlr#35 from tunnelvisionlabs/hash-edge-map

7123888

Build cleanup from HashEdgeMap implementation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement a rule versioning mechanism #35

Implement a rule versioning mechanism #35

sharwell commented Mar 1, 2012

parrt commented Mar 4, 2012

Implement a rule versioning mechanism #35

Implement a rule versioning mechanism #35

Comments

sharwell commented Mar 1, 2012

Rule annotations (current experimental implementation)

Inline actions and runtime assertions (alternative 1, not in use)

CRC constants and compile-time assertions (alternative 2, not in use)

parrt commented Mar 4, 2012