Parametrize the grammar by externally-supplied variables #36

ceymard · 2011-08-15T22:57:51Z

I'll take the example of JinJa (I created a JS clone of it with PegJS) :

You can write {% block my_block %}{% endblock %}, which is the usual syntax.

It is however possible in Jinja to redefine {% and %} to other tokens. I would like to be able to have a rule like tag = tk_open "block" tk_close contents tk_open "endblock" tk_close {...}

Maybe with another syntax ? Like tag = $tk_open "block" $tk_close to indicate these are terminals in a variable ?

The variable could then be declared in the leading { ... } section. This best used when in conjunction with another ticket I opened : "Create an optional argument 'options' in parse()"

The text was updated successfully, but these errors were encountered:

dmajda · 2011-08-20T09:05:41Z

It is however possible in Jinja to redefine {% and %} to other tokens.

You mean in the tempalte itself? Or this is specified before the template parsing starts? Can you give me an example and/or pointer to the documentation?

ceymard · 2011-08-20T09:18:47Z

It is specified before the parsing starts. See http://jinja.pocoo.org/docs/api/#high-level-api with the block_start_* options.

guiprav · 2011-10-26T22:05:27Z

That would be some really handy feature I'd appreciate to see in PEG.js. Since I'm not familiar with the library's internals, I was considering to write a preprocessor to do that.

Not only variables would be needed in my case, though. I find myself writing this a lot:

CommaSeparatedIdentifierSequence
  = left:Identifier opt:(_* "," _* CommaSeparatedIdentifierSequence)?
    {
        var sequence = [left];
        var more = opt[3];

        if (more !== undefined)
            sequence = concat(sequence, more);

        return sequence;
    }

When I could be doing something like:

CommaSeparatedIdentifierSequence
  = @Sequence(Identifier, _* "," _*)

@Sequence being a rule template accepting the sequence element and separator as first and second "parameter", respectively:

@Sequence (Element, Separator)
  = left:Element opt:(Separator @Sequence(Element, Separator))? { ... }

Skalman · 2012-12-10T15:48:14Z

I have another, related use case, when parsing a freely formed document (wikitext from Wiktionary):

There are special keywords (the article title) which vary per parsed document:

// PEG:
title_line
  = "'''" %article_title "'''\n"
// JS:
parser.parse("...\n'''door'''\n...", { values: { article_title: 'door' } });

A further extension would be to also allow for arrays of values, in cases where there might be a long list of possible keyword values:

// PEG:
lang_header
  = '==' %lang '==\n'
// JS:
parser.parse("...\n==English==\n...\n==French==\n...", { values: { lang: ['English', 'French', 'Swedish', ... ] } });

Of course, it is possible to generate the PEG grammar, but it seems like this might be doable and it would be a lot more efficient.

fusepilot · 2014-09-28T23:43:58Z

This is something I'd like to see implemented. My current project requires dynamic rule similar to Skalman's example.

andreineculau · 2014-10-11T09:40:30Z

@n2liquid I see myself doing this, with no annoyance

CommaSeparatedIdentifierSequence
  = left:Identifier opt:CommaSeparatedIdentifier*
  { return [left].concat(opt) }
CommaSeparatedIdentifier
  = _* "," _* Identifier:Identifier
  { return Identifier }

@ceymard I was about to write that what you're suggesting is possible now (maybe it wasn't in 2011) like this

// taken from your PR's test
{
  var patterns = {
    lit: 'okay',
    yes: '%%',
    fun: function() {return 'bop'}
  }
}

start
  = &{return patterns.lit} {return 'done'}
  / &{return patterns.yes} {return 'other'}
  / &{return patterns.fun()} {return 'fun'}

but it's not, since &{...} only matches, but doesn't capture and advance parser position. Pity, maybe there should be new operators for matching as well.

guiprav · 2014-10-15T14:07:05Z

@andreineculau, that's indeed better since it avoids the clunky "magic index" my code introduces, and writing that is indeed pretty simple. The ability to define and instantiate "template rules" might still be an important addition, though, to allow for more complex grammars without resorting to duplication.

I haven't been using PEG.js lately, but I know when I was, there were some recurring rule patterns that were not so trivial to factor into simpler reusable ones like you did; and, dare I say, some seemed simply impossible. So duplication was unavoidable.

Unfortunately, I don't have examples of those handy anymore. Anyways, I'm not asking anything here, I'm just saying, really, FWIW.

ceymard mentioned this issue Feb 1, 2012

Allowing variable literals prefixed by % #47

Closed

andreineculau mentioned this issue Oct 11, 2014

New prefixed types #297

Closed

dmajda changed the title ~~Add the ability to use variables in rules~~ Parametrize the grammar by externally-supplied variables Aug 14, 2015

This was referenced Feb 3, 2020

Implement parametrizable rules #45

Open

Implement a simpler way to express lists with a separator #107

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parametrize the grammar by externally-supplied variables #36

Parametrize the grammar by externally-supplied variables #36

ceymard commented Aug 15, 2011

dmajda commented Aug 20, 2011

ceymard commented Aug 20, 2011

guiprav commented Oct 26, 2011

Skalman commented Dec 10, 2012

fusepilot commented Sep 28, 2014

andreineculau commented Oct 11, 2014

guiprav commented Oct 15, 2014

Parametrize the grammar by externally-supplied variables #36

Parametrize the grammar by externally-supplied variables #36

Comments

ceymard commented Aug 15, 2011

dmajda commented Aug 20, 2011

ceymard commented Aug 20, 2011

guiprav commented Oct 26, 2011

Skalman commented Dec 10, 2012

fusepilot commented Sep 28, 2014

andreineculau commented Oct 11, 2014

guiprav commented Oct 15, 2014