Allow whitespace around section headers #9

stasm · 2016-09-14T13:31:33Z

Fixes #5. @zbraniecki -- this implements the changes we talked about last week to characters allowed in keywords, including white-space. Note that currently the l20n.js parser doesn't allow non-Latin characters in keywords. Should we update the spec here for now?


zbraniecki · 2016-09-14T19:48:52Z

grammar.ebnf


-identifier           ::= [a-zA-Z_.?-] ([a-zA-Z0-9_.?-])*;
+identifier           ::= identifier-head (identifier-tail)*;


since you don't use identifier-head or identifier-tail, what's the value of it over:

identifier ::= [a-zA-Z_.?-] [a-zA-Z0-9_.?-]*;

?

what do you mean I don't use it? I use it right in the line you commented on

it's much more readable that way and consistent with keyword- chars. it makes it clear to the reader that there are different sets of characters involved.

I mean - this is the only place where you use it. Since it's not shared with other bits, I'm not sure if there's a value in keeping it as a separate entity.

I believe that it's clear that there are different characters involved since instead of [...]+ we give [...] [...]*.
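To make the two spellings under discussion concrete, here is a minimal sketch that transcribes both forms of the identifier rule into Python regular expressions and checks that they accept the same strings. The names INLINE, NAMED, IDENTIFIER_HEAD, IDENTIFIER_TAIL and is_identifier are illustrative assumptions and are not part of the grammar or of l20n.js.

```python
import re

# Inlined form, as suggested in the review comment.
INLINE = re.compile(r"[a-zA-Z_.?-][a-zA-Z0-9_.?-]*")

# Named-fragment form, mirroring identifier-head / identifier-tail.
# These Python names are hypothetical stand-ins for the grammar symbols.
IDENTIFIER_HEAD = r"[a-zA-Z_.?-]"
IDENTIFIER_TAIL = r"[a-zA-Z0-9_.?-]"
NAMED = re.compile(f"{IDENTIFIER_HEAD}(?:{IDENTIFIER_TAIL})*")


def is_identifier(text, pattern=NAMED):
    """Return True if the whole string matches the identifier rule."""
    return pattern.fullmatch(text) is not None


# Both spellings describe the same language; only readability differs.
samples = ["foo", "foo-bar", "a.b?", "_x1", "1foo", "", "foo bar"]
same = all(is_identifier(s, INLINE) == is_identifier(s, NAMED) for s in samples)
```

Since the two patterns are equivalent, the choice between them is purely about how the grammar reads, which is the point both reviewers are arguing.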

zbraniecki · 2016-09-14T19:49:10Z

grammar.ebnf

 variable             ::= '$' identifier;
-keyword              ::= [^=|#{}\[\]()]+;
+keyword              ::= keyword-head (keyword-tail* keyword-last)?


same for keywords

I'm concerned that the following will look cryptic:

keyword ::= [^=|#/{}\[\]()0-9] ([^=|#/{}\[\]() ]* [^=|#/{}\[\]()])?;

Then again, the grammar is intended for parsers, not humans, so maybe that's okay. I would leave it in the clearer form at least for now, since we don't yet use this grammar to generate parsers, which in turn would allow us to fix bugs in the grammar itself. Thus readable-by-human is still pretty important to me.

ok, I disagree but not gonna block on this. :)
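The head (tail* last)? shape can be sketched as a regular expression built from three named pieces. This is only an illustration: it assumes, which the thread does not confirm, that the tail class permits a space while the head and last classes do not, so that whitespace may appear inside a keyword but not at its edges. KEYWORD_HEAD, KEYWORD_TAIL, KEYWORD_LAST and is_keyword are hypothetical names.

```python
import re

# ASSUMPTION: these three classes are illustrative, not the PR's definitions.
KEYWORD_HEAD = r"[^=|#/{}\[\]()0-9 ]"  # no special chars, digits or space
KEYWORD_TAIL = r"[^=|#/{}\[\]()]"      # interior chars; a space is allowed
KEYWORD_LAST = r"[^=|#/{}\[\]() ]"     # final char; a space is not allowed

# keyword ::= keyword-head (keyword-tail* keyword-last)?
KEYWORD = re.compile(f"{KEYWORD_HEAD}(?:{KEYWORD_TAIL}*{KEYWORD_LAST})?")


def is_keyword(text):
    """Return True if the whole string matches the sketched keyword rule."""
    return KEYWORD.fullmatch(text) is not None


# Interior whitespace is accepted; leading or trailing whitespace is not.
checks = {s: is_keyword(s) for s in ["one", "New York", "New York ", " one", "1st"]}
```

Naming the three pieces makes it visible at a glance that the first, middle and last characters are drawn from different sets, which is harder to see when the classes are inlined.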

stasm · 2016-09-14T22:43:48Z

@zbraniecki what are your thoughts on the current behavior of the l20n.js parser wrt. the chars allowed in keywords?

zbraniecki · 2016-09-14T22:55:56Z

umm, not sure what you are asking about. The l20n.js parser handles characters allowed in keywords?

stasm · 2016-09-14T23:18:12Z

The grammar specifies that all characters except a handful of special ones like /, # etc. are allowed in keywords. However, the following doesn't parse in l20n.js's parsers (ast and runtime):

foo =
    [ą] Ą

I'm asking if you have cycles to fix this in the parser or would you rather have me change the grammar for now and limit keywords to ASCII chars. Allowing more chars in the future will be backwards-compatible.
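The gap between the lax rule and an ASCII-only rule, and why widening later is backwards-compatible, can be sketched as follows. LAX_KEYWORD transcribes the original keyword rule from the diff; ASCII_KEYWORD is a hypothetical restriction written for this example and is not the rule that was eventually committed, nor a model of the l20n.js parser.

```python
import re

# Original rule from the diff: anything except a few special characters.
LAX_KEYWORD = re.compile(r"[^=|#{}\[\]()]+")

# ASSUMPTION: a hypothetical ASCII-only variant of the same rule, i.e. the
# same excluded characters, further limited to code points 0x00-0x7F.
ASCII_KEYWORD = re.compile(r"(?:(?![=|#{}\[\]()])[\x00-\x7F])+")

samples = ["one", "other", "ą", "a=b"]

# The non-ASCII keyword from the example is valid under the lax rule only.
lax_ok = [s for s in samples if LAX_KEYWORD.fullmatch(s)]
ascii_ok = [s for s in samples if ASCII_KEYWORD.fullmatch(s)]

# Everything the restricted rule accepts is also accepted by the lax rule,
# so relaxing the restriction later cannot invalidate existing keywords.
is_subset = set(ascii_ok) <= set(lax_ok)
```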

zbraniecki · 2016-09-15T00:13:11Z

I think I'd prefer to limit the chars in the spec for now. I'm concerned about non-ASCII characters being more confusing than helpful at this point.

stasm · 2016-09-16T09:50:15Z

I removed the additional symbols for allowed characters and also restricted keywords to ASCII for now. I'd like to go back to being more lax in the future. I also considered using named character classes like [:alpha:] but they're locale-dependent.
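The concern about named classes can be illustrated by analogy. Python's re module has no [:alpha:], but its \w shorthand shows a similar effect: what it matches depends on the matching mode, whereas an explicit range always means the same thing. This is a sketch of the general idea only and says nothing about how any particular EBNF tool treats [:alpha:].

```python
import re

# Explicit ranges: the accepted set is spelled out and never changes.
EXPLICIT = re.compile(r"[A-Za-z0-9_]+")

# Shorthand class: Unicode-aware by default for str patterns ...
SHORTHAND_UNICODE = re.compile(r"\w+")
# ... but limited to ASCII when the re.ASCII flag is set.
SHORTHAND_ASCII = re.compile(r"\w+", re.ASCII)

text = "ą"

explicit_match = EXPLICIT.fullmatch(text) is not None
unicode_match = SHORTHAND_UNICODE.fullmatch(text) is not None
ascii_match = SHORTHAND_ASCII.fullmatch(text) is not None
```

The same shorthand gives different answers for the same input depending on configuration, which is the kind of ambiguity a grammar avoids by listing its ranges explicitly.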

stasm · 2016-09-16T09:50:26Z

@zbraniecki can you take another look?

stasm · 2016-10-10T12:40:17Z

@zbraniecki ping

zbraniecki · 2016-10-12T21:15:41Z

looks great! Sorry for holding you for so long.

Allow whitespace around section headers

45c324a

Fixes #5.

zbraniecki suggested changes Sep 14, 2016


zbraniecki approved these changes Sep 15, 2016


Don't introduce new symbols for allowed characters

7aee7fd

stasm merged commit 126b422 into master Oct 13, 2016

stasm deleted the member-key branch October 13, 2016 12:54
