Lack of lexical token delimiter specification #49

Open
liancheng opened this issue Mar 19, 2018 · 0 comments

It seems that the current JMESPath specification doesn't specify any lexical token delimiters, which leads to ambiguity and confusion.

For example, consider the following two grammar rules in the spec:

comparator-expression = expression comparator expression
raw-string            = "'" *raw-string-char "'"

Sub-rules are concatenated in the same way in both rules, but the first one clearly allows whitespace between expression and comparator, since the examples page contains the following official example:

people[?age > `20`].[name, age]

while whitespace is not allowed between "'" and *raw-string-char, since a raw-string is expected to be a single lexical token.
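
To make the asymmetry concrete (a hypothetical illustration, not taken from the spec): the two comparator expressions below are equivalent, because whitespace around > is insignificant, whereas the two raw strings are distinct values, because the spaces become part of the token itself:

people[?age>`20`].[name, age]
people[?age > `20`].[name, age]

'foo'
' foo '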

After studying RFC 4234, I found the following statements in section 3.1:

   LINEAR WHITE SPACE: Concatenation is at the core of the ABNF parsing
   model.  A string of contiguous characters (values) is parsed
   according to the rules defined in ABNF.  For Internet specifications,
   there is some history of permitting linear white space (space and
   horizontal tab) to be freely and implicitly interspersed around major
   constructs, such as delimiting special characters or atomic strings.

   NOTE:

      This specification for ABNF does not provide for implicit
      specification of linear white space.

   Any grammar that wishes to permit linear white space around
   delimiters or string segments must specify it explicitly.  It is
   often useful to provide for such white space in "core" rules that are
   then used variously among higher-level rules.  The "core" rules might
   be formed into a lexical analyzer or simply be part of the main
   ruleset.

Shall we specify lexical token delimiters explicitly and distinguish lexical rules from grammar rules to eliminate the ambiguity?
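
A non-normative sketch of what that could look like, using a hypothetical ws "core" rule in the spirit of RFC 4234 section 3.1 (neither the ws rule nor these annotations are part of the current spec):

ws                    = *(%x20 / %x09)   ; "core" rule: optional space / horizontal tab

; grammar rule: whitespace explicitly permitted around the comparator
comparator-expression = expression ws comparator ws expression

; lexical rule: no ws inside, so a raw-string remains a single token
raw-string            = "'" *raw-string-char "'"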
