[Suggestion] Add EOF to lexer #70

wyattjsmith1 · 2021-03-05T19:27:27Z

It is useful to be able to match the end of the input when lexing. Consider the following grammar in EBNF:

text_line ::= [a-zA-Z]* line_end
line_end ::= "\u000D\u000A" | "\u000A" | EOF

The grammar above accepts any run of alphabetic characters terminated by either a newline or the end of the file. As a result, a trailing newline at the end of the file is optional.
This grammar cannot be represented accurately in nimly because of the EOF. nimly could be extended with a $ symbol that means end of input, similar to regex:

niml fluentLexer[MyToken]:
  "[a..zA..Z]":
    MyAlphaToken(token.token)
  "[\u000D\u000A|\u000A|$]":
    MyLineEndToken()

loloicci · 2021-03-05T20:07:15Z

Thank you for your suggestion, @wyattjsmith1.

Having the lexer produce a token that represents EOF sounds like a good idea. However, the lexer should not treat EOF the same way as an ordinary character.

I suggest wrapping lexIter, defined in nimly/src/nimly/lexer.nim (line 116 in fa3a01e):

iterator lexIter*[T](nl: var NimlLexer[T]): T =

so that the wrapper produces the token for EOF (in this example, MyLineEndToken()) after the original lexIter stops iterating.

Does it solve your problem?
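
The wrapping described above can be sketched roughly as follows. This is only an illustration, not nimly code: MyToken, TokenKind, fakeLexIter, lexIterWithEof and collect are hypothetical names, and fakeLexIter is a stand-in for nimly's lexIter so that the snippet runs without the library. With nimly itself, the wrapper would presumably take nl: var NimlLexer[T] and loop over nl.lexIter instead.

```nim
# Hypothetical token type; a real project would use its own token type.
type
  TokenKind = enum
    tkAlpha, tkLineEnd
  MyToken = object
    kind: TokenKind
    text: string

iterator fakeLexIter(input: string): MyToken =
  ## Stand-in for nimly's lexIter (an assumption for this sketch):
  ## yields one token per "\n" and one per run of other characters,
  ## then simply stops at the end of the input.
  var i = 0
  while i < input.len:
    if input[i] == '\n':
      yield MyToken(kind: tkLineEnd, text: "\n")
      inc i
    else:
      let start = i
      while i < input.len and input[i] != '\n':
        inc i
      yield MyToken(kind: tkAlpha, text: input[start ..< i])

iterator lexIterWithEof(input: string): MyToken =
  ## Wrapper: forwards every token from the inner iterator, then
  ## yields one extra line-end token to stand for EOF.
  for tok in fakeLexIter(input):
    yield tok
  yield MyToken(kind: tkLineEnd, text: "")

proc collect(input: string): seq[MyToken] =
  ## Helper that gathers the wrapped iterator's output into a seq.
  for tok in lexIterWithEof(input):
    result.add tok
```

With this wrapper the parser always sees a line-end token after the last line, whether or not the input ends with a newline. If the input does end with a newline, the wrapper yields a second line-end token after it, so the grammar has to accept an empty final line.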

wyattjsmith1 · 2021-03-05T20:23:20Z

Ah, ok. That should work. Thanks for the advice!

wyattjsmith1 closed this as completed Mar 5, 2021

loloicci added the question and wontfix labels Mar 6, 2021

loloicci mentioned this issue Mar 6, 2021

Add Function Lexer Produce a Token for EOF #80

Open
