add line and column information to scanner and visitor #17

P0lip · 2019-03-14T14:51:27Z

This PR implements an additional, optional onLineBreak function.
At the moment, jsonc-parser uses offset and length for positional info, which is totally fine.
Unfortunately, quite a few tools and editors operate on lines and columns, so integrating jsonc-parser is a bit more troublesome: you need to fall back to the scanner/tokenizer and perform the entire parsing process yourself. (At least, I couldn't find any reasonable way to derive line- and column-based positions without using the scanner.)
Hope the above reasoning makes sense.
I am not sure about the presence of lineNumber and whether it should be zero-based or not. I believe we could get rid of it and let the consumer implement it if needed.
Moreover, I'm afraid we cannot really handle line breaks in multi-line comments due to the way these comments are parsed.
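
For illustration, the offset-to-line/column conversion that consumers otherwise have to do themselves might look like the sketch below. The names `TokenPosition`, `computeLineStarts` and `positionAt` are hypothetical and not part of jsonc-parser; the sketch assumes zero-based lines and characters and treats `\r\n` as a single line break.

```typescript
// Hypothetical helper types and functions; none of these come from jsonc-parser.
interface TokenPosition {
  line: number;      // zero-based line index
  character: number; // zero-based offset within the line (UTF-16 code units)
}

// Collect the offset at which every line starts.
function computeLineStarts(text: string): number[] {
  const starts = [0];
  for (let i = 0; i < text.length; i++) {
    const ch = text.charCodeAt(i);
    if (ch === 13 /* \r */) {
      // Treat \r\n as one line break.
      if (text.charCodeAt(i + 1) === 10 /* \n */) {
        i++;
      }
      starts.push(i + 1);
    } else if (ch === 10 /* \n */) {
      starts.push(i + 1);
    }
  }
  return starts;
}

// Binary search for the last line start that is <= offset.
function positionAt(lineStarts: number[], offset: number): TokenPosition {
  let low = 0;
  let high = lineStarts.length - 1;
  while (low < high) {
    const mid = Math.ceil((low + high) / 2);
    if (lineStarts[mid] <= offset) {
      low = mid;
    } else {
      high = mid - 1;
    }
  }
  return { line: low, character: offset - lineStarts[low] };
}
```

A consumer would build the table once per document and then call `positionAt` for every offset reported by the parser, which is the extra bookkeeping this PR tries to make unnecessary.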

msftclas · 2019-03-14T14:51:39Z

All CLA requirements met.

aeschli · 2019-03-18T10:03:35Z

Cool, thanks a lot!

aeschli · 2019-03-18T10:23:06Z

Not having newlines reported in comments is a problem, especially as the scanner always scans and consumes comments. It's only the parser that ignores them and reports an error if they are not allowed.
So if there happens to be a /* in the content, all line counting is off.
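
A small, self-contained sketch of that drift: it counts line breaks before a property once over the full text and once with block comments stripped out. The regular expressions are a naive stand-in for the scanner (they would also match `/*` inside a string literal), and all names are hypothetical.

```typescript
// Naive patterns for illustration only; the real scanner tokenizes properly.
const LINE_BREAK = /\r\n|\r|\n/g;
const BLOCK_COMMENT = /\/\*[\s\S]*?\*\//g;

// Count every line break in the given text.
function countLineBreaks(text: string): number {
  const matches = text.match(LINE_BREAK);
  return matches ? matches.length : 0;
}

// Count line breaks as a consumer would see them if breaks inside
// block comments were never reported.
function lineBreaksIgnoringBlockComments(text: string): number {
  return countLineBreaks(text.replace(BLOCK_COMMENT, ''));
}

const sample = '{\n  /* a\n     b */\n  "x": 1\n}';
const prefix = sample.slice(0, sample.indexOf('"x"'));

const actualLine = countLineBreaks(prefix);                  // 3 (zero-based)
const driftedLine = lineBreaksIgnoringBlockComments(prefix); // 2, off by one
```

Every multi-line block comment earlier in the document adds to the error, so positions drift further the longer the file gets.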

So your original idea of reporting the line number with the onSeparator is probably the right idea.

I'm also not opposed to returning the line number/character in every callback (onObjectBegin, ...).
It should be 0-based.

P0lip · 2019-03-19T14:27:23Z

Thanks a lot for your feedback!

I decided to implement an extra getTokenLineNumber method in JSONScanner.

At the moment, the value of a BlockCommentTrivia token contains line-break characters, so line breaks occurring within a block/multi-line comment aren't treated as separate tokens/trivia.
Changing the representation of BlockCommentTrivia is potentially possible, as we could break it into multiple tokens, i.e. BlockCommentSegmentTrivia and LineBreakTrivia, but I am not sure whether it's worth it.
In my opinion, the above approach would do more harm than good. Obviously, the change would be breaking, as the scanner would need to be suspended on a line break in a block comment and then resumed in the same place. Because of this, offsets and lengths would be altered. In general, handling block comments would most likely be more challenging, since they wouldn't be represented as a single token.
I believe accessing a single line of a multi-line comment is not a common need and can be achieved on demand (for instance by splitting the value of the token) if actually needed.
In most cases, having the start/end line number of a block comment is sufficient.
As the onLineBreak visit method receives a zero-based line number, implementing range/position for block comments would be trivial and the current usage wouldn't be affected.
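
As a rough sketch of the "on demand" approach, a consumer that knows where a block comment starts and has its text could derive the end position by walking the comment value. `TokenPosition` and `blockCommentEnd` are hypothetical names, not jsonc-parser API; positions are assumed to be zero-based and counted in UTF-16 code units.

```typescript
// Hypothetical helper; not part of jsonc-parser.
interface TokenPosition {
  line: number;      // zero-based
  character: number; // zero-based
}

// Walk the comment text and advance the position, resetting the
// character counter after every line break (\n, \r or \r\n).
function blockCommentEnd(start: TokenPosition, commentText: string): TokenPosition {
  let line = start.line;
  let character = start.character;
  for (let i = 0; i < commentText.length; i++) {
    const ch = commentText.charCodeAt(i);
    if (ch === 13 /* \r */ || ch === 10 /* \n */) {
      // Treat \r\n as one line break.
      if (ch === 13 && commentText.charCodeAt(i + 1) === 10) {
        i++;
      }
      line++;
      character = 0;
    } else {
      character++;
    }
  }
  return { line, character };
}
```

The returned position is exclusive, i.e. it points just past the closing `*/`.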

The only bit I'm worried about is the confusion the onLineBreak visit method may introduce. One may expect onLineBreak to be executed on every single line break, while in fact it's invoked only when an actual line-break token (LineBreakTrivia) is encountered. I included that information in the README, yet it may not be obvious when a line break is represented as LineBreakTrivia and when it is not.

We could also remove onLineBreak and simply add line number as an argument to every visit method.
Happy to hear your feedback. Don't really know which way is more ergonomic/clear.
Both approaches would be backward-compatible, so they are worth considering.
If we go with the latter (no onLineBreak + line passed to each visit method), we might be obliged to provide the column/character as well.

I hope I took into account all line breaks that may occur in the scanning process.

aeschli · 2019-03-19T16:04:25Z

> We could also remove onLineBreak and simply add line number as an argument to every visit method.

That would be my favourite, as it's the most consistent.

P0lip · 2019-03-20T08:30:26Z

> That would be my favourite as it's the most consistent.

Should we pass a zero-based column alongside the current line as well?

aeschli · 2019-03-20T08:46:47Z

Yes, line and character (following the naming in the LSP).
In theory it should be startLine/startColumn & endLine/endColumn.
But for all tokens the end position is easy to compute with the length, except for multi line comments.
I'd be ok to not have endLine/endColumn for the moment.
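
To make the point about the end position concrete, here is a minimal sketch of deriving an LSP-style range for a token that does not span lines, given its start line, start character and length. `TokenPosition`, `TokenRange` and `singleLineTokenRange` are hypothetical names, and the sketch assumes zero-based values; it is not valid for multi-line block comments.

```typescript
// Hypothetical types, loosely modelled on the LSP's line/character naming.
interface TokenPosition {
  line: number;
  character: number;
}

interface TokenRange {
  start: TokenPosition;
  end: TokenPosition; // exclusive
}

// For a token that stays on one line, the end is simply start + length.
function singleLineTokenRange(startLine: number, startCharacter: number, length: number): TokenRange {
  return {
    start: { line: startLine, character: startCharacter },
    end: { line: startLine, character: startCharacter + length },
  };
}
```

For example, a 3-character property token starting at line 1, character 2 yields a range ending at line 1, character 5.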

P0lip · 2019-03-20T10:36:57Z

> But for all tokens the end position is easy to compute with the length, except for multi line comments.
> I'd be ok to not have endLine/endColumn for the moment.

Sounds good to me.
If one really needs to compute the end line for multi-line comments, it's still feasible: it's just a matter of running the scanner or consuming other visit methods (which might be a bit troublesome or less reliable, though, as other tokens might be placed on the same line as a multi-line comment).

aeschli · 2019-03-21T08:18:29Z

Looks good!

aeschli · 2019-03-29T14:23:31Z

published as 2.1

vscodebot bot assigned aeschli Mar 14, 2019

aeschli reviewed Mar 18, 2019

add onLineBreak visit function

7971774

sort out whitespaces in readme

7479e45

aeschli requested changes Mar 21, 2019

getTokenStartLine/StartCharacter

85b5249

aeschli approved these changes Mar 21, 2019

aeschli merged commit b9ecc3b into microsoft:master Mar 21, 2019

aeschli added this to the March 2019 milestone Mar 29, 2019

aeschli changed the title ~~add onLineBreak visit function~~ add line and column information to scanner and visitor Mar 29, 2019
