Improve location information in JsonReader exception messages #1764

Marcono1234 · 2020-08-25T17:07:35Z

Based on #1743
Fixes #1564

Improves the location information shown in JsonReader exception messages and improves the tests by making them check the exception messages as well.

Improved location information:

Partially due to Fix JsonReader advancing before throwing exception due to malformed JSON #1743 since the position is not changed if malformed JSON is encountered
Unexpected token will report start of peeked token:
Previously when the peeked token did not match the expected one, the exception message reported the location behind the token (instead of in front of it), e.g.:
```
new JsonReader(new StringReader("true")).beginObject();
// Expected BEGIN_OBJECT but was BOOLEAN at line 1 column 5 path $
```
Add location information to some exceptions which previously did not have location information
Don't have strict reader suggest lenient mode when that would not help:
Previously a strict reader would suggest lenient mode even in cases where lenient mode would not be able to read the JSON either.

Previously JsonReader.doPeek() would have already advanced or modified the state of the reader when malformed JSON was encountered and an exception was thrown. Therefore repeatedly calling methods which peek would result in inconsistent exceptions and could even make the JSON "valid" again by simply advancing past the malformed JSON. Now the state of the reader is only modified when valid JSON is encountered. Note that there are a few remaining cases where this issue still occurs, which will be fixed in subsequent commits. This change required adding more JsconScope constants because if there is enough space (or a long comment) between tokens the skipped content could not have been stored in the buffer, e.g. `{"a": ... :1}`. Here a value is expected after the `:`. However there are a lot of whitespaces followed by a second `:` (i.e. malformed JSON). With the previous JsonScopes it would not have been possible to represent this state correctly. Therefore a subsequent peek would have consumed the second `:` making the JSON "valid" again.

…scape Previously JsonReader.nextString() and skipValue() consumed the '\' of an escape sequence before throwing an exception for invalid escape sequences. Therefore a subsequent method call would have read a "valid" string. Now the stream position is only advanced when the escape sequence is valid.

Check for exception messages and make caught exception type more specific in some cases to make sure tests do not catch wrong exception and erroneously assume that code works as expected. Removes JsonWriterTest.testStrictWriterDoesNotPermitMultipleTopLevelValues() because it is the same as testMultipleTopLevelValues().

For a malformed unicode escape sequence the exception message thrown be the method erroneously only included the first 4 chars which is `\uXX`, missing the last 2 hex chars.

…ion-message-incorrect-position

Previously when JsonReader threw an exception describing an issue with the peeked value (e.g. it being different than the expected one) it used the current `lineNumber` and `pos`. This resulted in confusing error messages because both values point behind the value responsible for the error. For example: new JsonReader(new StringReader("true")).beginObject(); // Expected BEGIN_OBJECT but was BOOLEAN at line 1 column 5 path $ With these changes the exception messages now correctly refer to the start of the peeked value.

Adds location information to some exceptions thrown by JsonReader which previously did not have any location information.

Previously JsonReader threw an exception when a number or keyword was followed by a lenient non-literal. This made JsonReader consider the complete number / keyword malformed even though only the separator is malformed. To be consistent with EOF exceptions and exceptions for string values, the exception is now only thrown after the number / keyword has been consumed and when the caller tries to peek at the next JSON token. For example: JsonReader reader = new JsonReader(new StringReader("[true;true]")); reader.beginArray(); // Would previously already have thrown exception; with these changes // only trying to read value after that will throw exception reader.nextBoolean(); Additionally JsonReader will now complain about non-literal unquoted name or value starts as being not a name / value instead of suggesting making the reader lenient when that would not help, e.g.: JsonReader reader = new JsonReader(new StringReader("=")); // Previously this threw exception suggesting to make reader lenient // However, lenient reader won't accept that JSON either reader.nextString();

…lues Previously a strict JsonReader suggested enabling lenient mode for multiple top level values even if the subsequent values would not be accepted in lenient mode either.

All callers which previously wanted isLiteral(...) to check whether the reader is lenient for comments are calling nextNonWhitespace(boolean) before that, which already performs the lenient check.

Marcono1234 added 13 commits July 24, 2020 19:27

Fix JsonReader.doPeek() advancing after incomplete block comment

604fb47

Add test for double non-execute prefix

966f717

Fix JsonReader.readEscapeCharacter() exception message being incomplete

f86b579

For a malformed unicode escape sequence the exception message thrown be the method erroneously only included the first 4 chars which is `\uXX`, missing the last 2 hex chars.

Merge branch 'JsonReader-malformed-advancing' into marcono1234/except…

4e135d4

…ion-message-incorrect-position

Add location information to some exceptions thrown by JsonReader

adfc22e

Adds location information to some exceptions thrown by JsonReader which previously did not have any location information.

Fix JsonReader consuming # before checking if lenient

a776ee0

Don't have JsonReader suggest lenient mode for malformed top level va…

e8acca4

…lues Previously a strict JsonReader suggested enabling lenient mode for multiple top level values even if the subsequent values would not be accepted in lenient mode either.

Don't have JsonReader.isLiteral check lenient

3301328

All callers which previously wanted isLiteral(...) to check whether the reader is lenient for comments are calling nextNonWhitespace(boolean) before that, which already performs the lenient check.

google-cla bot added the cla: yes label Aug 25, 2020

Marcono1234 mentioned this pull request Jul 24, 2022

Improve Gson and JsonParser trailing data handling #2123

Open

Marcono1234 mentioned this pull request Apr 12, 2023

Proposal to expose Position information used for error messages in JSonReader in public API #2373

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve location information in JsonReader exception messages #1764

Improve location information in JsonReader exception messages #1764

Marcono1234 commented Aug 25, 2020 •

edited

Improve location information in JsonReader exception messages #1764

Are you sure you want to change the base?

Improve location information in JsonReader exception messages #1764

Conversation

Marcono1234 commented Aug 25, 2020 • edited

Marcono1234 commented Aug 25, 2020 •

edited