ii: remove redudant normalize on search as much as possible #1421
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Some normalizes such as
NormalizerTable
aren't idempotent. So we should not normalize the inputmultiple times.
The current
token_info_open()
related codes normalize the input multiple times. This change avoidsmultiple normalizations as much as possible. If a tokenizer enables
report_source_location
, we canavoid multiple normalizations all time. But if a tokenizer doesn't enable
report_source_location
, we maynormalize the input multiple times.
See the added tests for problem cases.