This repository was archived by the owner on Dec 16, 2025. It is now read-only.
Markdown: Fix UTF8 chars breaking elements#366
Merged
mco-gh merged 3 commits intogooglecodelabs:masterfrom Dec 24, 2020
nlepage:fix-unicode-chars-breaking-elements
Merged
Markdown: Fix UTF8 chars breaking elements#366mco-gh merged 3 commits intogooglecodelabs:masterfrom nlepage:fix-unicode-chars-breaking-elements
mco-gh merged 3 commits intogooglecodelabs:masterfrom
nlepage:fix-unicode-chars-breaking-elements
Conversation
|
Any news on this enhancement @jatorre @KeyboardNerd ? 👍 |
ThwalT
approved these changes
May 14, 2020
…-breaking-elements
Contributor
Author
|
Just updated this to upstream's master:
@marcacohen I also had opened #367 but don't have time to update it, do whatever you want with it. |
Contributor
|
thanks! |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Using UTF8 characters such as emojis in markdown input may create "offsets" in the HTML output and may even break UTF8 characters.
Example 1
With the following markdown:
HTML output is:
Example 2
With the following markdown:
HTML output is:
Bug
splitSpaceRight()uses a rune index to split a string.splitSpaceRight()andsplitSpaceLeft()also have their final return reversed.Fix
splitSpaceRight()now uses a byte index to split the string.