This repository has been archived by the owner on Aug 26, 2022. It is now read-only.
Bug 747403 -- refine section ID generation
This is a first step, namely matching MindTouch behavior for sections whose names contain non-ASCII characters. We now generate IDs in a similar way: any section name which contains only ASCII content merely has spaces replaced with underscores. A name which contains non-ASCII characters has each such character replaced by hexadecimal digits representing the byte(s) of its UTF-8 encoding, with each set of digits preceded by a dot.

The test cases are a sampling of non-ASCII and mixed-character-set section names and the slugs MindTouch generates for them. This does not guarantee absolute parity with MindTouch, but probably gets us close enough.

It also does not deal with the problem of a document in which not all section names are unique. So far as I can tell, this does not introduce any new problems; it merely perpetuates an old one, assuming any such documents exist.
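The rule described above can be sketched roughly as follows. This is an illustration, not the code from this commit: the function name `section_slug` is hypothetical, and it assumes that each UTF-8 byte gets its own dot-prefixed, two-digit uppercase hex group and that spaces in mixed-character names are also replaced with underscores.

```python
def section_slug(name: str) -> str:
    """Hypothetical helper: build a section ID from a heading's text.

    Assumptions (not confirmed by the commit message): uppercase hex,
    one ".XX" group per UTF-8 byte, and spaces always become underscores.
    """
    parts = []
    for ch in name:
        if ch == " ":
            # Spaces are replaced with underscores.
            parts.append("_")
        elif ord(ch) < 128:
            # Other ASCII characters pass through unchanged.
            parts.append(ch)
        else:
            # Non-ASCII: emit each byte of the UTF-8 encoding as ".XX".
            parts.extend(".%02X" % b for b in ch.encode("utf-8"))
    return "".join(parts)


if __name__ == "__main__":
    print(section_slug("Browser compatibility"))  # Browser_compatibility
    print(section_slug("Résumé"))                 # R.C3.A9sum.C3.A9
```

Under these assumptions, two sections with the same name still produce the same ID, so the sketch shares the uniqueness limitation noted above.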
1 parent 1ee73d6, commit f3594ea
Showing 2 changed files with 49 additions and 2 deletions.