Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Allow Unicode characters everywhere #1017

Merged
merged 3 commits into from Jul 31, 2023
Merged

Conversation

kesara
Copy link
Member

@kesara kesara commented Jul 11, 2023

This PR that does the following:

  • Remove guards for non-ASCII & non-Latin characters.
  • Removes automatic downcoding of non-ASCII characters.
  • Allows Unicode characters in all elements.
  • With the --warn-bare-unicode command line option, xml2rfc will warn if Unicode characters are present in any element that traditionally Unicode content is not allowed.

Traditionally, Unicode content is allowed in the following elements: artwork, city, cityarea, code, country, email, extaddr, organization, pobox, postalLine, refcontent, region, sortingcode, sourcecode, street, title, u.
With #895 bare use of Unicode characters is added to the t element. But it's implemented in such a way that --warn-bare-unicode warns about it. This PR will keep that warning.

Fixes #960.

@kesara kesara changed the title feat: Allow Unicode in everywhere feat: Allow Unicode characters everywhere Jul 12, 2023
@kesara kesara marked this pull request as ready for review July 27, 2023 20:49
@kesara kesara requested a review from rjsparks July 27, 2023 21:00
@kesara kesara merged commit 46aecfb into ietf-tools:main Jul 31, 2023
13 checks passed
@kesara kesara deleted the feat/unicode branch July 31, 2023 00:32
kesara added a commit to kesara/xml2rfc that referenced this pull request Feb 19, 2024
PR ietf-tools#1017 allowed Unicord characters everywhere.
This fixes a bug that xml2rfc was gurding in againts non-ASCII characters
in attribute values.
With `--warn-bare-unicode`, xml2rfc will warn if non-ASCII characters are
present in attribute values.

Fixes ietf-tools#1105
kesara added a commit to kesara/xml2rfc that referenced this pull request Feb 19, 2024
PR ietf-tools#1017 allowed Unicord characters everywhere.
This fixes a bug that xml2rfc was guarding against non-ASCII characters
in attribute values.
With `--warn-bare-unicode`, xml2rfc will warn if non-ASCII characters are
present in attribute values.

Fixes ietf-tools#1105
kesara added a commit to kesara/xml2rfc that referenced this pull request Feb 20, 2024
PR ietf-tools#1017 allowed Unicord characters everywhere.
This fixes a bug that xml2rfc was guarding against non-ASCII characters
in attribute values.
With `--warn-bare-unicode`, xml2rfc will warn if non-ASCII characters are
present in attribute values.

Fixes ietf-tools#1105
kesara added a commit to kesara/xml2rfc that referenced this pull request Feb 20, 2024
PR ietf-tools#1017 allowed Unicord characters everywhere.
This fixes a bug that xml2rfc was guarding against non-ASCII characters
in attribute values.
With `--warn-bare-unicode`, xml2rfc will warn if non-ASCII characters are
present in attribute values.

Fixes ietf-tools#1105
kesara added a commit that referenced this pull request Feb 21, 2024
PR #1017 allowed Unicord characters everywhere.
This fixes a bug that xml2rfc was guarding against non-ASCII characters
in attribute values.
With `--warn-bare-unicode`, xml2rfc will warn if non-ASCII characters are
present in attribute values.

Fixes #1105
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Allow unicode in all elements
2 participants