Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NormalizerNFKC: add unify_katakana_g_sounds option #1501

Merged
merged 12 commits into from
Feb 2, 2023

Conversation

HashidaTKS
Copy link
Contributor

@HashidaTKS HashidaTKS commented Feb 1, 2023

When unify_katakana_g_sounds is specified, NormalizerNFKC* normalize characters as below.

グァ -> ガ
グィ -> ギ
グェ -> ゲ
グォ -> ゴ

Usage:

normalize \
  'NormalizerNFKC130("unify_katakana_g_sounds", true, \
                     "report_source_offset", true)' \
  "グァグィグェグォ" \
  WITH_CHECKS|WITH_TYPES

@HashidaTKS HashidaTKS changed the title NormalizerNFKC: add unify_katakana_g_sounds NormalizerNFKC: add unify_katakana_g_sounds option Feb 1, 2023
@HashidaTKS HashidaTKS marked this pull request as ready for review February 2, 2023 03:17
lib/normalizer.c Outdated Show resolved Hide resolved
lib/normalizer.c Show resolved Hide resolved
@kou
Copy link
Member

kou commented Feb 2, 2023

Could you rebase on master and add a test for NormalizerNFKC150?

@HashidaTKS
Copy link
Contributor Author

Could you rebase on master and add a test for NormalizerNFKC150?

Added.

@HashidaTKS
Copy link
Contributor Author

@kou

Thank you for your comments.
I have addressed your comments.

@kou kou merged commit b16ba79 into master Feb 2, 2023
@kou kou deleted the add-unify_katakana_g_sounds branch February 2, 2023 07:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants