Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow interpuncts in phrases #776

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

groverlynn
Copy link
Contributor

@groverlynn groverlynn commented Dec 12, 2023

Pull request

Issue tracker

Fixes will automatically close the related issue

Fixes #

Feature

Allow certain interpuncts in phrases in dictionaries. Essay already has some of them in the entries but disabled for now.
This allows easier input of transliterated names (e.g. 尚-雅克‧盧梭 or 让‐雅克·卢梭, with the same code as 尚雅克盧梭 or 让雅克卢梭), disambiguation names (刚果(金), with the same code as 刚果金), and poem verses (e.g. 鵝,鵝,鵝,曲項向天歌, with the same code as 鵝鵝鵝曲項向天歌)

Setting in dictionary: encoder/free_puncts
The value is one single string containing all of the intended free punctuations (can include any characters, not necessarily punctuations).
The default value is ·‧‐-,() (U+00B7 middle dot, U+2027 hyphenation point, U+2010 hyphen, U+FF0D fullwidth hyphen-minus, U+FF0C fullwidth comma, U+FF08 fullwidth left parenthesis, U+FF09 fullwidth right parenthesis). Use the default free puncts, set encoder/free_puncts to "default" or "preset". To opt-out, set the value to an empty string ("")

Unit test

  • Done

Manual test

  • Done

Code Review

  1. Unit and manual test pass
  2. GitHub Action CI pass
  3. At least one contributor reviews and votes
  4. Can be merged clean without conflicts
  5. PR will be merged by rebase upstream base

Additional Info

@lotem
Copy link
Member

lotem commented Dec 17, 2023

This feature should not be added as a hidden behaviour. There are input schemas which do encode symbols.

Consider creating a separate tool for editing code in the yaml dictionary.
Then the Rime deployer do not need to handle these cases.

@groverlynn groverlynn force-pushed the interpuncts branch 2 times, most recently from ecfda1b to 52961a2 Compare February 2, 2024 22:39
@groverlynn
Copy link
Contributor Author

This feature should not be added as a hidden behaviour. There are input schemas which do encode symbols.

Consider creating a separate tool for editing code in the yaml dictionary. Then the Rime deployer do not need to handle these cases.

just revised as you suggested

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants