add korean #787

Merged 9 commits on May 14, 2024

StefanVukovic99
Member

For now, this is groundwork for @Lyroxide

Korean apparently needs hangul to be decomposed into jamo, deinflected, and then composed back into hangul. Text preprocessors can handle the disassembly; this PR adds "text postprocessors" to handle the reassembly.
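
To illustrate the decompose, deinflect, recompose pipeline described above, here is a minimal sketch in Python. It is not the code from this PR: it assumes Unicode normalization (NFD/NFC) as a stand-in for the hangul library, and the function names and the single past-tense rule are hypothetical, chosen only to show why deinflection has to operate on jamo rather than on whole syllables.

```python
import unicodedata


def disassemble(text: str) -> str:
    """Preprocessor stand-in: split hangul syllables into conjoining jamo (NFD)."""
    return unicodedata.normalize("NFD", text)


def reassemble(text: str) -> str:
    """Postprocessor stand-in: compose jamo back into hangul syllables (NFC)."""
    return unicodedata.normalize("NFC", text)


# Hypothetical toy rule: the past-tense marker can surface as a final
# consonant (U+11BB) fused into the preceding syllable, followed by the
# ending "다". The rule is only visible once the syllable is split into jamo.
PAST_JAMO_SUFFIX = "\u11BB" + disassemble("다")
DICTIONARY_JAMO_SUFFIX = disassemble("다")


def deinflect_past(word: str) -> str:
    """Disassemble, strip the toy past-tense suffix, then reassemble."""
    jamo = disassemble(word)
    if jamo.endswith(PAST_JAMO_SUFFIX):
        jamo = jamo[: -len(PAST_JAMO_SUFFIX)] + DICTIONARY_JAMO_SUFFIX
    return reassemble(jamo)


print(deinflect_past("갔다"))
```

In this sketch the past-tense marker lives inside the syllable 갔, so a suffix rule written against whole syllables could not match it; after disassembly it becomes an ordinary suffix, and the postprocessing step restores normal hangul for the dictionary lookup.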


github-actions bot commented Mar 24, 2024

✔️ No visual differences introduced by this PR.

View Playwright Report (note: open the "playwright-report" artifact)

@StefanVukovic99 StefanVukovic99 added area/linguistics The issue or PR is related to linguistics kind/enhancement The issue or PR is a new feature or request labels Mar 24, 2024
Lyroxide and others added 2 commits March 25, 2024 11:18
* add hangul functions

* hangul lib

* fix tests

* test inputs

* fix hangul disassemble, fix postprocessing

* rename test

* reset transformedText for multiple postprocessors

* add credits

* fix conflict

---------

Co-authored-by: Stefan Vukovic <stefanvukovic44@gmail.com>
Lyroxide and others added 3 commits May 12, 2024 15:04
* add hangul functions

* hangul lib

* fix tests

* test inputs

* fix hangul disassemble, fix postprocessing

* rename test

* reset transformedText for multiple postprocessors

* add credits

* fix conflict

* add transforms

* move substring

* textprocessing function

* mostly fixed

* delete comment

* mostly fixed

(cherry picked from commit 268c896)

* prepare preprocessors

(cherry picked from commit d139258)

* (reset) remove textSourceMap

* rawSource before textreplacements

* update tests

* fix transforms

* fix transforms

* 100% vetted

* add link to dict

---------

Co-authored-by: Stefan Vukovic <stefanvukovic44@gmail.com>
@StefanVukovic99 StefanVukovic99 marked this pull request as ready for review May 12, 2024 13:49
@StefanVukovic99 StefanVukovic99 requested a review from a team as a code owner May 12, 2024 13:49
@StefanVukovic99
Member Author

Ready for review 🙏

jamesmaa
jamesmaa previously approved these changes May 14, 2024
@jamesmaa jamesmaa added this pull request to the merge queue May 14, 2024
Merged via the queue into themoeway:master with commit 82049f3 May 14, 2024
11 checks passed
Labels
area/linguistics The issue or PR is related to linguistics kind/enhancement The issue or PR is a new feature or request
3 participants