Skip to content

More punctuation normalization fixes for Chinese#1107

Merged
evgenyrp merged 2 commits intomainfrom
zh-punctuation
May 2, 2025
Merged

More punctuation normalization fixes for Chinese#1107
evgenyrp merged 2 commits intomainfrom
zh-punctuation

Conversation

@ZJaume
Copy link
Copy Markdown
Collaborator

@ZJaume ZJaume commented May 2, 2025

This change adds more flexibility for some of the regex and adds normalization for parenthesis. It also updates OpusCleaner, which now has the OpusFilter regex filter as monolingual to avoid messing up columns or English side.

This change adds more flexibility for some of the regex and adds
normalization for parenthesis. It also updates OpusCleaner, which now
has the OpusFilter regex filter as monolingual to avoid messing up
columns or English side.
@ZJaume ZJaume requested a review from a team as a code owner May 2, 2025 15:19
@ZJaume ZJaume requested a review from evgenyrp May 2, 2025 15:19
@evgenyrp evgenyrp merged commit cdeb57c into main May 2, 2025
53 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants