Skip to content

feat : Add keepDecimalPoint option to Nori (KoreanTokenizer)#15895

Closed
twosom wants to merge 2 commits into
apache:mainfrom
twosom:add_nori_keep_decimal_point
Closed

feat : Add keepDecimalPoint option to Nori (KoreanTokenizer)#15895
twosom wants to merge 2 commits into
apache:mainfrom
twosom:add_nori_keep_decimal_point

Conversation

@twosom

@twosom twosom commented Mar 31, 2026

Copy link
Copy Markdown
Contributor

Description

This PR implements the keepDecimalPoint configuration option for KoreanTokenizer in the Nori module, as proposed in issue #15894 .

The goal is to prevent numeric values containing decimal points (e.g., "10.1") from being fragmented into multiple tokens during the lattice construction phase. This ensures that the semantic integrity of measurements, version numbers, and economic data is preserved from the initial analysis stage.

@github-actions github-actions Bot added this to the 11.0.0 milestone Mar 31, 2026
@twosom twosom closed this Mar 31, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant