skip keyword for GermanNormalizationFilter #14415

Closed
xzhang9292 wants to merge 4 commits into apache:main from xzhang9292:xinzhang92
@xzhang9292

The current GermanNormalizationFilter normalizes special German characters, for example ä to a and ü to u. For some words this makes sense: äpfel -> apfel is like apples -> apple. For others it does not: Bär -> Bar is like Bear -> Bar. This change uses KeywordAttribute to let users bypass normalization for specific words.
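
To illustrate the intended behavior, here is a minimal, Lucene-free sketch of the skip logic. It is not the code from this PR: the class `KeywordSkipSketch`, its methods, and the plain `Set<String>` standing in for keyword marking are all hypothetical, and the umlaut folding is a simplified subset of what GermanNormalizationFilter actually does.

```java
import java.util.Set;

// Hypothetical model of "skip keywords during normalization".
// Not Lucene code: a plain Set stands in for terms flagged as keywords.
final class KeywordSkipSketch {

    // Simplified folding: a-umlaut -> a, o-umlaut -> o, u-umlaut -> u, sharp s -> ss.
    // The real filter applies additional rules that are omitted here.
    static String foldUmlauts(String term) {
        StringBuilder out = new StringBuilder(term.length() + 1);
        for (int i = 0; i < term.length(); i++) {
            char c = term.charAt(i);
            switch (c) {
                case '\u00e4': out.append('a'); break;   // a-umlaut
                case '\u00f6': out.append('o'); break;   // o-umlaut
                case '\u00fc': out.append('u'); break;   // u-umlaut
                case '\u00df': out.append("ss"); break;  // sharp s
                default: out.append(c);
            }
        }
        return out.toString();
    }

    // Terms in the keyword set pass through untouched; all others are folded.
    static String normalize(String term, Set<String> keywords) {
        if (keywords.contains(term)) {
            return term;
        }
        return foldUmlauts(term);
    }

    public static void main(String[] args) {
        Set<String> keywords = Set.of("b\u00e4r");                    // protect "bär"
        System.out.println(normalize("\u00e4pfel", keywords));       // prints "apfel"
        System.out.println(normalize("b\u00e4r", keywords));         // prints "bär"
    }
}
```

In Lucene itself, terms are typically flagged earlier in the analysis chain (for example with SetKeywordMarkerFilter), and a downstream filter can consult `KeywordAttribute.isKeyword()` before modifying the term. Whether this PR wires it up exactly that way is an assumption here.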
