Load a user-defined dictionary #663
Replies: 3 comments 1 reply
-
|
Hello everyone 👋 We just released a 🧪 prototype that allows customizing tokenization and we'd love your feedback. How to get the prototype?Using docker, use the following command:
From source, compile Meilisearch on the How to use the prototype?You can find all the details in the PR. Feedback and bug reporting when using this prototype are encouraged! Thanks in advance for your involvement. It means a lot to us ❤️ |
Beta Was this translation helpful? Give feedback.
-
|
Hello everyone 👋 We have just released the first RC (release candidate) of Meilisearch containing this new feature! You can test it by using: You are welcome to leave your feedback in this discussion. If you encounter any bugs, please report them here. 🎉 Official and stable release containing this change will be available on September 25th, 2023 |
Beta Was this translation helpful? Give feedback.
-
|
Hey folks 👋 v1.4.0 has been released! 🦓 You can now expand the default dictionaries and improve accuracy when using domain-specific terms. ✨ Note: 📚 https://www.meilisearch.com/docs/reference/api/settings#dictionary |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Why?
Some Languages are not always space separated (Chinese, Japanese, Korean...) forcing Meilisearch to rely on
common word dictionariesto segment a text into words, however, this strategy works only to segment common words and struggles when a text contains domain-specific words, technical words, or People Nouns.Because it is impossible to have an exhaustive list of words of all the possible domains or a complete list of Nouns, Meilisearch should allow the user to add their own list of words specialized for their use case.
Relative discussions/issues
#3590
#2289
#1879
#135
532#discussioncomment-5995300
#468
#160 (comment)
meilisearch/meilisearch#2879
Relative discord messages
https://discord.com/channels/1006923006964154428/1120407180823232672/1120407180823232672
Beta Was this translation helpful? Give feedback.
All reactions