Plural Stemmer for English #1750
Comments
@jgschis Thanks for bringing this up. We'd welcome a contribution from you. If not, we will look into this eventually.
Yes, I am doing it now...
I have ported the code to Go. This is the first time I have written a Go program, so I'm not sure if I've done this right: https://github.com/jgschis/pluralstem/blob/master/english/english.go
Hi, Should I
Feel free to add this to the Bleve project once done. I will make a pull request to add the entry point to this library to Bleve.
Hi, @abhinavdangeti
Hi, I am not changing Bleve yet. First I need to add my plural stemmer repository to https://github.com/blevesearch. How do I do that? Once it's added, I will make a pull request to update Bleve so that it calls the code in the pluralstem project.
+ This contribution was made by https://github.com/jgschis.
+ This has not been incorporated into the `en` analyzer.
+ The user will however be able to build a custom analyzer with the `en` components alongside this.
+ For: #1750
+ This contribution was made by https://github.com/jgschis.
+ This has not been incorporated into the `en` analyzer.
+ The user will however be able to build a custom analyzer with the `en` components alongside this.
+ For: #1750
+ Also: https://issues.couchbase.com/browse/MB-56359
Thanks
The Snowball and Porter stemmers are too aggressive for e-commerce search. For example, the word "dressing" gets stemmed to "dress", but dressing and dress are two different concepts and shouldn't be conflated.
I think we need a stemmer that just reduces a plural to its singular form. The OpenSearch project recently added a stemmer that does this:
https://github.com/opensearch-project/OpenSearch/blob/main/modules/analysis-common/src/main/java/org/opensearch/analysis/common/EnglishPluralStemFilter.java
If no one else wants to do this, I will add this to Bleve...
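To make the request concrete, here is a rough sketch of what "only reduce a plural to its singular" could look like in Go. This is not the OpenSearch algorithm and not the code from the pluralstem repository; `stemPlural` and its suffix rules are hypothetical and deliberately incomplete, just enough to show that "dresses" and "dress" collapse while "dressing" is left alone.

```go
package main

import (
	"fmt"
	"strings"
)

// stemPlural is a hypothetical, simplified plural-only stemmer.
// It only strips a few common English plural suffixes and never
// touches other endings such as "-ing" or "-ed".
func stemPlural(word string) string {
	w := strings.ToLower(word)
	n := len(w)
	// Leave short words and words that do not end in "s" untouched.
	if n < 4 || !strings.HasSuffix(w, "s") {
		return w
	}
	switch {
	case strings.HasSuffix(w, "ss"),
		strings.HasSuffix(w, "us"),
		strings.HasSuffix(w, "is"):
		// Usually singular already: "dress", "status", "analysis".
		return w
	case strings.HasSuffix(w, "ies"):
		// "berries" -> "berry"
		return w[:n-3] + "y"
	case strings.HasSuffix(w, "sses"),
		strings.HasSuffix(w, "shes"),
		strings.HasSuffix(w, "ches"),
		strings.HasSuffix(w, "xes"):
		// "dresses" -> "dress", "boxes" -> "box"
		return w[:n-2]
	default:
		// "shoes" -> "shoe"
		return w[:n-1]
	}
}

func main() {
	for _, w := range []string{"dresses", "dress", "dressing", "shoes", "berries", "boxes"} {
		fmt.Printf("%s -> %s\n", w, stemPlural(w))
	}
}
```

A real implementation needs an exception list and more careful rules (for example, this sketch mishandles words like "caches"), which is what the linked OpenSearch filter deals with.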