
Chinese Search #3

Closed
hustcc opened this issue Apr 27, 2016 · 4 comments

Comments

@hustcc

hustcc commented Apr 27, 2016

  1. Does it support Chinese search?
  2. Can I index text that is not in a database? All of the texts have IDs.

Thanks for your answer.

@nticaric
Contributor

I would say it even works for Chinese. The stemming step would simply do nothing, since the concept of stemming is not applicable to Chinese.
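
To make the "stemming does nothing" point concrete: a suffix-stripping stemmer only knows endings such as -ing or -ed, so a Chinese token never matches any rule and comes back unchanged. A toy illustration, not the library's actual stemmer:

```php
<?php
mb_internal_encoding('UTF-8');

// Toy suffix-stripping stemmer: it only knows a few English endings, so any
// token without them, including every Chinese token, passes through
// untouched. (Illustration only, not TNTSearch's real stemmer.)
function toyStem(string $word): string
{
    foreach (['ing', 'ed', 'es', 's'] as $suffix) {
        $sLen = mb_strlen($suffix);
        if (mb_strlen($word) > $sLen + 2 && mb_substr($word, -$sLen) === $suffix) {
            return mb_substr($word, 0, mb_strlen($word) - $sLen);
        }
    }
    return $word;
}

echo toyStem('searching'), PHP_EOL; // "search": English suffix stripped
echo toyStem('搜索'), PHP_EOL;      // "搜索": no rule applies, token unchanged
```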

If you take a look at the demo page, try to search for: 指原の乱
I don't know what this means or if this is even Chinese, but it gives me some results.

Regarding your second question, I am not sure what you mean. If you have text in your database, then yes, it can be searched. Where else would the text be, if not in the database?

@hustcc
Author

hustcc commented Apr 27, 2016

After posting this issue, I read the project's code.
I think it may need a Chinese tokenizer/analyzer, and then a Chinese stemmer. If I have time, maybe I can open a pull request. o̖⸜((̵̵́ ̆͒͟˚̩̭ ̆͒)̵̵̀)⸝o̗

As for the second question, I found the answer after reading the code (see the sketch below).

(Typed on an iPad, which is not convenient.)

Thanks for your reply.
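
For the record, the answer to the second question: documents do not have to come from a database query; they can be pushed into an existing index one at a time, each carrying its own ID. A minimal sketch based on the `getIndex()`/`insert()` calls shown in the TNTSearch README; the exact API in the version current at the time of this issue may differ:

```php
<?php
require __DIR__ . '/vendor/autoload.php';

use TeamTNT\TNTSearch\TNTSearch;

$tnt = new TNTSearch;
$tnt->loadConfig([
    'driver'  => 'sqlite',            // assumption: minimal config, adjust to your setup
    'storage' => __DIR__ . '/index/', // directory holding the *.index files
]);
$tnt->selectIndex('texts.index');     // an index created beforehand

$index = $tnt->getIndex();

// Anything with an ID and some text can go in; no source table is required.
$index->insert(['id' => 1, 'text' => 'standalone document one']);
$index->insert(['id' => 2, 'text' => 'standalone document two']);

$results = $tnt->search('standalone'); // returns the matching document IDs
```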

@nticaric
Contributor

I think the current tokenization process should also work for Chinese. It's a simple regular expression that breaks the text into words, and each word is then stemmed. Stemming applies to the Indo-European group of languages, not to Chinese, so for Chinese the stemming step is simply a no-op and leaves each word unchanged.
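
Worth spelling out what such a regex does to Chinese, though: Han characters count as letters (`\p{L}`) and written Chinese has no spaces, so an unsegmented sentence survives as one long token, and only an identical run will match it. A self-contained illustration; the pattern below is an assumption of roughly what a word tokenizer of this kind does, not the library's exact code:

```php
<?php
// Split on anything that is not a letter or a number, the way a typical
// word tokenizer for European languages does.
function tokenize(string $text): array
{
    return preg_split('/[^\p{L}\p{N}]+/u', $text, -1, PREG_SPLIT_NO_EMPTY);
}

print_r(tokenize('full text search'));
// ["full", "text", "search"]: three tokens, as expected

print_r(tokenize('全文搜索引擎'));
// ["全文搜索引擎"]: one token, because there are no spaces to split on
```

This is why a short query such as 搜索 never matches a document indexed as the single token 全文搜索引擎, which fits the test results reported below.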

@dryyun

dryyun commented Nov 26, 2016

Chinese is a bit more complex, and the test results are not good. I think having a dedicated Chinese tokenizer/analyzer would be better.
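
A common lightweight fix for CJK, short of a full dictionary-based segmenter such as jieba, is character-bigram tokenization: index overlapping two-character tokens so any two-character query can match inside a longer run. A sketch in plain PHP; wiring it into TNTSearch would mean implementing the library's tokenizer interface, whose exact shape is version-dependent, so treat the hook-up as an assumption:

```php
<?php
mb_internal_encoding('UTF-8');

// Character-bigram tokenizer: runs of Han characters are split into
// overlapping two-character tokens, so a query like 搜索 can match inside
// 全文搜索引擎. Non-CJK words are kept whole.
function bigramTokenize(string $text): array
{
    $tokens = [];
    $words  = preg_split('/[^\p{L}\p{N}]+/u', $text, -1, PREG_SPLIT_NO_EMPTY);

    foreach ($words as $word) {
        if (!preg_match('/\p{Han}/u', $word)) {
            $tokens[] = $word;                   // plain word, keep as one token
            continue;
        }
        $len = mb_strlen($word);
        if ($len === 1) {
            $tokens[] = $word;                   // single character stays as-is
            continue;
        }
        for ($i = 0; $i < $len - 1; $i++) {
            $tokens[] = mb_substr($word, $i, 2); // overlapping bigrams
        }
    }
    return $tokens;
}

print_r(bigramTokenize('全文搜索引擎'));
// ["全文", "文搜", "搜索", "索引", "引擎"]: the query 搜索 is now an indexed token
```

Bigrams inflate the index and can produce false positives across word boundaries, but they need no dictionary and make short Chinese queries match.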

sleepless pushed a commit to sleepless/tntsearch that referenced this issue Oct 25, 2017
* commit '1e3135846c74efe9818ef5517b8499b24c1f0eb5':
  remove changes
  removed
  removed default
  - order
  - order requests_foreign date de
  - most change from kaidl