New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Finding CJK text is failed #544
Comments
Finding Chinese doesn't work correctly too, below is my case. Firstly match all messages
and then search some Chinese
It works fine, but try to search Chinese substring
Any tips? |
This is because CJK word sementation is not supported in Xapian. |
I changed the title because this is not only related to Japanese but also Chinese. |
+1. I am having problems finding mail with Chinese text. |
Sadly, no much mu/mu4e can do about this.... perhaps ask the Xapian people? |
I think that this is not Xapian related issue, @djcb Could you please re-open this issue? |
@YukiChiba: oh, perhaps you can create a new one? And can you attach a raw email file with some body-text and a subject in Japanese,so we can test it? Thanks. |
Oh actually, just reopened this one -- anyway, having an example message would be great. |
@djcb Thank you for reopening this issue. The following is an example of message in Japanese:
I also uploaded this raw message to the following: |
The following query is succeeded to find:
But, the following is failed even though "漢字" appears in the body:
|
As discussed in issue #544, it's possible to search for CJK text, as long as you see the environment variable XAPIAN_CJK_NGRAM to non-empty with Xapian >= 1.2.8.
Thanks, I tried this, and
So closing, this once more. |
I set an environment variable XAPIAN_CJK_NGRAM to 1 in order to handle Japanese text for finding messages.
http://xapian.org/docs/sourcedoc/html/namespaceCJK.html
http://trac.xapian.org/ticket/180
However, it is failed to find Japanese text when no filed is specified. The followings are examples of output for a message containing "手続き" in subject and body:
I think this is related to this closed issue:
#123
The issue above is closed because of no reply for a long time. This bug is still remained.
I highly appreciate if mu supports Japanese text for finding messages.
Thank you.
The text was updated successfully, but these errors were encountered: