Navigation Menu

Skip to content

Commit

Permalink
doc: add complementary explain about similar search against Japanese …
Browse files Browse the repository at this point in the history
…documents
  • Loading branch information
komainu8 committed Jun 26, 2018
1 parent 5adf440 commit 267502b
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 0 deletions.
9 changes: 9 additions & 0 deletions doc/locale/ja/LC_MESSAGES/reference.po
Expand Up @@ -23092,6 +23092,15 @@ msgstr ""
"す。この場合は、 ``I migrated all XXX system!`` という内容のレコードがマッチ"
"します。"

msgid ""
"You should use ``TokenMecab`` tokenizer for similar search against Japanese "
"documents. Because ``TokenMecab`` will tokenize target documents to almost "
"words, it improves similar search precision."
msgstr ""
"日本語の文書に対して類似文書検索を行う場合は、 ``TokenMecab`` を使う方が良い"
"です。``TokenMecab`` は、対象の文書をほぼ単語にトークナイズするので、類似文書"
"検索の精度が向上するためです。"

msgid "Term extract operator"
msgstr "単語抽出演算子"

Expand Down
3 changes: 3 additions & 0 deletions doc/source/reference/grn_expr/script_syntax.rst
Expand Up @@ -1059,6 +1059,9 @@ The expression matches records that have similar content to ``I
migrated all Solr system!``. In this case, records that have ``I
migrated all XXX system!`` content are matched.

You should use ``TokenMecab`` tokenizer for similar search against Japanese documents.
Because ``TokenMecab`` will tokenize target documents to almost words, it improves similar search precision.

.. _script-syntax-term-extract-operator:

Term extract operator
Expand Down

0 comments on commit 267502b

Please sign in to comment.