Support extracting the first letter of English words in mixed Chinese/English text #292

Open
hulizhen opened this issue Jul 19, 2023 · 0 comments

hulizhen commented Jul 19, 2023

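Currently, when the text mixes Chinese and English, only the Chinese characters are reduced to their first letters; English words are kept whole.
For example: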

GET /tests/_analyze
{
  "text": "我是谁 where am i",
  "tokenizer": {
    "type": "pinyin",
    "limit_first_letter_length": 64,
    "keep_full_pinyin": false,
    "keep_first_letter": true,
    "keep_none_chinese": false,
    "keep_none_chinese_together": true,
    "keep_none_chinese_in_first_letter": true,
    "none_chinese_pinyin_tokenize": true,
    "lowercase": false,
    "keep_original": false
  }
}

This returns the token: wsswhereami
Could it support returning wsswai instead?
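
To make the requested behavior concrete, here is a minimal Python sketch of the expected output, not the plugin's actual implementation. The `PINYIN_FIRST_LETTER` table and the `first_letters` function are hypothetical names introduced only for illustration, and the table covers just the three characters from the example above.

```python
import re

# Hypothetical lookup covering only the characters in the example;
# a real implementation would rely on the plugin's full pinyin dictionary.
PINYIN_FIRST_LETTER = {"我": "w", "是": "s", "谁": "s"}


def first_letters(text: str) -> str:
    """Join the pinyin initial of each Chinese character with the
    initial of each ASCII word, preserving the original order."""
    out = []
    # Each match is either one run of ASCII letters/digits (a word)
    # or a single non-whitespace character (e.g. one Chinese character).
    for piece in re.findall(r"[A-Za-z0-9]+|\S", text):
        if piece in PINYIN_FIRST_LETTER:
            out.append(PINYIN_FIRST_LETTER[piece])
        elif piece[0].isascii() and piece[0].isalnum():
            # Keep only the first character of an English word.
            out.append(piece[0])
        # Anything else (punctuation, unmapped characters) is skipped here.
    return "".join(out)


print(first_letters("我是谁 where am i"))
```

With the input from the request above, this sketch yields `wsswai`, where the current tokenizer output is `wsswhereami`. Case is left untouched, matching `"lowercase": false` in the request.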
