Person-name segmentation problem #1015

Closed
Zhangdeli1993 opened this issue Nov 6, 2018 · 4 comments

Comments

@Zhangdeli1993

Example: "拨张三的电话" is segmented as "拨/v, 张/q, 三/m, 的/uj, 电话/n"
Example: "拨打张三的电话" is segmented as "拨打/v, 张三/nr, 的/uj, 电话/n"
Why is the person name 张三 segmented differently in the two sentences?

@HitomeRyuu

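It would help to say which segmentation method you are calling. If it is the perceptron segmenter, the model also has a large influence on the result.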

@Zhangdeli1993
Author

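It would help to say which segmentation method you are calling. If it is the perceptron segmenter, the model also has a large influence on the result.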

I am using the standard tokenizer, Java version.

@jzhao833

jzhao833 commented May 8, 2019

I ran into the same problem with the standard tokenizer. What is the cause?

@HitomeRyuu

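The standard tokenizer is built on a word graph (word lattice), and the word graph is built from the dictionary. First make sure you have called enableNameRecognize(true), then add the entry 张三 to the user dictionary.
(In the 1.7 release I am using, I could not find 张三 in any of the dictionaries. After adding it, both sentences recognize 张三 correctly.)
In the transition probability matrix, the count for v->nr is a little over 40,000 while v->q is only a little over 20,000, so once a 张三 entry tagged nr exists, 张 will certainly not be tagged as q.
Please correct me if I have got anything wrong.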
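
To make the "word graph is built from the dictionary" point concrete, here is a toy sketch in plain Java. It is not HanLP's code: the class name `WordLatticeSketch`, the fewest-tokens scoring rule, and the two-word starting dictionary are all assumptions made for illustration. HanLP's standard tokenizer scores paths with dictionary frequencies and a tag transition matrix, and also has a separate name-recognition step, none of which is modelled here. The only thing the sketch shows is that a multi-character token such as 张三 can appear in the output only if the dictionary provides an edge for it.

```java
import java.util.*;

public class WordLatticeSketch {
    // Toy segmenter (hypothetical, not HanLP's algorithm): pick the
    // segmentation with the fewest tokens. A single character is always an
    // edge in the lattice; a longer token is an edge only if it is in dict.
    static List<String> segment(String text, Set<String> dict) {
        int n = text.length();
        int[] best = new int[n + 1]; // best[i] = fewest tokens covering text[0..i)
        int[] prev = new int[n + 1]; // prev[i] = start index of the last token
        Arrays.fill(best, Integer.MAX_VALUE);
        best[0] = 0;
        for (int end = 1; end <= n; end++) {
            for (int start = 0; start < end; start++) {
                String candidate = text.substring(start, end);
                boolean isEdge = candidate.length() == 1 || dict.contains(candidate);
                if (isEdge && best[start] + 1 < best[end]) {
                    best[end] = best[start] + 1;
                    prev[end] = start;
                }
            }
        }
        // Walk the back-pointers from the end of the text to recover tokens.
        LinkedList<String> tokens = new LinkedList<>();
        for (int i = n; i > 0; i = prev[i]) {
            tokens.addFirst(text.substring(prev[i], i));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Assumed starting dictionary: 张三 is missing, as reported for 1.7.
        Set<String> dict = new HashSet<>(Arrays.asList("拨打", "电话"));
        System.out.println(segment("拨张三的电话", dict)); // prints [拨, 张, 三, 的, 电话]

        dict.add("张三"); // analogous to adding the entry to the user dictionary
        System.out.println(segment("拨张三的电话", dict)); // prints [拨, 张三, 的, 电话]
    }
}
```

Without the entry, the lattice contains no edge spanning 张三, so no scoring rule can produce it as one token. With the entry, the edge exists and the path through it can win.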

hankcs added a commit that referenced this issue May 25, 2019
hankcs added a commit that referenced this issue Jan 10, 2020