What are the differences between the segmentation methods? Could you explain them? #80

Closed
birdycn opened this issue Nov 28, 2020 · 1 comment

Comments


birdycn commented Nov 28, 2020

cut search hm string
Hello, is there a detailed explanation of how these methods differ?

vcaesar (Member) commented Dec 2, 2020

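Cut tries to split the sentence as accurately as possible and is suited to text analysis;
CutAll uses full mode: it scans out every word in the sentence that can form a dictionary word. It is very fast but cannot resolve ambiguity;
The HMM parameter controls whether the HMM model is used;
CutSearch() is search-engine mode: building on accurate mode, it splits long words again to improve recall, which suits search-engine tokenization.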

@vcaesar vcaesar closed this as completed Dec 2, 2020
@vcaesar vcaesar added this to the v0.70.0 milestone Dec 2, 2020
@go-ego go-ego locked as off-topic and limited conversation to collaborators Mar 13, 2022