Where does the character set come from? The code throws an error #1

Closed
ljtlrh opened this issue Sep 11, 2019 · 2 comments

Comments

ljtlrh commented Sep 11, 2019

vocab_file = 'wx.chars'  # character set

ky941122 commented Sep 11, 2019

vocab_file = 'wx.chars'  # character set

You need to compile kenlm beforehand. This is all covered at https://kexue.fm/archives/6920 , which is worth reading before you use the code.

Owner

bojone commented Sep 12, 2019

wx.chars is the name of the file the character set will be saved to; kenlm's count_ngrams generates it for you.
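
For anyone hitting the same error, here is a rough sketch of how the file could come into existence, not the repository's actual code. It assumes the compiled `count_ngrams` binary sits in the working directory and that its `--write_vocab_list` option writes the vocabulary as null-delimited strings, which is my understanding of kenlm's help text. The names `wx.corpus` and `wx.ngrams`, the order `4`, and both helper functions are hypothetical and used only for illustration.

```python
import shlex


def build_count_ngrams_cmd(corpus_file, order, vocab_file, ngram_file,
                           binary="./count_ngrams"):
    """Build the shell command that asks count_ngrams to write the vocab list.

    Assumption: the binary reads the corpus on stdin and writes the n-gram
    counts to stdout, so both are redirected to files here.
    """
    return "{} -o {} --write_vocab_list {} <{} >{}".format(
        shlex.quote(binary),
        int(order),
        shlex.quote(vocab_file),
        shlex.quote(corpus_file),
        shlex.quote(ngram_file),
    )


def read_vocab_list(vocab_file):
    """Read a vocabulary list, assuming null-delimited UTF-8 entries."""
    with open(vocab_file, "rb") as f:
        raw = f.read()
    # Drop empty pieces, e.g. the one left after a trailing null byte.
    return [tok.decode("utf-8") for tok in raw.split(b"\x00") if tok]


if __name__ == "__main__":
    # Only prints the command; run it in a shell once kenlm is compiled.
    print(build_count_ngrams_cmd("wx.corpus", 4, "wx.chars", "wx.ngrams"))
```

So `vocab_file` is an output path rather than an input you have to prepare. If `wx.chars` is missing, the likely cause is that the `count_ngrams` step never ran, for example because kenlm was not compiled.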

@bojone bojone closed this as completed Oct 9, 2019