Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

读取cn.skipgram.bin.tar.gz错误 #9

Open
Jacky-Chiu opened this issue Nov 20, 2017 · 6 comments
Open

读取cn.skipgram.bin.tar.gz错误 #9

Jacky-Chiu opened this issue Nov 20, 2017 · 6 comments

Comments

@Jacky-Chiu
Copy link

model = gensim.models.KeyedVectors.load_word2vec_format(fdir + 'cn.skipgram.bin.tar.gz', binary=True)

ValueError: invalid literal for int() with base 10: 'cn.skipgram.bin\x00\x00。。。。。。

你好,读取cn.skipgram.bin.tar.gz文件出现这个错误,查了很久都不知道原因

@Senmumu
Copy link

Senmumu commented Jan 23, 2018

解压后再试试?

@Senmumu
Copy link

Senmumu commented Jan 23, 2018

can you unzip this file and try again?

@to-shimo
Copy link
Owner

请参考Mikolov的word2vec的源码读取方式

@hy9be
Copy link

hy9be commented Mar 31, 2018

I got 'utf-8' codec can't decode bytes in position 96-97: unexpected end of data when I try to load the unzipped bin file.

@liyonglion
Copy link

我使用gensim 加载,未解压的模型,报错:utf-8' codec can't decode bytes in position 96-97: unexpected end of data。如何加载这个模型呢?

@yydai
Copy link

yydai commented May 20, 2018

Using like this can work for me:

word2vec = gensim.models.KeyedVectors.load_word2vec_format( 'XXX', binary=True, unicode_errors='ignore')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants