Encoding Error #27

Closed
ProxyCausal opened this issue Jun 10, 2018 · 2 comments

@ProxyCausal

I get the error below when I run sif_embedding.py, but I think the problem is in data_io.
How are the supplied files meant to be used, besides running the demo? I'd like to use SIF to evaluate the similarity of sentences I supply. No training is needed if I just use the GloVe embeddings, correct? What are the neural nets in src used for, then?

Thanks!

File "C:\Users\gdev\git\SIF\examples\sif_embedding.py", line 13, in <module>
    (words, We) = data_io.getWordmap(wordfile)
  File "../src\data_io.py", line 12, in getWordmap
    lines = f.readlines()
  File "C:\ProgramData\Anaconda3\lib\encodings\cp1252.py", line 23, in decode
    return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x90 in position 962: character maps to <undefined>
@ProxyCausal

After switching the file encoding to UTF-8, I get a different error:

File "../src\data_io.py", line 18, in getWordmap
    v.append(float(i[j]))
ValueError: could not convert string to float: '.'
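
For readers who hit the same two tracebacks, here is a minimal sketch of a more tolerant loader. It is not the repository's `getWordmap` and not necessarily the fix from #19. The function name `load_word_vectors` and the `dim` parameter are hypothetical. It rests on two assumptions: the word file is UTF-8 text, and the `ValueError` comes from a token that itself contains spaces (for example `. . .`), so that splitting on whitespace puts part of the token where a number is expected.

```python
def load_word_vectors(path, dim):
    """Read a GloVe-style text file.

    Returns (words, vectors): `words` maps each token to its row index,
    `vectors` is a list of rows, each a list of `dim` floats.
    `dim` (the embedding size, a positive integer) is a hypothetical
    parameter; it is not part of the original data_io.getWordmap.
    """
    words = {}
    vectors = []
    # Pass the encoding explicitly so Python does not fall back to the
    # platform default (cp1252 in the traceback above).
    with open(path, encoding="utf-8") as f:
        for line in f:
            fields = line.rstrip("\n").split(" ")
            if len(fields) < dim + 1:
                continue  # blank or truncated line
            # Assumption: the last `dim` fields are the numbers; whatever
            # precedes them is the token, which may contain spaces.
            token = " ".join(fields[:-dim])
            try:
                vector = [float(x) for x in fields[-dim:]]
            except ValueError:
                continue  # skip lines whose numeric part is malformed
            words[token] = len(vectors)
            vectors.append(vector)
    return words, vectors
```

A call would look like `words, We = load_word_vectors(wordfile, 300)`, where 300 stands for whatever dimension the embedding file actually uses. Wrapping `We` in `numpy.array` gives an array in place of the list of rows, if the rest of the pipeline expects one.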

@ProxyCausal

Sorry, this was already solved in post #19.
