You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A couple improvement to how pre-trained embeddings are handled is required.
1. Out-source loading of vectors
Currently, the code to load pre-trained embeddings was written by me. That means its likely fragile and slow. See if I can load embeddings using Gensim which is likely to be faster and more reliable.
Note, this might actually solve the problem below.
2. Handle binary or plain text format
Currently, pre-trained embeddings in binary format (.bin) must be manually converted to a plaint text format (.txt) to be used with saber. This is an unnecessary additional step imposed on the user. Automatically detect if the embeddings are in binary or plain text format, and convert from binary to plain text automatically if necessary. To fix:
- [ ] Determine if embeddings are binary or plain text - [ ] Use Gensim to convert from binary to plain text if necessary
The text was updated successfully, but these errors were encountered:
A couple improvement to how pre-trained embeddings are handled is required.
1. Out-source loading of vectors
Currently, the code to load pre-trained embeddings was written by me. That means its likely fragile and slow. See if I can load embeddings using Gensim which is likely to be faster and more reliable.
2. Handle binary or plain text formatCurrently, pre-trained embeddings in binary format (.bin
) must be manually converted to a plaint text format (.txt
) to be used with saber. This is an unnecessary additional step imposed on the user. Automatically detect if the embeddings are in binary or plain text format, and convert from binary to plain text automatically if necessary. To fix:- [ ] Determine if embeddings are binary or plain text- [ ] Use Gensim to convert from binary to plain text if necessaryThe text was updated successfully, but these errors were encountered: