Word2Bits benchmark #1991
Labels
difficulty medium
Medium issue: requires good gensim understanding & Python skills
performance
Issue related to performance (in HW meaning)
testing
Issue related with testing (code, documentation, etc)
Description
The paper "Word2Bits - Quantized Word Vectors" by Maximilian Lam looks pretty interesting: it appears possible to apply "quantization" to the current w2v algorithm and get a memory-compact representation without sacrificing quality.
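To make the memory argument concrete, here is a minimal NumPy sketch of 1-bit quantization applied to an already trained embedding matrix. It is a simplification and not the paper's method: as I understand it, the paper introduces quantization during training, while this only quantizes after the fact. The function names, the random stand-in matrix, and the `scale` value of 1/3 are all illustrative assumptions.

```python
import numpy as np

def quantize_1bit(vectors, scale=1.0 / 3.0):
    """Map every component to +scale or -scale by its sign (zero maps to +scale)."""
    return np.where(vectors >= 0, scale, -scale).astype(np.float32)

def pack_signs(vectors):
    """Keep only the sign bit of each component, 8 components per byte."""
    return np.packbits(vectors >= 0, axis=1)

def unpack_signs(packed, dim, scale=1.0 / 3.0):
    """Rebuild the quantized float vectors from the packed sign bits."""
    bits = np.unpackbits(packed, axis=1, count=dim)
    return np.where(bits == 1, scale, -scale).astype(np.float32)

# Random stand-in for a trained embedding matrix (1000 words, 128 dimensions).
rng = np.random.default_rng(0)
full = rng.standard_normal((1000, 128)).astype(np.float32)

quantized = quantize_1bit(full)
packed = pack_signs(full)
restored = unpack_signs(packed, dim=128)

print("float32 bytes:", full.nbytes)
print("packed bytes: ", packed.nbytes)
```

With one bit per component instead of 32, the packed matrix is 32 times smaller than the float32 original. Whether quality survives is exactly what the benchmark has to show.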
ToDo
accuracy
method (classical approach)
If the benchmark shows good enough results, this will become part of Gensim.
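One cheap way to start on the accuracy side is to check how many nearest neighbours a quantized matrix shares with the full-precision one. The sketch below is only a proxy under stated assumptions, not the evaluation used in the paper or an existing Gensim API: `top_k_neighbors` and `neighbor_overlap` are hypothetical helpers, the input is a random stand-in matrix, and sign quantization is used as the simplest possible scheme.

```python
import numpy as np

def top_k_neighbors(vectors, k):
    """Indices of the k most cosine-similar rows for every row (self excluded)."""
    normed = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = normed @ normed.T
    np.fill_diagonal(sims, -np.inf)  # a word is not its own neighbour
    return np.argsort(-sims, axis=1)[:, :k]

def neighbor_overlap(a, b, k=10):
    """Mean fraction of top-k neighbours shared between two embedding matrices."""
    na, nb = top_k_neighbors(a, k), top_k_neighbors(b, k)
    shared = [len(set(x) & set(y)) for x, y in zip(na, nb)]
    return float(np.mean(shared)) / k

# Random stand-in for trained vectors; real vectors would come from a trained model.
rng = np.random.default_rng(0)
full = rng.standard_normal((500, 64)).astype(np.float32)
quantized = np.where(full >= 0, 1.0, -1.0).astype(np.float32)

print("overlap with itself:   ", neighbor_overlap(full, full))
print("overlap with quantized:", neighbor_overlap(full, quantized))
```

A real benchmark would replace the random matrix with trained vectors and add a task-level score such as word analogy or similarity accuracy, reported next to the memory footprint.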