ViCon comprises pairs of synonyms and antonyms across word classes, thus offering data to distinguish between similarity and dissimilarity. ViSim-400 provides degrees of similarity across five semantic relations, as rated by human judges.
The two datasets are verified through standard co-occurrence and neural network models, showing results comparable to the respective English datasets
📜 Papers
📁 Word Vectors
- vietnlp/etnlp - A toolkit to evaluate, extract, and visualize multiple embeddings
- Kyubyong/wordvectors
resource
- facebookresearch/fastText
resource
- sonvx/word2vecVN
resource