Batch IS NOT Heavy: Learning Word Representations From All Samples

shubhampachori12110095/AllVec

AllVec

Training algorithm for learning word embeddings from all samples. Use the GloVe code to build the co-occurrence statistics before training.
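As a rough illustration of what the co-occurrence statistics look like on disk, the sketch below reads records in the binary layout written by GloVe's `cooccur` tool (two `int` word indices followed by a `double` weight). The names `CoocRecord` and `read_cooccurrences` are hypothetical and are not taken from this repository; whether AllVec parses the file exactly this way is an assumption.

```cpp
#include <cassert>
#include <cstdio>
#include <string>
#include <vector>

// Record layout assumed to match GloVe's cooccur output:
// two word indices (vocabulary ranks) and a co-occurrence weight.
struct CoocRecord {
    int word1;
    int word2;
    double val;
};

// Hypothetical helper: reads every record from a GloVe-style binary
// co-occurrence file. Returns an empty vector if the file cannot be opened.
std::vector<CoocRecord> read_cooccurrences(const std::string& path) {
    assert(!path.empty());
    std::vector<CoocRecord> records;
    std::FILE* fin = std::fopen(path.c_str(), "rb");
    if (fin == nullptr) {
        return records;
    }
    CoocRecord rec;
    // fread returns the number of complete records read; stop at EOF
    // or on a trailing partial record.
    while (std::fread(&rec, sizeof(CoocRecord), 1, fin) == 1) {
        records.push_back(rec);
    }
    std::fclose(fin);
    return records;
}
```

The records are read as raw structs, so the file must have been produced on a machine with the same endianness and struct layout as the one reading it.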

word_vectors_v1 uses gradient descent, and word_vectors_v2 uses Newton's method.
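To make the difference between the two update rules concrete, here is a minimal sketch on a toy one-parameter weighted least-squares problem. This is a generic illustration, not AllVec's actual objective or code: `ToyProblem`, `gradient_step`, and `newton_step` are hypothetical names introduced only for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy objective (hypothetical, not AllVec's loss):
// f(x) = sum_i w[i] * (a[i] * x - y[i])^2 for one scalar parameter x.
struct ToyProblem {
    std::vector<double> w, a, y;
};

// First derivative of f at x.
double gradient(const ToyProblem& p, double x) {
    double g = 0.0;
    for (std::size_t i = 0; i < p.w.size(); ++i) {
        g += 2.0 * p.w[i] * p.a[i] * (p.a[i] * x - p.y[i]);
    }
    return g;
}

// Second derivative of f; constant in x because f is quadratic.
double hessian(const ToyProblem& p) {
    double h = 0.0;
    for (std::size_t i = 0; i < p.w.size(); ++i) {
        h += 2.0 * p.w[i] * p.a[i] * p.a[i];
    }
    return h;
}

// Gradient descent: move against the gradient, scaled by a learning rate.
double gradient_step(const ToyProblem& p, double x, double lr) {
    return x - lr * gradient(p, x);
}

// Newton's method: scale the step by the inverse second derivative,
// so no learning rate is needed.
double newton_step(const ToyProblem& p, double x) {
    const double h = hessian(p);
    assert(h > 0.0);  // requires positive curvature
    return x - gradient(p, x) / h;
}
```

On a quadratic objective like this one, a single Newton step lands on the minimizer, while gradient descent needs many steps and a suitable learning rate.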

Example command:

./word_vectors_v2 -word-occu $co-occurrence file$ -read-vocab $vocab_file$ -output ./vectors.bin -w0 350 -size 50 -iter 50 -binary 1 -threads 16 -thro 0.8 -shift -0.5
