Loss function is not squared in glove_cython? #22

ducovrossem · 2014-12-17T17:47:42Z

Not sure if I am missing something here but thought I'd ask for clarification - the loss function is not squared.

loss = entry_weight * (prediction - c_log(count))

Also, does this implementation not generate separate vectors for when a word is used as context?

maciejkula · 2014-12-17T17:51:12Z

Bad variable name. I think this is the gradient of the loss function.
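A minimal sketch of why that reading is plausible, assuming the standard GloVe per-pair objective `weight * (prediction - log(count)) ** 2`. The function names below are hypothetical and not taken from glove_cython, and `max_count=100.0` and `alpha=0.75` are assumed values for the weighting function. Under those assumptions, the quantity stored in `loss` is the derivative of the squared loss with respect to a bias term, up to a constant factor of 2 that can be folded into the learning rate.

```python
import math


def entry_weight(count, max_count=100.0, alpha=0.75):
    # GloVe-style weighting; max_count and alpha are assumed values.
    return min(1.0, count / max_count) ** alpha


def predict(w, c, b_w, b_c):
    # Dot product of the two vectors plus both bias terms.
    return sum(wi * ci for wi, ci in zip(w, c)) + b_w + b_c


def squared_loss(w, c, b_w, b_c, count):
    # The per-pair objective that the issue expected to see in the code.
    error = predict(w, c, b_w, b_c) - math.log(count)
    return entry_weight(count) * error ** 2


def error_term(w, c, b_w, b_c, count):
    # Same expression as the variable named `loss` in the issue.
    return entry_weight(count) * (predict(w, c, b_w, b_c) - math.log(count))


def numerical_bias_gradient(w, c, b_w, b_c, count, eps=1e-6):
    # Central finite difference of the squared loss with respect to b_w.
    up = squared_loss(w, c, b_w + eps, b_c, count)
    down = squared_loss(w, c, b_w - eps, b_c, count)
    return (up - down) / (2 * eps)


w, c = [0.1, -0.2, 0.3], [0.05, 0.4, -0.1]
b_w, b_c, count = 0.01, -0.02, 12.0

analytic = 2 * error_term(w, c, b_w, b_c, count)
numeric = numerical_bias_gradient(w, c, b_w, b_c, count)
print(analytic, numeric)
```

The gradient with respect to a word vector is the same error term multiplied by the other vector, so one scalar can be reused for every parameter update in the pair.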

Yes, this implementation does not generate separate vectors for context words. This makes it more memory efficient, as I can use an upper triangular matrix for the co-occurrence matrix (and only one matrix of vectors).
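To illustrate the memory argument, here is a rough sketch (not the library's actual code; `upper_triangular_counts` is a hypothetical helper). With a single set of vectors the prediction for the pair (i, j) equals the prediction for (j, i), so both counts can be merged into one entry with row <= column.

```python
from collections import defaultdict


def upper_triangular_counts(pairs):
    # Fold each (i, j, value) observation into the entry with row <= col,
    # so a symmetric co-occurrence matrix needs only its upper triangle.
    counts = defaultdict(float)
    for i, j, value in pairs:
        row, col = (i, j) if i <= j else (j, i)
        counts[(row, col)] += value
    return dict(counts)


# (word id, context id, weighted count) observations; values are made up.
observed = [(0, 2, 1.0), (2, 0, 0.5), (1, 3, 2.0)]
merged = upper_triangular_counts(observed)
print(merged)
```

The trade-off is that the model can no longer distinguish a word's role as target from its role as context.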

ducovrossem · 2014-12-17T18:43:06Z

Clear - thanks maciejkula.

piskvorky · 2015-11-10T11:02:20Z

FYI @maciejkula :
@dselivanov reports that this particular optimization (ignoring the context vectors) leads to a massive loss of accuracy:
http://rare-technologies.com/making-sense-of-word2vec/#comment-488

maciejkula · 2015-11-10T11:52:22Z

Interesting, I'll definitely have a look.

Incidentally, I think my more recent project (https://github.com/lyst/lightfm) should work really well on word embeddings (it uses a fancy learning-to-rank approach). I need to try it out.

ducovrossem closed this as completed Dec 17, 2014
