You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm going to use the python code of skip-gram (sg) in my research but recognize difference between the implementation and the original in Mikolov's paper.
The detail of the difference will be mentioned below.
Please let me know if this difference is intentionally or just a bug.
as a result, we're going to optimize P( input / context ) while, in the original paper, they tried to optimize P( context / input) in skip-gram architecture.
The text was updated successfully, but these errors were encountered:
@truythu169 I have a feeling you have found a bug but let's wait for a confirmation from the maintainers since this would be a major one conceptually 😅
No, zero chance there's a bug in the word2vec algo.
@AMR-KELEG The question of context-vs-target direction comes up a lot, check the mailing list. I remember @gojomo answered it repeatedly, although I cannot find his great answers now. @gojomo can you add it to the Gensim FAQ?
I recall answering this a few times, and though I can't find my answers at the moment, it was @piskvorky first at: #300 (comment)
While this has come up a few times – like confusion about the proper handling of averaging/dividing CBOW vectors/gradients – it's still very insider, for people obsessing over the source – I wouldn't assign it a slot in the overall FAQ. Maybe a new "implementation details FAQ"?
Problem description
I'm going to use the python code of skip-gram (sg) in my research but recognize difference between the implementation and the original in Mikolov's paper.
The detail of the difference will be mentioned below.
Please let me know if this difference is intentionally or just a bug.
Steps/code/corpus to reproduce
code in:
https://github.com/RaRe-Technologies/gensim/blob/f97d0e793faa57877a2bbedc15c287835463eaa9/gensim/models/word2vec.py#L399-L414
...
https://github.com/RaRe-Technologies/gensim/blob/f97d0e793faa57877a2bbedc15c287835463eaa9/gensim/models/word2vec.py#L443-L456
The text was updated successfully, but these errors were encountered: