Skip to content
This repository has been archived by the owner on Mar 19, 2024. It is now read-only.

Learn words representation with CBOW plus position-weights #445

Open
loretoparisi opened this issue Feb 26, 2018 · 12 comments
Open

Learn words representation with CBOW plus position-weights #445

loretoparisi opened this issue Feb 26, 2018 · 12 comments

Comments

@loretoparisi
Copy link

As for the recent paper "Learning Word Vectors for 157 Languages", the model CBOW is used with position dependent weights. Using that the new pre-trained model were produced.
Is it possible to train a unsupervised model with CBOW in this version of fastText using the same approach with position weights?

@luthfianto
Copy link

Also, which command in fasttext corresponds to the positional-weighting?

@matanox
Copy link

matanox commented Mar 2, 2018

I don't think this is implemented in the publicly available codebase.

@loretoparisi
Copy link
Author

@matanster you are right, it is not, at least not yet, while the most two recent papers show models that has been trained FastText with position dependent weights.

@ghost
Copy link

ghost commented Jul 4, 2018

Do you know if there are any updates of positional-weighting in fasttext base code?

@loretoparisi
Copy link
Author

@omriFdna so far I'm not aware of a version that implements the positional weights bow, but I will have a look maybe someone did...

@ghost
Copy link

ghost commented Jul 8, 2018

@loretoparisi any luck?

@loretoparisi
Copy link
Author

@omriFdna not yet so far :

@loretoparisi
Copy link
Author

A dataset with position weights trained with CBOW has been recently released for Wikipedia and Common Crawl - https://fasttext.cc/docs/en/crawl-vectors.html

The model parameters were:

DIM NGRAM WS NEG
300 5 5 10

It would be great to have this as training option as well.

@ghost
Copy link

ghost commented Oct 11, 2018

I agree, they also report in their papers that position weights improves the performance, I wish it was part of the training options.

@adam2326
Copy link

adam2326 commented Oct 11, 2018 via email

@Witiko
Copy link

Witiko commented Aug 4, 2020

I am attempting to add this feature to Gensim, see piskvorky/gensim#2905.

@Witiko
Copy link

Witiko commented Jan 14, 2022

The work from piskvorky/gensim#2905 has been accepted as a journal paper that is to appear in J.UCS 28:2. To conduct experiments for the paper, I have produced the high-level PInE library that uses a fork of Gensim 3.8.3 for model training. Perhaps the PInE library can serve as an inspiration for an implementation to facebook's fastText and to current Gensim. 💡

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

6 participants