Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Topic worth investigating over: 'vector rejection' #595

Open
shirish93 opened this issue Jan 28, 2016 · 1 comment
Open

Topic worth investigating over: 'vector rejection' #595

shirish93 opened this issue Jan 28, 2016 · 1 comment
Labels
difficulty medium Medium issue: required good gensim understanding & python skills feature Issue described a new feature wishlist Feature request

Comments

@shirish93
Copy link

@benSchmidt has written an interesting blog post on the use of a method he calls 'vector rejection' to separate words with ambiguous meanings.

During experimentation with a Nepali news corpus dataset, I found his method to be more useful to discard unwanted vectors than the existing method with most_similar.

I have recreated his method (which he has in R) in this gist and have been working with it for the last few days. In my (admittedly limited) series of experiments it seems to have quite a lot of value. Yoav Goldberg has a twitter thread about the operation/post here.

I bring this up because someone might want to look it over/possibly see if this aligns with the project? Please close the issue if you believe otherwise.

edit: correct link.

@piskvorky
Copy link
Owner

This is very interesting, thanks for the tip @shirish93 !

@menshikh-iv menshikh-iv added feature Issue described a new feature difficulty medium Medium issue: required good gensim understanding & python skills wishlist Feature request labels Oct 2, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
difficulty medium Medium issue: required good gensim understanding & python skills feature Issue described a new feature wishlist Feature request
Projects
None yet
Development

No branches or pull requests

3 participants