Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sign of lsi_vector #2348

Closed
dnabanita7 opened this issue Jan 22, 2019 · 1 comment
Closed

Sign of lsi_vector #2348

dnabanita7 opened this issue Jan 22, 2019 · 1 comment

Comments

@dnabanita7
Copy link

Linux-4.15.0-43-generic-x86_64-with-debian-buster-sid
Python 3.6.5 |Anaconda, Inc.| (default, Apr 29 2018, 16:14:56)
[GCC 7.2.0]
NumPy 1.15.0
SciPy 1.1.0
gensim 3.4.0
FAST_VERSION 1

The notebook is on Similarity_Queries

The original notebook-log of lsi 2nd dim:
2019-01-22 08:49:18,871 : INFO : topic #1(2.542): 0.623*"graph" + 0.490*"trees" + 0.451*"minors" + 0.274*"survey" + -0.167*"system" + -0.141*"eps" + -0.113*"human" + 0.107*"response" + 0.107*"time" + -0.072*"interface"

My notebook-log of lsi 2nd dim:
2019-01-22 08:50:27,166 : INFO : topic #1(2.542): -0.623*"graph" + -0.490*"trees" + -0.451*"minors" + -0.274*"survey" + 0.167*"system" + 0.141*"eps" + 0.113*"human" + -0.107*"response" + -0.107*"time" + 0.072*"interface"

that is why the vector is affected.
the original notebook-log of lsi vector is:
[(0, -0.4618210045327156), (1, -0.07002766527899992)]

while I get the complete opposite:
[(0, 0.46182100453271535), (1, 0.07002766527900031)]

I understand that because the 2nd dim of lsi_vec is complete opposite of my vec. But,it doesn't affect the cosine similarity sims,how?

@menshikh-iv
Copy link
Contributor

the topic just reverse all signs, that's not an issue: cosine_dist(u, v) == cosine_dist(-u, -v)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants