Formula for calculating USE cosine similarities: dividing by π #6

jxmorris12 · 2019-11-24T17:39:35Z

Hi,

I see you are actually using the scaled angular distance between the two embeddings instead of the raw cosine similarity score.

https://github.com/jind11/TextFooler/blob/master/attack_classification.py#L32

After the call to tf.acos, do you not need to divide by π to scale the value between 0 and 1? That is the practice recommended in the Universal Sentence Encoder paper, section 5. Did you forget to divide by pi or am I missing something?


jind11 · 2019-11-24T18:16:50Z

Hi, tf.acos already accounts for the π factor. This code snippet is actually from the official USE example.

jxmorris12 · 2019-11-24T19:55:12Z

Hi again,

I'm not getting the same results.

>>> tf.acos(-1.0).numpy()
3.1415927

Looks like it definitely needs to be divided by pi to fit in the range [0,1]. Can you confirm your tensorflow version behaves differently?
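To make the range concrete, here is a minimal sketch in plain Python, with `math.acos` standing in for `tf.acos`. The helper name `angular_distance` is hypothetical and not taken from the TextFooler code; it only illustrates that the arccosine returns an angle in radians in [0, π], so it has to be divided by π to land in [0, 1].

```python
import math


def angular_distance(cos_sim: float) -> float:
    """Angle between two vectors, scaled to [0, 1] by dividing by pi.

    Hypothetical helper for illustration only.
    """
    # Clip to guard against floating-point values slightly outside [-1, 1],
    # which would otherwise make acos raise a ValueError.
    clipped = max(-1.0, min(1.0, cos_sim))
    return math.acos(clipped) / math.pi


# Compare the raw angle (radians) with the scaled distance.
for c in (1.0, 0.0, -1.0):
    print(c, math.acos(c), angular_distance(c))
```

Without the division, the raw angle for opposite vectors is π rather than 1, which matches the `tf.acos(-1.0)` output above.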

jind11 · 2019-11-27T23:16:48Z

Hi, sorry for the late response. I was using TensorFlow 1.4, but after double-checking I also found that tf.acos(1) = 0, tf.acos(0) = 1.57, and tf.acos(-1) = 3.14, so the final cos_sim value is not constrained between -1 and 1. However, self.sim_scores still increases with clip_cosine_similarities, so it is only a matter of which threshold I should use. I am thinking of using clip_cosine_similarities directly as the similarity score, without tf.acos, which matches my intuition. What do you think? Thank you for pointing this out!

jxmorris12 · 2019-11-29T18:12:55Z

Hi. Either option makes sense to me: keeping the acos and dividing by pi, or just using the raw similarity. It shouldn't affect the ordering of examples; it only affects the threshold.
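A rough sketch of why the ordering is unaffected, again in plain Python rather than TensorFlow. The function name, the sample scores, and the 0.7 threshold are made up for illustration; the formula is the angular similarity 1 - arccos(x)/π from the USE paper. Because arccos is strictly decreasing on [-1, 1], this score is strictly increasing in the cosine similarity, so rankings agree and a threshold on one scale maps to an equivalent threshold on the other.

```python
import math


def angular_similarity(cos_sim: float) -> float:
    """1 - arccos(cos_sim) / pi, which lies in [0, 1]. Hypothetical helper."""
    clipped = max(-1.0, min(1.0, cos_sim))
    return 1.0 - math.acos(clipped) / math.pi


# Made-up cosine similarities between an original sentence and candidates.
cos_scores = [0.95, 0.20, -0.40, 0.70]
ang_scores = [angular_similarity(c) for c in cos_scores]

# Rank candidate indices under each score; the two rankings should match.
rank_by_cos = sorted(range(len(cos_scores)), key=lambda i: cos_scores[i])
rank_by_ang = sorted(range(len(ang_scores)), key=lambda i: ang_scores[i])
print(rank_by_cos == rank_by_ang)

# A threshold on one scale converts to the other by applying the same map.
cos_threshold = 0.7  # illustrative value, not from the TextFooler code
ang_threshold = angular_similarity(cos_threshold)
kept_by_cos = [c >= cos_threshold for c in cos_scores]
kept_by_ang = [a >= ang_threshold for a in ang_scores]
print(kept_by_cos == kept_by_ang)
```

The practical consequence is that a threshold tuned on one scale has to be converted, not reused as-is, when switching to the other.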

jxmorris12 closed this as completed Nov 24, 2019

jxmorris12 reopened this Nov 24, 2019

jind11 closed this as completed Mar 21, 2020
