[ZSL] Results doesn't match hugging face demo #44
Comments
Seconded. I was about to post a similar issue. The results are inaccurate a lot of the time. On some images it even gives inverted results, classifying X as Y and Y as X. It's not clear why this is happening. This library has great potential, so any help is much appreciated, @monatis. Thank you!
I think it's because the tokenization strategy is different from that of the Hugging Face CLIP tokenizer.
Fixed in #56
Thanks for the fix, @monatis! However, I'm still getting inaccurate results. For example, when trying to determine if it's a man or a woman, it almost always classifies women as men. Also, strangely enough, in some cases, the score of the text "man" is higher for some images of women than for some images of men! Please take a look at the example below:
[Three example images, each shown with its "Expectation" and "Result"; images not preserved.]
As you can see in this example, the photo of the man got 0.9785 as the score for the text "man", while the 2 photos of women got 0.9889 and 0.9860, which is very weird.
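One possible reading of the cross-image comparison above, assuming the scores come from a softmax over the candidate labels computed separately for each image (as in typical CLIP zero-shot classification; the thread does not confirm how this library computes them): a score for "man" then reflects only how far "man" is ahead of the other labels for that one image, not how strongly the image matches "man" in absolute terms. The sketch below uses made-up similarity values and a hypothetical `zero_shot_scores` helper to show that an image with a lower raw similarity to "man" can still get a higher score. It does not explain why women would be classified as men in the first place.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

LABELS = ["man", "woman"]

# Hypothetical cosine similarities between each image and the two prompts.
# These numbers are invented for illustration; they are not from the thread.
similarities = {
    "photo_a": [0.26, 0.22],  # higher raw similarity to "man", gap of 0.04
    "photo_b": [0.21, 0.16],  # lower raw similarity to "man", gap of 0.05
}

# Assumption: a CLIP-style scale factor is applied before the softmax.
LOGIT_SCALE = 100.0

def zero_shot_scores(sims, scale=LOGIT_SCALE):
    """Per-image softmax over the label similarities (hypothetical helper)."""
    return softmax([scale * s for s in sims])

for name, sims in similarities.items():
    scores = zero_shot_scores(sims)
    print(name, {label: round(p, 4) for label, p in zip(LABELS, scores)})
```

Under these assumptions, `photo_b` receives the higher "man" score even though `photo_a` is closer to the "man" prompt, because only the gap between the labels matters within each image. Comparing raw similarities (before the softmax) would be the more meaningful way to compare across images.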
I will echo that I'm experiencing the same kind of bias when calling
Expected results:
playing music = 1.000
playing sports = 0.000
https://huggingface.co/laion/CLIP-ViT-B-32-laion2B-s34B-b79K