Distinct labels with same description in dict.csv #31
Comments
Actually, it seems that some of the descriptions (e.g., 'basketball') do refer to distinct concepts, while others (e.g., 'alfa romeo giulietta') seem to describe the same thing.
Hi @chcomin, thank you for the find. You're correct, there are two classes of errors:
Checking the test images at http://openimages.oldjpg.com/, I see that the duplicate classes are sometimes actually the same (e.g. "egg") but other times not. For example "mouse", as already mentioned, but also "fish" (one is the animal and the other is food). Please note that the 3 "alfa romeo giulietta" are:
So resolving all the duplicates would be useful work, but we have to check every class; a simple merge could be wrong.
I agree. Let me check with the Google team whether this is a good time to do it.
Hi, I noticed that some labels have the same description in the file dict.csv.
Is that expected? Should these cases be treated as distinct entities or is it
better to merge them into a single label?
The list of repeated descriptions is:
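A list like this can be produced with a short script. The following is only a minimal sketch, assuming dict.csv is a two-column CSV (label ID, then description) with no header row. The function name `find_repeated_descriptions` and the sample IDs are hypothetical, and the case-insensitive comparison is an assumption rather than something the dataset specifies.

```python
import csv
import io
from collections import defaultdict


def find_repeated_descriptions(csv_file):
    """Map each description used by more than one label ID to those IDs."""
    ids_by_description = defaultdict(list)
    for row in csv.reader(csv_file):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        label_id, description = row[0], row[1]
        # Assumption: compare descriptions ignoring case and outer whitespace.
        ids_by_description[description.strip().lower()].append(label_id)
    return {d: ids for d, ids in ids_by_description.items() if len(ids) > 1}


# Made-up sample rows; these IDs are placeholders, not real labels.
sample = io.StringIO(
    '"/m/aaa","mouse"\n'
    '"/m/bbb","mouse"\n'
    '"/m/ccc","egg"\n'
)
repeated = find_repeated_descriptions(sample)
for description, ids in sorted(repeated.items()):
    print(description, ids)
```

To run it on the real file, pass an open file handle instead of the sample, e.g. `open("dict.csv", newline="", encoding="utf-8")`. The result is only a list of candidates for manual review: two IDs sharing a description may still be distinct concepts, so it should not be used to merge labels automatically.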