Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

One-hot or multi-hot-encoding? #5

Closed
sisrfeng opened this issue Oct 16, 2020 · 2 comments
Closed

One-hot or multi-hot-encoding? #5

sisrfeng opened this issue Oct 16, 2020 · 2 comments

Comments

@sisrfeng
Copy link

Hi, I find that in your dataset flickr-clean, each image contains only 1 category. In flickr-voc, some images contain 2 categories or more. In the latter case, should the label be multi-hot? For example, an image has two annotations [category_id:0] and [category_id:2]. Should the label be [1,0,1,……,0]?
image

ref:
如果一个样本属于多个类,比如在CelebA数据集中一张人脸照片是椭圆形的,戴眼镜的... 这个时候样本的label标注是multi-hot-encoding的,也就是会出现多个标签1。如[0,1,1,0], 1代表属性出现,0代表属性不出现。
https://www.jianshu.com/p/09c49f74308a
https://stats.stackexchange.com/questions/467633/what-exactly-is-multi-hot-encoding-and-how-is-it-different-from-one-hot

@sisrfeng
Copy link
Author

In flickr-clean, each image contains only 1 category. Is this what you expect or just a special case?
If the training data just contain a single object, will it lack enough feature to train a powerful object detector?

@shenyunhang
Copy link
Owner

shenyunhang commented Oct 19, 2020

Hi, I find that in your dataset flickr-clean, each image contains only 1 category. In flickr-voc, some images contain 2 categories or more. In the latter case, should the label be multi-hot? For example, an image has two annotations [category_id:0] and [category_id:2]. Should the label be [1,0,1,……,0]?

Yes, you are right. The paper uses one-hot labels to expose our method and it is easy to generalize to multi-hot labels.

In flickr-clean, each image contains only 1 category. Is this what you expect or just a special case?

flickr-clean is not created by us.
I think it is expected.

If the training data just contain a single object, will it lack enough feature to train a powerful object detector?

I agree with this.
I think it will lack the images with multi-class co-occurrence.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants