Possible data leak in lvis_v1_train_cat_info.json #79

wusize · 2022-09-16T03:19:26Z

Hi, Xingyi!

I loaded the file "lvis_v1_train_cat_info.json", and it seems to contain image_count values for the rare classes. This may lead to a data leak in the open-vocabulary setting when using the fed loss.

xingyizhou · 2022-09-18T23:29:37Z

Hi Size,

Thank you for bringing this up. I remember noting this issue, but I thought we could ignore it because the zero-shot embeddings in the FedLoss will never receive a positive loss. They will receive negative losses (which hurts performance on those classes), but these are rare due to the design of FedLoss. I most likely tried removing them from the loss, but it was probably not better, since I used this version in the final results. Please feel free to try the corrected version and post numbers if you find a considerable difference. Thanks!
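To make the point about positive versus negative losses concrete, here is a rough sketch of frequency-weighted negative sampling in the spirit of a federated loss. It is not the repository's code: the function name `sample_fed_loss_classes`, the square-root weighting (`power=0.5`), and the pure-Python sampling are all assumptions for illustration. Classes present in the ground truth are always kept as positives; the remaining slots are filled by sampling other classes in proportion to `image_count ** power`, so a class with a nonzero count can appear only as a negative, and a class with a count of 0 is never sampled.

```python
import random


def sample_fed_loss_classes(gt_classes, image_counts, num_sample, power=0.5, rng=None):
    """Pick the classes that take part in the loss for one batch (illustrative only).

    gt_classes   : class indices that appear in the batch annotations
    image_counts : per-class image counts, indexed by class
    num_sample   : total number of classes to keep (positives + negatives)
    """
    rng = rng or random.Random()
    positives = sorted(set(gt_classes))
    taken = set(positives)

    # Weighted sampling without replacement (Efraimidis-Spirakis keys):
    # each candidate gets key u ** (1 / w); the largest keys win.
    keyed = []
    for idx, count in enumerate(image_counts):
        weight = count ** power
        if idx in taken or weight <= 0:
            continue  # positives are already kept; zero-weight classes are never sampled
        keyed.append((rng.random() ** (1.0 / weight), idx))
    keyed.sort(reverse=True)

    need = max(0, num_sample - len(positives))
    negatives = [idx for _, idx in keyed[:need]]
    return positives, negatives
```

With counts `[100, 50, 3, 1]`, the last class can be drawn as a negative even if it never has annotations in training; with `[100, 50, 3, 0]` it cannot be drawn at all.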

Best,
Xingyi

wusize · 2022-09-19T13:14:23Z

OK, thanks for your response! It seems unlikely to influence the FedLoss. Still, I recommend pushing a corrected version of lvis_v1_train_cat_info.json that records 0 for novel classes. Otherwise, setting ignore_zero_class = True for the ce and bce losses would cause problems.
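For anyone who wants to try the corrected statistics, a minimal sketch of the zeroing step might look like the following. It assumes the file is a JSON list of category dicts, that novel classes are the ones marked `"frequency": "r"`, and that the counts live under `image_count` and `instance_count`; please check these assumptions against the actual file. The helper name `zero_novel_counts` is hypothetical.

```python
import copy


def zero_novel_counts(cat_info, novel_frequency="r",
                      count_keys=("image_count", "instance_count")):
    """Return a copy of cat_info with counts set to 0 for novel (rare) classes.

    cat_info is assumed to be a list of dicts, one per category, each with a
    "frequency" field and the count fields named in count_keys.
    """
    cleaned = copy.deepcopy(cat_info)  # leave the caller's data untouched
    for cat in cleaned:
        if cat.get("frequency") == novel_frequency:
            for key in count_keys:
                if key in cat:
                    cat[key] = 0
    return cleaned
```

The input can be read with `json.load` and the result written to a new path with `json.dump`, so the original file stays available for comparison.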
