Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any plan to release the code for transfer learning? #6

Open
xwen99 opened this issue May 13, 2022 · 2 comments
Open

Any plan to release the code for transfer learning? #6

xwen99 opened this issue May 13, 2022 · 2 comments

Comments

@xwen99
Copy link

xwen99 commented May 13, 2022

Hi, thank you for your nice work.

I'm now trying to reproduce some transfer learning (e.g., det, semantic seg) results with the released checkpoints, the checkpoint is transferred to pytorch-style with byol-convert and evaluated with another pytorch-based codebase (detectron2/mmseg), but the results are a bit lower than those reported in the paper (e.g., two-point drop in VOC semantic seg).

So I wonder if you have any plan to release the official code for transfer learning? As this may greatly help to reproduce your results.

@olivierhenaff
Copy link

Hello, we've added a checkpoint for a BYOL-pretrained ResNet-50 to our GCS bucket (resnet50_byol_imagenet1k.npy). Could you run your fine-tuning script on this model, to check the relative difference between BYOL and DetCon_B?

@xwen99
Copy link
Author

xwen99 commented May 23, 2022

Hi Oliver,

Recently I reproduced BYOL and DetCon_B with the provided weights (ImageNet 1k epoch pre-training), the results are as follows:

Method COCO det COCO seg VOC Cityscapes
  AP AP mIoU mIoU
BYOL (reported) 41.6 37.2 75.7 74.6
BYOL (reproduced) 41.6 37.6 71.8 (-3.9) 74.8
DetCon (reported) 42.7 38.2 77.3 77.0
DetCon (reproduced) 42.2 38.1 75.6 (-1.7) 76.6

The results on COCO and Cityscapes are acceptable, indicating that the checkpoints are okay.
The only problem lies in VOC. My implementation strictly follows MoCo (1st image), and I re-checked the implementation details in BYOL (2nd image) and DetCon (3rd image), and it seems that both works vary in the transfer learning settings on VOC (including the learning rate and operation on the inference image), just wonder does the difference harm fair comparison?

image
image
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants