Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Difference between DSBN and AdaBN #4

Closed
semi-supervised-paper opened this issue Jun 21, 2020 · 1 comment
Closed

Difference between DSBN and AdaBN #4

semi-supervised-paper opened this issue Jun 21, 2020 · 1 comment

Comments

@semi-supervised-paper
Copy link

Hi,
After reading the paper, I am still puzzled about the difference between DSBN and AdaBN. I think the main difference between DSBN and AdaBN is an additional pseudo label loss in DSBN. However, I can't find the experiment results comparing DSBN and AdaBN in the paper. Have you reproduced AdaBN in your experiments?
Thanks.

@woozch
Copy link
Owner

woozch commented Jun 21, 2020

To add more to the differences you mentioned, the main difference is that AdaBN doesn't have separate affine parameters for each domain. DSBN has separate affine parameters for each domain.
Also, AdaBN propose a online algorithm to re-estimate the mean and variance of BN layer for target domain, which is different from the common moving update scheme. The algorithm simply re-estimates batch mean and variance on the target domain, while DSBN jointly updates batch statistics(mean and variance) using moving average scheme of BN paper during training.
When we wrote the paper, the source code of AdaBN was not available, so we couldn't verify the source code and reproduce it. Also, they didn't use ResNet and didn't provide experimental results on VisDA dataset.

@woozch woozch closed this as completed Jun 22, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants