You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
After reading the paper, I am still puzzled about the difference between DSBN and AdaBN. I think the main difference between DSBN and AdaBN is an additional pseudo label loss in DSBN. However, I can't find the experiment results comparing DSBN and AdaBN in the paper. Have you reproduced AdaBN in your experiments?
Thanks.
The text was updated successfully, but these errors were encountered:
To add more to the differences you mentioned, the main difference is that AdaBN doesn't have separate affine parameters for each domain. DSBN has separate affine parameters for each domain.
Also, AdaBN propose a online algorithm to re-estimate the mean and variance of BN layer for target domain, which is different from the common moving update scheme. The algorithm simply re-estimates batch mean and variance on the target domain, while DSBN jointly updates batch statistics(mean and variance) using moving average scheme of BN paper during training.
When we wrote the paper, the source code of AdaBN was not available, so we couldn't verify the source code and reproduce it. Also, they didn't use ResNet and didn't provide experimental results on VisDA dataset.
Hi,
After reading the paper, I am still puzzled about the difference between DSBN and AdaBN. I think the main difference between DSBN and AdaBN is an additional pseudo label loss in DSBN. However, I can't find the experiment results comparing DSBN and AdaBN in the paper. Have you reproduced AdaBN in your experiments?
Thanks.
The text was updated successfully, but these errors were encountered: