Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

student or teacher fine-tune #22

Open
ttkxyy opened this issue Sep 28, 2023 · 5 comments
Open

student or teacher fine-tune #22

ttkxyy opened this issue Sep 28, 2023 · 5 comments

Comments

@ttkxyy
Copy link

ttkxyy commented Sep 28, 2023

Dear team,
Thank you again for your work on the code!

Do you fine-tune using the student model or the teacher model?

@kahnchana
Copy link
Owner

We use the teacher model. But our experiments showed that using the student does not lead to much difference.

@ttkxyy
Copy link
Author

ttkxyy commented Sep 28, 2023

We use the teacher model. But our experiments showed that using the student does not lead to much difference.

I don't know where the parameters in my pre training section were set incorrectly. The results I ran were only 92.01 on the ucf101 dataset and 64.17 on the hmdb51 dataset. I reproduced the weights you provided, which can reach 94.23 and 68.18, respectively

@ttkxyy
Copy link
Author

ttkxyy commented Sep 29, 2023

Can you disclose some information about the values of pretraining stages momentum_teacher and teacher_temp? Thank you!

@memoiry
Copy link

memoiry commented Oct 13, 2023

@ttkxyy , I noticed that in the default setting, the code will runs for 100 epochs, which is different from the 20 epoch mentioned by papers. Did you notice the difference?

@ttkxyy
Copy link
Author

ttkxyy commented Oct 16, 2023

@memoiry Yes, I noticed that the parameters of SVT are the same as those of the Dino model. I think they should be modified according to the parameters proposed by the author in the paper, but I have not been able to successfully reproduce it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants