Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

not sure about crop_audio_window in color_syncnet_train.py #7

Closed
zhouyong64 opened this issue Aug 31, 2020 · 3 comments
Closed

not sure about crop_audio_window in color_syncnet_train.py #7

zhouyong64 opened this issue Aug 31, 2020 · 3 comments

Comments

@zhouyong64
Copy link

About crop_audio_window in color_syncnet_train.py. I'm not unsure if the 0-based indexing to 1-based indexing is done correctly. For example, if the file name of the frame is 0.jpg (beginning of the video), current implementation would give a non-zero start_idx for spec, which I think is wrong. It seems to me that for 0.jpg, the start_idx for spec should be 0.

@prajwalkr
Copy link
Collaborator

Fixed now.

@zhouyong64
Copy link
Author

The pretrained syncnet model was trained on the old code. Doesn't it mean that the model was not trained on the best possible data, thus the model might not be as good as it could be? What do you think?

@prajwalkr
Copy link
Collaborator

Yes, it can be improved a little bit, but we do not think that the difference will be large. I say this because originally, it was without that + 1 and it had a very similar performance. But your logic is definitely sound, thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants