not sure about crop_audio_window in color_syncnet_train.py #7

zhouyong64 · 2020-08-31T09:11:56Z

About crop_audio_window in color_syncnet_train.py: I'm not sure the conversion from 0-based to 1-based indexing is done correctly. For example, if the frame's file name is 0.jpg (the beginning of the video), the current implementation gives a non-zero start_idx for spec, which I think is wrong. It seems to me that for 0.jpg, the start_idx for spec should be 0.
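
To make the concern concrete, here is a minimal sketch of the index arithmetic, not the repository's exact code. The constants (80 mel frames per second, 25 fps video, a 16-frame mel window), the `frame_id` helper, and the `offset` parameter are assumptions introduced only for illustration. `offset=1` stands in for the questioned `+ 1` and `offset=0` for the behaviour proposed above.

```python
import os

# Assumed values for illustration only; the real ones come from the training hparams.
MEL_FRAMES_PER_SEC = 80.0  # e.g. 16 kHz audio with a hop size of 200 samples
FPS = 25.0                 # video frame rate
MEL_STEP_SIZE = 16         # length of the audio window in mel frames


def frame_id(frame_name):
    """Hypothetical helper: map a frame file name such as '0.jpg' to its number."""
    return int(os.path.splitext(os.path.basename(frame_name))[0])


def start_idx(frame_name, offset=0):
    """Index of the first mel frame for a video frame.

    offset=1 mimics adding 1 to the 0-based frame number before converting;
    offset=0 keeps the frame number as-is.
    """
    return int(MEL_FRAMES_PER_SEC * ((frame_id(frame_name) + offset) / FPS))


def crop_audio_window(spec, frame_name, offset=0):
    """Slice MEL_STEP_SIZE mel frames from spec, starting at the video frame."""
    start = start_idx(frame_name, offset)
    return spec[start:start + MEL_STEP_SIZE]


if __name__ == "__main__":
    for name in ("0.jpg", "25.jpg"):
        print(name, "with +1:", start_idx(name, offset=1),
              "without:", start_idx(name, offset=0))
```

Under these assumptions, the `+ 1` shifts every audio window by roughly one video frame (about 3 mel frames), so 0.jpg no longer starts at the beginning of the spectrogram.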

prajwalkr · 2020-08-31T10:42:56Z

Fixed now.

zhouyong64 · 2020-08-31T10:54:56Z

The pretrained syncnet model was trained with the old code. Doesn't that mean the model was not trained on the best possible data, and so might not be as good as it could be? What do you think?

prajwalkr · 2020-08-31T10:57:23Z

Yes, it can be improved a little bit, but we do not think the difference will be large. I say this because the code originally did not have that + 1, and performance was very similar. But your logic is definitely sound, thank you.

prajwalkr closed this as completed Aug 31, 2020

SebaGenetic mentioned this issue Sep 13, 2020

Shared notebook fails in last step #47

Closed
