Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Misalignment of audio-visual frames and labels #2

Open
SAGNIKMJR opened this issue Feb 23, 2023 · 3 comments
Open

Misalignment of audio-visual frames and labels #2

SAGNIKMJR opened this issue Feb 23, 2023 · 3 comments

Comments

@SAGNIKMJR
Copy link

SAGNIKMJR commented Feb 23, 2023

Hi,

There is a chance of misalignment between the AV frames and the labels in dataloader.py due to the interpolation in

if gt_frames > framenum.shape[0]:
. I think that there could be 2 ways to handle this: 1) have contiguous AV frames and do label interpolation, or 2) have discontiguous AV frames and not do label interpolation. Do you expect either of these options to work better?

P.S. This is similar to #1 (comment).

Thanks,
Sagnik

@zcxu-eric
Copy link
Owner

Hi, I think both are good enough if there is not any difference in performance, but it is hard to evaluate because the labels are not so accurate. Feel free to raise a pull request if you would like to help fix it. Thanks a lot.

@SAGNIKMJR
Copy link
Author

Also, some clip frame numbers that are there in the bbox annotations (stored in data/ego4d/bbox) are missing in data/video_imgs. For e.g., there is frame number 8991 at index 291 of the list stored in data/ego4d/bbox/34bc808a-4b98-42c5-a1ea-6f565dd4aa20:track_0:29.json but there is no corresponding image frame for it, i.e., data/video_imgs/34bc808a-4b98-42c5-a1ea-6f565dd4aa20/img_08991.jpg doesn't exist. I looked at https://github.com/zcxu-eric/Ego4d_TalkNet_ASD/blob/main/scripts/extract_frame.sh and it looks like it dumps all clip frames in data/video_imgs. Any idea why this is the case? Thanks.

@zcxu-eric
Copy link
Owner

Hi, we didn't encounter this bug on our side. One quick solution is to check the data sanity before training and discard these labels if necessary.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants