-
Notifications
You must be signed in to change notification settings - Fork 609
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Random clip per video #4039
Comments
Hi, thanks for creating the issue. Unfortunately, currently something like this is not supported in DALI. We are working on improvements to video support in DALI. We introduced new I am adding the feature you requested as something to be considered for the new video reader. Tracked internally as DALI-2881 Are there any more features that you would like to see in DALI with regards to video support? We are very interested in making the reworked video reader relevant for our users. Any suggestions and feedback is appreciated. Thanks! |
@awolant |
Hi @davidsvy, As @awolant already stated, this is not possible in DALI but it is good to know about such a use case so we can account for it during new video reader development. |
hi @awolant, thank you for the response. meanwhile i have read more through the DALI docs and became aware of the what i'm currently working on is to integrate this with a pytorch dataloader, such that dali also works with multiple workers. i hope this can work out :) thank you for making me aware of the new VideoReaderDecoder. will this be an additional feature or a replacement for fn.readers.video? i'm unsure how it is relevant to my issue. i read here that this new feature supports CFR and VFR videos which would definitely be important when processing the original videos from the Kinetics dataset. i currently got this issue out of the way by preprocessing the videos with ffmpeg. |
Hi @dschoerk,
DALI is a drop-in replacement for PyTorch dataloader, running it from multiple threads won't yield any benefit. Please check this part of the DALI documentation to see how to use DALI with PyTorch.
Long term we plan to replace, but we cannot commit to any timeline. |
@awolant |
I think it would be best to check [DeepStream][(https://docs.nvidia.com/metropolis/deepstream/dev-guide/). I think it should support a video stream for inference. |
@JanuszL |
Can you use DALI and DeepSteam together? DeepSteam to get the video and DALI to process it further (it means to have two ensembles for data processing? |
I am not sure, need time to explore. |
Was DALI-2881 implemented or not yet? Is it visible for you when it will be available? |
@Alwahsh, |
Hi, I'm currently experimenting with action recognition on the Kinetics 400 dataset. For training an action recognition model I would like to extract a single (sub-) clip per video in the dataset. What I so far came up with is the following pipeline.
Setting
step
to a high value ensures that I only get one clip per video, but it always starts at the beginning of the video. What I'm trying to achieve is to get a sub clip of N=16 frames at a random starting time. Similar to what is possible in Pytorchvideo. The clip_sampler here samples a random clip of _CLIP_DURATION length.Best regards and thanks for your help
The text was updated successfully, but these errors were encountered: