UCF101: Dataloader Fail on assertion #4112

YanCote · 2021-06-24T15:27:50Z

🐛 Bug

When loading UCF101 with different value of frames_per_clip and step_between_clips, it often yield frames_per_clip + 1 images which results in the following assertion to fail:

assert len(video) == self.num_frames, "{} x {}".format(
    video.shape, self.num_frames

Code To Reproduce

import torch
from torchvision.datasets import UCF101

ucf_loc = "/dataset/ucf"
ucf_data_dir = f"{ucf_loc}/UCF101/UCF-101"
ucf_label_dir = f"{ucf_loc}/ucfTrainTestlist"
frames_per_clip = 5
step_between_clips = 1
num_workers = 4

def custom_collate(batch):
    filtered_batch = []
    for video, _, label in batch:
        filtered_batch.append((video, label))
    return torch.utils.data.dataloader.default_collate(filtered_batch)

if __name__ == '__main__':
    # create train loader (allowing batches and other extras)
    test_dataset = UCF101(ucf_data_dir, ucf_label_dir, frames_per_clip=frames_per_clip,
                           step_between_clips=step_between_clips, train=False, transform=None, num_workers=num_workers)
    test_loader = torch.utils.data.DataLoader(test_dataset, batch_size=8, shuffle=True,
                                               collate_fn=custom_collate,num_workers=num_workers)

    for i, (video, label) in enumerate(test_loader):
        print(video.size())
        print(label)

Steps to reproduce the behavior:

Download UCF101
Modify ucf_loc and run the code included above

stack trace

Original Traceback (most recent call last):
File "/opt/miniconda3/envs/p38_ucf/lib/python3.8/site-packages/torch/utils/data/_utils/worker.py", line 287, in _worker_loop
data = fetcher.fetch(index)
File "/opt/miniconda3/envs/p38_ucf/lib/python3.8/site-packages/torch/utils/data/_utils/fetch.py", line 44, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/opt/miniconda3/envs/p38_ucf/lib/python3.8/site-packages/torch/utils/data/_utils/fetch.py", line 44, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/opt/miniconda3/envs/p38_ucf/lib/python3.8/site-packages/torchvision/datasets/ucf101.py", line 102, in getitem
video, audio, info, video_idx = self.video_clips.get_clip(idx)
File "/opt/miniconda3/envs/p38_ucf/lib/python3.8/site-packages/torchvision/datasets/video_utils.py", line 382, in get_clip
assert len(video) == self.num_frames, "{} x {}".format(
AssertionError: torch.Size([6, 240, 320, 3]) x 5

Expected behavior

The return tensor data should be composed of a total of frames_per_clip images consistently no matter the properties of the input video and the parameter values provided to UCF101 class.

Environment

PyTorch =1.9.0 and TorchVision=0.10.0.
OS (e.g., Linux): MacOs BigSur
How you installed PyTorch / torchvision (conda, pip, source): conda
Build command you used (if compiling from source): NA
Python version: 3.8.10
CUDA/cuDNN version: NA
GPU models and configuration: CPU
Any other relevant information:

Additional context

Preliminary Investigation:

My guess is that with the provided argument, _read_from_stream function return +/- 1 frames in video.py: read_video(). I did not dig deeper to understand why

The text was updated successfully, but these errors were encountered:

fmassa · 2021-06-24T15:48:18Z

This is probably related to #3791, as now when using sec to index into the video, there are rounding errors which leads to the error.

cc @prabhat00155 @bjuncek I've brought this potential problem during our call a few weeks ago, we should fix it

YanCote · 2021-06-28T17:07:35Z

Thank you for following up.
A quick question, Is there a possible workaround, not sure how to modify torchvision/io/video.py:_read_from_stream() to address this issue.

bjuncek · 2021-07-22T18:22:54Z

@YanCote there is an easy workaround - I've submitted the PR and it's readily usable.
I think I might wanna change the defaults and update other datasets just in case, but this should patch up the issue for the time being

xianyuanliu · 2021-12-22T13:11:10Z

Hi guys, I have the same problem loading ucf101 with frames_per_clip=16. May I know if it is resolved now, please?

bjuncek · 2022-04-14T11:15:17Z

Yup, it's now resolved on the latest main.

shehan360 · 2022-04-20T00:41:47Z

@bjuncek How can I get these changes to my environment? By installing the nightly build?

ex -

pip3 install --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/cpu

bjuncek · 2022-04-21T10:27:39Z

Hi @shehan360

Nightly build should work provided that you have pyav installed as well.
Otherwise, I'd suggest building from source (and having pyav and ffmpeg installed if you want the full video-reader support.

xiaoguan1206 · 2022-12-18T16:43:23Z

How can I solve this problem? PyTorch =1.10.1 and TorchVision= 0.11.2 .

fmassa added bug high priority module: io module: video labels Jun 24, 2021

pytorch-probot bot added the triage review label Jun 24, 2021

bjuncek self-assigned this Jul 21, 2021

bjuncek mentioned this issue Jul 22, 2021

UCF101 Sketchy Fix #4204

Open

fmassa mentioned this issue Nov 3, 2021

Assertion error during kinetics400 validation #4839

Closed

fmassa mentioned this issue Nov 18, 2021

VideoClips Assertion Error #1884

Closed

fmassa assigned prabhat00155 Nov 18, 2021

bjuncek mentioned this issue Apr 1, 2022

2022: state of video IO in torchvision #5720

Open

18 tasks

bjuncek closed this as completed Apr 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UCF101: Dataloader Fail on assertion #4112

UCF101: Dataloader Fail on assertion #4112

YanCote commented Jun 24, 2021

fmassa commented Jun 24, 2021

YanCote commented Jun 28, 2021

bjuncek commented Jul 22, 2021

xianyuanliu commented Dec 22, 2021

bjuncek commented Apr 14, 2022

shehan360 commented Apr 20, 2022

bjuncek commented Apr 21, 2022

xiaoguan1206 commented Dec 18, 2022

UCF101: Dataloader Fail on assertion #4112

UCF101: Dataloader Fail on assertion #4112

Comments

YanCote commented Jun 24, 2021

🐛 Bug

Code To Reproduce

Steps to reproduce the behavior:

stack trace

Expected behavior

Environment

Additional context

fmassa commented Jun 24, 2021

YanCote commented Jun 28, 2021

bjuncek commented Jul 22, 2021

xianyuanliu commented Dec 22, 2021

bjuncek commented Apr 14, 2022

shehan360 commented Apr 20, 2022

bjuncek commented Apr 21, 2022

xiaoguan1206 commented Dec 18, 2022