Adding Uniform temporal Subsampling for Video #6812
Conversation
I'm ok with having custom tests for this now, but they only cover correctness at this point. Unless this is super urgent, we should really implement a KernelInfo and DispatcherInfo for it to also cover stuff like JIT. @datumbox if you don't have time to add that LMK and I will push a commit.
def uniform_temporal_subsample_video(video: torch.Tensor, num_samples: int, temporal_dim: int = -4) -> torch.Tensor:
    # Reference: https://github.com/facebookresearch/pytorchvideo/blob/a0a131e/pytorchvideo/transforms/functional.py#L19
    t_max = video.size(temporal_dim) - 1
    indices = torch.linspace(0, t_max, num_samples, device=video.device).clamp_(0, t_max).long()
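For context, a minimal usage sketch of the kernel (this is an illustration, not code from the PR; it assumes the kernel goes on to select the computed indices along temporal_dim, as the referenced pytorchvideo implementation does, and uses the (..., T, C, H, W) layout implied by the default temporal_dim=-4):

import torch

video = torch.rand(2, 16, 3, 224, 224)  # (batch, T, C, H, W) -> T sits at dim -4
out = uniform_temporal_subsample_video(video, num_samples=8)
assert out.shape == (2, 8, 3, 224, 224)  # 8 evenly spaced frames out of 16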
Not sure if this is just for parity, but is there a reason to create the linspace in int32 to later convert it to int64?
Plus, maybe this is just personal preference, but I would appreciate us being "explicit" about the types. Suggested change:
-    indices = torch.linspace(0, t_max, num_samples, device=video.device).clamp_(0, t_max).long()
+    indices = torch.linspace(0, t_max, num_samples, device=video.device).clamp_(0, t_max).to(torch.int64)
Plusplus, the docs for torch.linspace state: "Creates a one-dimensional tensor of size steps whose values are evenly spaced from start to end, inclusive."
What is the clamp for if the function by definition does not return values outside this range?
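For what it's worth, a quick check (my own illustration, not from the PR) confirms that torch.linspace is endpoint-inclusive, so on these inputs the clamp should be a no-op; presumably it is only kept as a guard in the ported code:

import torch

t_max = 9
values = torch.linspace(0, t_max, 5)          # tensor([0.0000, 2.2500, 4.5000, 6.7500, 9.0000])
assert values.max().item() == t_max           # the last element is exactly `end`
assert values.clamp(0, t_max).equal(values)   # clamping changes nothing here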
Let's be careful about how we are porting this. Here is why casting later is not the same:
t_max = 9
num_samples = 8
indices = torch.linspace(0, t_max, num_samples, dtype=torch.int64)
indices2 = torch.linspace(0, t_max, num_samples).clamp(0, t_max).long()
assert indices.equal(indices2), f"({t_max}, {num_samples})\n{indices}\n{indices2}"
Result:
assert indices.equal(indices2), f"({t_max}, {num_samples})\n{indices}\n{indices2}"
AssertionError: (9, 8)
tensor([0, 1, 2, 3, 5, 6, 7, 8])
tensor([0, 1, 2, 3, 5, 6, 7, 9])
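In other words (my own illustration, not part of the PR), the float-then-cast path truncates each evenly spaced float value independently, which is the behavior the port preserves:

import torch

t_max, num_samples = 9, 8
# floor(i * t_max / (num_samples - 1)) for each index, which the trailing .long() effectively applies
expected = torch.tensor([int(i * t_max / (num_samples - 1)) for i in range(num_samples)])
indices2 = torch.linspace(0, t_max, num_samples).clamp(0, t_max).long()
assert expected.equal(indices2)  # tensor([0, 1, 2, 3, 5, 6, 7, 9])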
IMO, this looks like a bug in torch.linspace, as its docs state that the last element is end (= t_max).
@pmeier Thanks for the review. I've addressed the comments to unblock. We should review to what degree these nits can be covered by the linter, or adopt simpler conventions in the future. Regardless, I've made the necessary changes to cut down the back-and-forth, as we need this ASAP to assist the migration of a couple of internal teams. Concerning the tests, I was hoping for more pointers from you on where I'm supposed to add them (the current structure has a lot of abstraction and it's not obvious). Concerning JIT testing, I have a test but it's probably in the wrong place. You are welcome to push a commit to this branch to improve the tests, as you are more familiar with the new test infra, but if that is going to take longer, we should do it in a separate PR.
@datumbox I've added the tests I wanted and over-explained them in my comments below. To run these tests, you can use
pytest test/test_prototype_transforms_functional.py -k "uniform_temporal_subsample"
This will run ~200 tests. LMK if any of my explanations are unclear or if I missed something that you want to know.
There is one relevant test failure:
_____________ TestUniformTemporalSubsample.test__transform[inpt2] ______________
Traceback (most recent call last):
File "/home/runner/work/vision/vision/test/test_prototype_transforms.py", line 1917, in test__transform
output = transform(inpt)
File "/opt/hostedtoolcache/Python/3.7.15/x64/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl
return forward_call(*input, **kwargs)
File "/home/runner/work/vision/vision/torchvision/prototype/transforms/_transform.py", line 40, in forward
for inpt in flat_inputs
File "/home/runner/work/vision/vision/torchvision/prototype/transforms/_transform.py", line 40, in <listcomp>
for inpt in flat_inputs
File "/home/runner/work/vision/vision/torchvision/prototype/transforms/_temporal.py", line 16, in _transform
return F.uniform_temporal_subsample(inpt, self.num_samples, temporal_dim=self.temporal_dim)
File "/home/runner/work/vision/vision/torchvision/prototype/transforms/functional/_temporal.py", line 20, in uniform_temporal_subsample
raise ValueError("Video inputs must have temporal_dim equivalent to -4")
ValueError: Video inputs must have temporal_dim equivalent to -4
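For reference, a rough reconstruction of the kind of guard that raises this error (the names and structure below are my guesses based on the traceback, not the actual implementation in torchvision/prototype/transforms/functional/_temporal.py):

import torch
from torchvision.prototype import features  # prototype namespace at the time of this PR

def uniform_temporal_subsample(inpt: torch.Tensor, num_samples: int, temporal_dim: int = -4) -> torch.Tensor:
    # Video features use a fixed (..., T, C, H, W) layout, so a non-default
    # temporal_dim only makes sense for plain tensors.
    if isinstance(inpt, features.Video) and temporal_dim != -4:
        raise ValueError("Video inputs must have temporal_dim equivalent to -4")
    return uniform_temporal_subsample_video(inpt, num_samples, temporal_dim=temporal_dim)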
Otherwise LGTM if CI is green. Thanks Vasilis!
Fixed the bug mentioned above for you. PR is now ready to be merged from my side unless you have any objections about the stuff I added.
Summary:
* Adding temporal sampling kernel and dispatcher.
* Adding the UniformTemporalSubsample class.
* Add it on init
* Adding tests.
* Addressing comments.
* Reverting proposal as it led to different results.
* add more tests for uniform_temporal_subsample
* cleanup
* fix logic
* fix logic
* make test more strict
* lint
* Update torchvision/prototype/transforms/functional/_temporal.py
* remove pytorchvideo again per request

Reviewed By: YosuaMichael
Differential Revision: D40722910
fbshipit-source-id: 68af13821890d1784f47ddb7cfbfea409b6ee6a0
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Resolves #6768
Adding temporal sampling Transforms similar to the ones in PyTorchVideo.
cc @vfdev-5 @bjuncek @pmeier