
gradgradcheck for torch.repeat and torch.tile is outrageously slow #49962

Closed
mruberry opened this issue Dec 30, 2020 · 2 comments
Labels
module: autograd (Related to torch.autograd and the autograd engine in general), module: tests (Issues related to tests, not the torch.testing module), triaged (This issue has been looked at by a team member and prioritized into an appropriate module)

Comments

mruberry (Collaborator) commented Dec 30, 2020

torch.repeat and torch.tile (which is implemented using torch.repeat) are relatively fast compared to NumPy's tile, but attempting to gradgradcheck them is incredibly slow in some cases. For example:

import torch
from torch.autograd import gradgradcheck

x = torch.randn(5, 5, 5, requires_grad=True, dtype=torch.double)

def partial(x):
    return x.repeat(5, 5, 5, 5)

gradgradcheck(partial, x)

takes 77.93s on my devfair! While not an apples-to-apples comparison, computing the function's Hessian is relatively fast:

import torch

x = torch.randn(5, 5, 5, requires_grad=True, dtype=torch.double)

def partial(x):
    return x.repeat(5, 5, 5, 5).sum()

torch.autograd.functional.hessian(partial, x)

takes only 0.07s to run. That is, it is over 1000x faster than the gradgradcheck.
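For reference, a minimal script that times both calls side by side might look like the sketch below (not part of the original report; the helper names repeat_fn and repeat_sum_fn are mine, and the absolute timings will vary by machine):

import time
import torch
from torch.autograd import gradgradcheck

x = torch.randn(5, 5, 5, requires_grad=True, dtype=torch.double)

def repeat_fn(t):
    # non-scalar output: gradgradcheck checks the full double-backward Jacobian
    return t.repeat(5, 5, 5, 5)

def repeat_sum_fn(t):
    # scalar output: suitable for torch.autograd.functional.hessian
    return t.repeat(5, 5, 5, 5).sum()

start = time.perf_counter()
gradgradcheck(repeat_fn, x)
print(f"gradgradcheck: {time.perf_counter() - start:.2f}s")

start = time.perf_counter()
torch.autograd.functional.hessian(repeat_sum_fn, x)
print(f"hessian:       {time.perf_counter() - start:.2f}s")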

gradgradcheck being so slow appears to have a real impact. See these two tests on ASAN:

Dec 30 02:18:32   test_tile_more_reps_dims_cpu (__main__.TestAutogradDeviceTypeCPU) ... ok (1023.734s)
Dec 30 02:20:21   test_tile_same_reps_dims_cpu (__main__.TestAutogradDeviceTypeCPU) ... ok (109.164s)

or on the clang build:

Dec 30 01:51:11   test_tile_more_reps_dims_cpu (__main__.TestAutogradDeviceTypeCPU) ... ok (115.829s)
Dec 30 01:51:24   test_tile_same_reps_dims_cpu (__main__.TestAutogradDeviceTypeCPU) ... ok (13.091s)

cc @ezyang @albanD @zou3519 @gqchen @pearu @nikitaved @soulitzer @mruberry @VitalyFedyunin @walterddr

mruberry added the module: autograd, module: tests, and triaged labels on Dec 30, 2020
albanD (Collaborator) commented Dec 30, 2020

Well, the .sum() makes all the difference.
You can reduce the size of the input/output to bring the runtime down.
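A minimal sketch of that suggestion (the smaller sizes below are illustrative only, not the values chosen in the eventual test fix):

import torch
from torch.autograd import gradgradcheck

# Illustrative smaller input and repeat counts; the sizes actually used to fix
# the tests may differ.
x = torch.randn(2, 2, 2, requires_grad=True, dtype=torch.double)

def partial(x):
    return x.repeat(2, 2, 2, 2)

gradgradcheck(partial, x)  # finishes quickly because the output has far fewer elements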

anjali411 (Contributor) commented

Reduced the input size for the tile tests and they don't time out anymore!
