try to make at::cat in mm_tree_reduction operate on contig tensors #18816
Conversation
Great patch! I think it would be nice to add commentary to both the matmul->mm and the cat optimizations. Also, I'm not quite sure about dropping b_ih in the cell but not in the arguments.
benchmarks/fastrnns/cells.py (outdated)
Shouldn't we either take b_ih out of the arguments or keep it here?
torch/csrc/jit/passes/batch_mm.cpp (outdated)
Since we know that they're 2D, can't we just do the stride check manually? .t().is_contiguous() is quite convenient, but it allocates a whole new tensor, which is total overkill for this case.
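For readers following along, here is a minimal sketch of the kind of manual stride check being suggested (the helper name is hypothetical and not taken from this PR; it assumes non-degenerate 2D sizes, under which t.t().is_contiguous() reduces to two stride comparisons):

#include <ATen/ATen.h>

// Hypothetical helper: true when `t` is the transpose of a contiguous 2D
// matrix, i.e. what t.t().is_contiguous() would report, but computed directly
// from the strides without creating the intermediate .t() view.
bool is_transposed_2d(const at::Tensor& t) {
  return t.dim() == 2 && t.stride(0) == 1 && t.stride(1) == t.size(0);
}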
torch/csrc/jit/passes/batch_mm.cpp (outdated)
nit: this is just
return fmap(inputs, [](const at::Tensor& i) { return i.t(); });
torch/csrc/jit/passes/batch_mm.cpp (outdated)
This requires a change of the function name, because you have completely changed the semantics. Is it really slower if the strides are not all the same, or is it just a guess?
It's so that in the transpose check I could check the strides of only the first tensor, and honestly it's hard to imagine a graph that would have tensors of the same sizes eligible for tree reduction, but with different strides. In any case, I'll just make the transpose check go over all the tensors and leave have_same_shape alone.
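A rough sketch of that direction (hypothetical names, not the actual batch_mm.cpp code): apply the same per-tensor stride check to every input instead of only the first one.

#include <ATen/ATen.h>
#include <vector>

// Returns true only if every input is a 2D transposed-contiguous matrix.
bool all_inputs_transposed(const std::vector<at::Tensor>& inputs) {
  for (const at::Tensor& t : inputs) {
    if (t.dim() != 2 || t.stride(0) != 1 || t.stride(1) != t.size(0)) {
      return false;
    }
  }
  return true;
}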
benchmarks/fastrnns/factory.py (outdated)
Honestly, I think we should add a new benchmark to see the effect of this change. We've been using this one for quite a while, so I'd rather keep its meaning consistent with what people expect.
The effect is on the order of a couple percent; I'll add a separate benchmark.
Actually, I'm getting a crazy big (7%) improvement from adding the bias on my current system, but it's very system-dependent.
root@7a3abf660096:/workspace/ALL/pytorch_upstream/benchmarks# python -m benchmarks.fastrnns.bench --group rnns --inputSize 1024 --hiddenSize 1024 --rnns jit_premul jit_premul_bias jit cudnn --nloops 100
Namespace(cnns=None, device='cuda', group=['rnns'], hiddenSize=1024, inputSize=1024, miniBatch=64, nloops=100, numLayers=1, print_json=False, rnns=['jit_premul', 'jit_premul_bias', 'jit', 'cudnn'], sep=' ', seqLength=100, variable_lstms=False, warmup=10)
Benchmarking LSTMs...
name               avg_fwd   std_fwd   avg_bwd   std_bwd
jit_premul         10.52     0.02504   21.2      1.051
jit_premul_bias    10.7      0.04063   19.62     0.2769
jit                11.49     0.02089   20.96     0.2493
cudnn              9.815     0.04521   18.98     0.09339
@pytorchbot retest this please
@apaszke can you please take a look? CI failures look unrelated.
Looks like comments are all addressed and the code seems fine to me.
@zdevito is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@wanchaol has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@wanchaol is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
try to make at::cat in mm_tree_reduction operate on contig tensors (pytorch#18816)

Summary: Sometimes at::cat gets transposed inputs and goes on a slow path. Also, make jit_premul lstm benchmark add bias to the whole input tensor to avoid separate reduction kernels in the backward pass.

Pull Request resolved: pytorch#18816
Differential Revision: D15013576
Pulled By: wanchaol
fbshipit-source-id: bcfa1cf44180b11b05b0f55f034707012f66281a
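To illustrate the idea in the summary (a hedged sketch with a hypothetical helper name, not the actual batch_mm.cpp code): when the matrices being concatenated are transposed views, at::cat has to gather strided data; since .t() is a cheap view, concatenating the transposed views along the other dimension and transposing the result yields the same matrix while letting at::cat copy contiguous rows.

#include <ATen/ATen.h>
#include <vector>

// Hypothetical helper: equivalent to at::cat(mats, /*dim=*/1) when every
// element of `mats` is the transpose of a contiguous 2D matrix, but performs
// the actual concatenation over contiguous inputs.
at::Tensor cat_columns_via_transpose(const std::vector<at::Tensor>& mats) {
  std::vector<at::Tensor> transposed;
  transposed.reserve(mats.size());
  for (const at::Tensor& m : mats) {
    transposed.push_back(m.t());  // cheap view; contiguous if m was transposed
  }
  return at::cat(transposed, /*dim=*/0).t();
}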