Need to canonicalize or optimize high-dim concat to concat+transpose #1296

nadavrot · 2018-07-19T20:13:25Z

The picture below depicts a concat node that joins its inputs on dimension 1. The IR that we generate for this code is inefficient for two reasons. First, we can't optimize the operator that writes the result, because the result is scattered across the second dimension (dimension 0 is the first). Second, we emit a sequence of insert_tensor instructions that process the tensor several times, invalidating the cache. A much better way would be to represent this as a dim-0 concat followed by a transpose.
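To make the proposed rewrite concrete, here is a minimal sketch in plain Python on 2-D row-major matrices (lists of rows). It is not Glow code: the helper names `transpose`, `concat_dim0`, `concat_dim1`, and `concat_dim1_via_dim0` are hypothetical. It also assumes that, for the two forms to be equivalent, each input is transposed before the dim-0 concat, which the description above does not spell out.

```python
def transpose(m):
    """Swap the two dimensions of a 2-D matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def concat_dim0(mats):
    """Concatenate along dim 0: each input becomes one contiguous block of rows."""
    out = []
    for m in mats:
        out.extend(list(row) for row in m)
    return out


def concat_dim1(mats):
    """Concatenate along dim 1: every output row gathers a piece from each input."""
    return [[x for m in mats for x in m[i]] for i in range(len(mats[0]))]


def concat_dim1_via_dim0(mats):
    """Hypothetical rewrite: transpose inputs, concat on dim 0, transpose the result."""
    return transpose(concat_dim0([transpose(m) for m in mats]))


a = [[1, 2],
     [3, 4]]
b = [[5],
     [6]]

direct = concat_dim1([a, b])
rewritten = concat_dim1_via_dim0([a, b])
print(direct)
print(rewritten)
```

In the dim-0 form each input is written as one contiguous block, which is the property the paragraph above is after. Whether the extra input transposes can be folded away or are worth their cost depends on the backend.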

Design question: I am not sure if this should be the canonical representation, the only representation, or simply a target-specific optimization.

sparkingdark · 2020-12-02T05:30:36Z

Hey @nadavrot, can you elaborate?

glowbucky · 2021-09-08T06:02:20Z

Hey @nadavrot, I would love to work on this issue. Can you please tell me where I should start? Should this be designed as a graph optimization pass?

nickgg added the enhancement and good first issue labels on Apr 29, 2019
