Add torch.stack support #524

yf225 · 2025-08-29T06:09:34Z

Fixes #523.

helion/language/memory_ops.py

jansel

Could this be handled similar to:

helion/helion/_compiler/inductor_lowering.py

Lines 823 to 836 in 3c0348a

    
           @register_lowering( 
        
               torch.ops.aten.permute.default,  # pyright: ignore[reportAttributeAccessIssue] 
        
               masked_value_fn=passthrough_masked_value, 
        
           ) 
        
           def codegen_permute(ctx: GraphInterpreter, node: torch.fx.Node) -> object: 
        
               assert not node.kwargs, "getitem kwargs not supported" 
        
               tensor, dims = map_arg(node.args, lambda arg: ctx.env[arg]) 
        
               assert isinstance(tensor, ast.AST) 
        
               dims = [*dims]  # pyright: ignore[reportGeneralTypeIssues,reportOptionalIterable] 
        
               assert {*dims} == {*range(len(dims))}, dims 
        
               return expr_from_string( 
        
                   f"tl.permute({{tensor}}, {dims!r})", 
        
                   tensor=tensor, 
        
               )

I'd expect that to be simpler with less need for special casing view ops.

yf225 · 2025-08-30T23:02:49Z

helion/_compiler/device_ir.py

-        ).graph
+        decomp_table = select_decomp_table()
+        decomp_table.pop(torch.ops.aten.stack.default, None)
+        return proxy_tensor.make_fx(fn, decomposition_table=decomp_table)(*args).graph


Normally, torch.stack is decomposed to unsqueeze + cat, but I haven’t figure out a way to make codegen_cat work, so as a workaround we disable the decomp for torch.stack and implement codegen_stack instead.

helion/_compiler/device_ir.py

yf225 · 2025-08-31T05:03:09Z

test/test_views.py

+        torch.testing.assert_close(result, expected, rtol=1e-5, atol=1e-5)
+        self.assertExpectedJournal(code)
+
+        # Verify torch.compile still decomposes aten.stack to aten.cat


Added test to make sure _get_custom_decomp_table doesn't affect normal torch.compile decomp for torch.stack

yf225 requested review from jansel and oulgen August 29, 2025 06:09

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 29, 2025

yf225 commented Aug 29, 2025

View reviewed changes

helion/language/memory_ops.py Outdated Show resolved Hide resolved

yf225 force-pushed the torch_stack_v1 branch 2 times, most recently from c076d0e to 46cc702 Compare August 29, 2025 07:45

jansel requested changes Aug 30, 2025

View reviewed changes

yf225 force-pushed the torch_stack_v1 branch from 46cc702 to 2d4ebe6 Compare August 30, 2025 22:59

yf225 commented Aug 30, 2025

View reviewed changes

yf225 requested a review from jansel August 30, 2025 23:04

yf225 force-pushed the torch_stack_v1 branch 2 times, most recently from e8138a1 to dcfb3fe Compare August 31, 2025 00:37

jansel requested changes Aug 31, 2025

View reviewed changes

helion/_compiler/device_ir.py Outdated Show resolved Hide resolved

yf225 force-pushed the torch_stack_v1 branch 2 times, most recently from b766ebf to a83637d Compare August 31, 2025 04:58

torch.stack support

71cfb78

yf225 force-pushed the torch_stack_v1 branch from a83637d to 71cfb78 Compare August 31, 2025 05:01

yf225 commented Aug 31, 2025

View reviewed changes

yf225 requested a review from jansel August 31, 2025 05:03

jansel approved these changes Sep 1, 2025

View reviewed changes

yf225 merged commit 0f3e2d5 into main Sep 1, 2025
13 checks passed

lolpack pushed a commit to lolpack/helion that referenced this pull request Oct 13, 2025

Add torch.stack support (pytorch#524)

4b16fba

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add torch.stack support #524

Add torch.stack support #524

Uh oh!

yf225 commented Aug 29, 2025

Uh oh!

Uh oh!

jansel left a comment

Uh oh!

yf225 Aug 30, 2025

Uh oh!

Uh oh!

yf225 Aug 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	@register_lowering(
	torch.ops.aten.permute.default, # pyright: ignore[reportAttributeAccessIssue]
	masked_value_fn=passthrough_masked_value,
	)
	def codegen_permute(ctx: GraphInterpreter, node: torch.fx.Node) -> object:
	assert not node.kwargs, "getitem kwargs not supported"
	tensor, dims = map_arg(node.args, lambda arg: ctx.env[arg])
	assert isinstance(tensor, ast.AST)
	dims = [*dims] # pyright: ignore[reportGeneralTypeIssues,reportOptionalIterable]
	assert {dims} == {range(len(dims))}, dims
	return expr_from_string(
	f"tl.permute({{tensor}}, {dims!r})",
	tensor=tensor,
	)

Add torch.stack support #524

Add torch.stack support #524

Uh oh!

Conversation

yf225 commented Aug 29, 2025

Uh oh!

Uh oh!

jansel left a comment

Choose a reason for hiding this comment

Uh oh!

yf225 Aug 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

yf225 Aug 31, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants