
executor should be able to put checks into prologue trace #263

Open
jjsjann123 opened this issue Apr 24, 2024 · 0 comments

🚀 Feature

Dynamic constraints require inserting executor-specific checks into the prologue trace, since different backends have different dynamic constraints.

A quick example: given a program to be compiled, where we expect the reduction axis to be a direct input to the program:

def foo(a, reducedim):
    return a.sum(reducedim)

The computation trace looks like this:

import thunder
import thunder.torch as ltorch
import torch
from thunder.executors.torchex import no_autocast

@torch.no_grad()
@no_autocast()
def computation(a, i0, i1):
  # a: "cuda:0 f32[8, 16, 32]"
  # i0: "int 0"
  # i1: "int 1"
  t2 = ltorch.sum(a, (i0, i1), False, dtype=None)  # t2: "cuda:0 f32[32]"
    # t2 = prims.sum(a, (i0, i1))  # t2: "cuda:0 f32[32]"
  return t2

What happens next depends on which backend claims the ltorch.sum. For torchex, since ATen can handle an arbitrary reducedim given at runtime, we can reuse the cache and there's no need to insert any check on arg[1].
By contrast, nvfuserex requires the program to bake the reduction axis in as a compile-time constant, so we'd want to insert that as part of the prologue trace checks.
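To make this concrete, here is a minimal sketch (invented names, not the actual thunder API) of the kind of executor-specific check nvfuserex would need to contribute to the prologue trace. The prologue runs before the cached computation trace and decides whether the cached compilation is still valid:

```python
# Hypothetical sketch: nvfuserex compiled the fusion for reduction dims
# (0, 1), so those values are baked in as compile-time constants.
COMPILED_REDUCTION_DIMS = (0, 1)

def prologue_check(i0, i1):
    """Return True if the cached computation trace can be reused.

    torchex would skip this check entirely, since ATen accepts arbitrary
    reduction dims at runtime; nvfuserex needs the runtime values to match
    the constants it compiled against, otherwise recompilation is required.
    """
    return (i0, i1) == COMPILED_REDUCTION_DIMS

print(prologue_check(0, 1))  # cache hit: reuse the compiled trace
print(prologue_check(1, 2))  # cache miss: recompile for the new dims
```

The point is that only the claiming executor knows whether such a check is needed, which is why it should be able to push checks into the prologue trace itself.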

Alternatives

Alternative 0: we can converge on the most conservative backends and apply a simpler caching strategy at the primitive level. In the example above, we would simply require the reduction axis to stay a compile-time constant across the board, even though the cache entry could be reused for some executors. This would unfortunately give us some negative cache hits, but it would be easier to plumb through.
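A rough sketch of what Alternative 0 implies (names are invented for illustration): the cache key always includes the reduction dims, so even a torchex-only program recompiles when the dims change, although ATen could have reused one entry:

```python
# Conservative caching sketch: reducedim is always part of the cache key,
# regardless of which executor claims the reduction.
cache = {}

def cached_compile(shape, dtype, reducedim):
    key = (shape, dtype, reducedim)  # dims specialize the entry unconditionally
    if key not in cache:
        # Placeholder for the actual trace compilation.
        cache[key] = f"compiled trace for dims={reducedim}"
    return cache[key]

cached_compile((8, 16, 32), "f32", (0, 1))
cached_compile((8, 16, 32), "f32", (0, 2))  # second entry: a negative cache
                                            # hit for torchex, which could
                                            # have reused the first one
print(len(cache))
```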

Alternative 1: thunder as a system can establish a caching strategy. When a backend sees a cache requirement on a certain op that it cannot fulfill, the backend could simply reject the operation.
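Alternative 1 could look roughly like this (a hypothetical interface, not thunder's actual executor protocol): the system attaches a cache requirement to the op, and claiming walks the executor list until one accepts:

```python
# Hypothetical executor-rejection sketch for Alternative 1.
class NvFuserEx:
    def claim(self, op, requirement):
        # nvFuser bakes reduction dims in at compile time, so it rejects
        # any op whose cache requirement says the dims may vary at runtime.
        return requirement != "dynamic_reduction_dims"

class TorchEx:
    def claim(self, op, requirement):
        # ATen handles runtime reduction dims, so any requirement is fine.
        return True

executors = [NvFuserEx(), TorchEx()]  # priority order
claimed_by = next(ex for ex in executors
                  if ex.claim("sum", "dynamic_reduction_dims"))
print(type(claimed_by).__name__)  # the op falls through to TorchEx
```

The design trade-off versus Alternative 0 is that the cache stays maximally reusable, at the cost of threading requirement metadata through the claiming machinery.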
