Per https://pytorch.org/docs/stable/torch.compiler_profiling_torch_compile.html#working-around-cuda-graph-profiling-issues, some profiler initialization may be needed when CUDA graphs are in use.
We are not yet using CUDA graphs, but the benchmarking code should invoke this initialization at the start of execution anyway. That way, if we add a benchmark that wraps a CUDA graph around something nvFuser produces, or if we start using graphs internally down the road, we won't hit surprising profiling issues. A sketch of what that initialization could look like is below.
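A minimal sketch of the idea, not the exact code in this change: warm up the profiler once before any benchmarks run. The `_init_for_cuda_graphs` helper is the one the linked docs mention; the `hasattr` guard and the empty-profiler-context fallback are assumptions added here for robustness, as is the `init_profiler_for_cuda_graphs` function name.

```python
import torch


def init_profiler_for_cuda_graphs() -> None:
    """Initialize profiler state once, per the workaround in the PyTorch docs.

    Hypothetical helper: guards against older PyTorch versions that may not
    expose the private helper referenced by the docs.
    """
    utils = getattr(torch.profiler, "_utils", None)
    if utils is not None and hasattr(utils, "_init_for_cuda_graphs"):
        utils._init_for_cuda_graphs()
    else:
        # Fallback assumption: a single empty profiling context also forces the
        # profiler/CUPTI initialization to happen before any graph capture.
        with torch.profiler.profile():
            pass


if __name__ == "__main__":
    init_profiler_for_cuda_graphs()
    # ... run benchmarks ...
```

Calling this unconditionally at benchmark startup is cheap, so there is no need to gate it on whether CUDA graphs are actually enabled.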