How to cache the compilation result? #43

huntzhan · 2023-12-12T09:56:09Z

torch.compile always re-compiles a function from scratch in a new Python session, which takes a lot of time.
I'm wondering if there's a way to cache the compilation result in the file system (like gcc/clang) to speed up the development & debugging process.
@Chillee

gpt-fast/generate.py

Lines 16 to 18 in db7b273

    
           torch._inductor.config.coordinate_descent_tuning = True 
        
           torch._inductor.config.triton.unique_kernel_names = True 
        
           torch._inductor.config.fx_graph_cache = True # Experimental feature to reduce compilation times, will be on by default in future

The text was updated successfully, but these errors were encountered:

Chillee · 2023-12-17T01:56:22Z

This is currently an issue we're aware of, unfortunately. In theory, it's possible to use AOTInductor https://www.youtube.com/watch?v=w7d4oWzwZ0c to completely AOT compile everything, however it's somewhat finicky to use.

We also have some plans to offer an easier way to cache compilation results.

To be clear, a number of components should already be cached on recompile - triton autotuning decisions, inductor compilation, etc. It typically takes me on the order of 30-40 seconds for a warm recompile, although we should certainly try to drive this down even further.

huntzhan · 2023-12-18T14:58:15Z

thanks for reply.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to cache the compilation result? #43

How to cache the compilation result? #43

huntzhan commented Dec 12, 2023 •

edited

Loading

Chillee commented Dec 17, 2023

huntzhan commented Dec 18, 2023

How to cache the compilation result? #43

How to cache the compilation result? #43

Comments

huntzhan commented Dec 12, 2023 • edited Loading

Chillee commented Dec 17, 2023

huntzhan commented Dec 18, 2023

huntzhan commented Dec 12, 2023 •

edited

Loading