README

This repository collects techniques that can speed up your training in PyTorch.

It includes some tricks and demos.

Pull requests are sincerely welcome.

`torch.compile()`

https://pytorch.org/docs/stable/generated/torch.compile.html#torch-compile

import torch

model = GPT()  # GPT stands in for any nn.Module
model = torch.compile(model)  # torch.compile lives in the top-level torch namespace

Important: `torch.compile()` requires PyTorch >= 2.0.

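`torch.compile` is to PyTorch code roughly what gcc is to C++: it compiles the model instead of running it operation by operation through the Python interpreter. A compiled model can run significantly faster. The speedup comes mainly from reduced Python overhead and fewer GPU memory reads and writes.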
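
A minimal sketch of guarding the call on the version requirement above. The helper name `compile_if_supported` and the toy model are illustrative assumptions, not part of this repository; only `torch.compile` itself is a PyTorch API.

```python
import torch
from torch import nn


def compile_if_supported(model: nn.Module, **compile_kwargs) -> nn.Module:
    """Return torch.compile(model) on PyTorch >= 2.0, else the model unchanged.

    Hypothetical helper for illustration only.
    """
    major = int(torch.__version__.split(".")[0])
    if major >= 2 and hasattr(torch, "compile"):
        # Extra keyword arguments (e.g. backend=...) are forwarded as-is.
        return torch.compile(model, **compile_kwargs)
    return model


# Toy model standing in for a real network such as GPT().
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))

# Wrapping is cheap; the actual compilation happens on the first forward call.
compiled_model = compile_if_supported(model)
```

Because compilation is deferred to the first call, that call is slow; time your training loop only after a warm-up step.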

Gradient Checkpoint

https://pytorch.org/docs/stable/checkpoint.html

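from torch import nn
from torch.utils.checkpoint import checkpoint_sequential

model = nn.Sequential(...)
input_var = checkpoint_sequential(model, chunks, input_var)  # chunks = number of checkpointed segments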

Trades time for GPU memory: activations are recomputed during the backward pass instead of being stored.

Try this to "slow down" a little when your other speedups push you past the available GPU memory.
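
A small sketch of the idea, assuming a toy stack of linear layers and PyTorch >= 2.0 (the model, sizes, and the `grads` helper are made up for illustration). It checks that checkpointing changes only when activations are computed, not the resulting gradients.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint_sequential

torch.manual_seed(0)

# Toy model: 8 blocks of Linear + ReLU (an assumption for this demo).
model = nn.Sequential(
    *[nn.Sequential(nn.Linear(64, 64), nn.ReLU()) for _ in range(8)]
)
x = torch.randn(4, 64)


def grads(use_checkpoint: bool, segments: int = 2):
    """Run one forward/backward pass and return copies of all parameter gradients."""
    model.zero_grad(set_to_none=True)
    if use_checkpoint:
        # Only segment boundaries are kept; activations inside each segment
        # are recomputed during backward.
        out = checkpoint_sequential(model, segments, x, use_reentrant=False)
    else:
        out = model(x)
    out.sum().backward()
    return [p.grad.clone() for p in model.parameters()]


plain = grads(use_checkpoint=False)
ckpt = grads(use_checkpoint=True)
same = all(torch.allclose(a, b) for a, b in zip(plain, ckpt))
```

More segments keep more activations in memory but reduce how much work each recomputation step has to redo; the best value depends on the model and is worth measuring.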

devilran6/accelerate_pytorch
