Skip to content

v0.2.0: add implementation based on torch.compile

Latest
Compare
Choose a tag to compare
@arogozhnikov arogozhnikov released this 05 Dec 08:33
· 6 commits to main since this release
  • new implementation is faster, but not cudagraph-compatible
  • old implementation is moved to cudagraph.py
  • requires torch >= 2.0