[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

anijain2305 · 2025-05-13T18:00:08Z

This helps with torch.compile compilation latency. Avoiding unnecessary computation should also lead to a slightly improved eager latency.

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

This helps with torch.compile compilation lantency. Avoiding unnecessary computation should also lead to a slightly improved eager latency.

anijain2305 · 2025-05-13T18:00:26Z

cc @sayakpaul

sayakpaul

Nice! Thanks for this. Do you want to also include the speedups you obtained with this patch?

sayakpaul · 2025-05-14T04:39:04Z

Along with this, do we think using regional compilation (cc: huggingface/accelerate#3529) could also benefit the compilation latency?

HuggingFaceDocBuilderDev · 2025-05-14T04:44:33Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

anijain2305 · 2025-05-14T05:19:11Z

I am going through a stack of PRs to tackle compilation time. I will update once the stack lands. Overall the compile time is roughly 280 seconds, and I am able to take off roughly 30 seconds till now .

Regional compilation will definitely benefit this model. @StrongerXi has the latest numbers once.

It seems that workflow needs some approval?

sayakpaul · 2025-05-14T05:39:30Z

@bot /style

github-actions · 2025-05-14T05:40:21Z

Style fixes have been applied. View the workflow run here.

StrongerXi · 2025-05-14T17:05:55Z

Oh yeah regional compilation would speed things up massively, when I tested a while back it went from 300s to 30s. Might be worth offering a similar api in diffusers and transformers?

anijain2305 · 2025-05-15T03:34:10Z

@DN6 a gentle ping in case this missed through the cracks

sayakpaul · 2025-05-15T04:01:33Z

Oh yeah regional compilation would speed things up massively, when I tested a while back it went from 300s to 30s. Might be worth offering a similar api in diffusers and transformers?

@StrongerXi #11556

[gguf] Refactor __torch_function__ to avoid unnecessary computation

cd06c87

This helps with torch.compile compilation lantency. Avoiding unnecessary computation should also lead to a slightly improved eager latency.

yiyixuxu requested a review from DN6 May 13, 2025 18:05

StrongerXi mentioned this pull request May 13, 2025

[ued] Slow start up time for torch.compile on GGUF Auraflow pytorch/pytorch#150706

Closed

sayakpaul approved these changes May 14, 2025

View reviewed changes

Merge branch 'main' into improve-gguf-tf

5e6d0d4

Apply style fixes

f28688a

Merge branch 'main' into improve-gguf-tf

911d2a7

DN6 approved these changes May 15, 2025

View reviewed changes

DN6 merged commit 3a6caba into huggingface:main May 15, 2025
12 checks passed

DN6 added the roadmap Add to current release roadmap label Jun 5, 2025

github-project-automation bot added this to Diffusers Roadmap 0.36 Jun 5, 2025

github-project-automation bot moved this to In Progress in Diffusers Roadmap 0.36 Jun 5, 2025

DN6 moved this from In Progress to Done in Diffusers Roadmap 0.36 Jun 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

Uh oh!

anijain2305 commented May 13, 2025

Uh oh!

anijain2305 commented May 13, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul commented May 14, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 14, 2025

Uh oh!

anijain2305 commented May 14, 2025

Uh oh!

sayakpaul commented May 14, 2025

Uh oh!

github-actions bot commented May 14, 2025

Uh oh!

StrongerXi commented May 14, 2025

Uh oh!

anijain2305 commented May 15, 2025

Uh oh!

sayakpaul commented May 15, 2025

Uh oh!

Uh oh!

Uh oh!

[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

Uh oh!

Conversation

anijain2305 commented May 13, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

anijain2305 commented May 13, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented May 14, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 14, 2025

Uh oh!

anijain2305 commented May 14, 2025

Uh oh!

sayakpaul commented May 14, 2025

Uh oh!

github-actions bot commented May 14, 2025

Uh oh!

StrongerXi commented May 14, 2025

Uh oh!

anijain2305 commented May 15, 2025

Uh oh!

sayakpaul commented May 15, 2025

Uh oh!

Uh oh!

Uh oh!