Conversation
_valid_auto_compile_criteria() gates auto-compilation on device type but excluded Neuron, so torch.compile never triggers automatically even when StaticCache is used. Add "neuron" to the valid hardware list alongside "cuda" and "xpu". Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Pull request overview
This PR adds "neuron" to the list of valid hardware device types in _valid_auto_compile_criteria(), enabling auto-compilation (torch.compile) on AWS Neuron (Trainium/Inferentia) devices when a compilable cache (e.g., StaticCache) is used. This is one item from the broader issue #44742 tracking static-shape generation support for Neuron.
Changes:
- Added "neuron" to the valid_hardware device type check in _valid_auto_compile_criteria(), alongside "cuda" and "xpu".
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
```diff
  # Base logic
- valid_hardware = self.device.type in ["cuda", "xpu"] or bool(
+ valid_hardware = self.device.type in ["cuda", "xpu", "neuron"] or bool(
```
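For readers skimming the thread, the gate being changed can be sketched as below. The names (valid_auto_compile_criteria, is_compileable) mirror the PR's context in transformers' generation utilities, but this is a simplified standalone reimplementation for illustration, not the library's actual code:

```python
# Simplified sketch of the auto-compile gate discussed in this PR.
# FakeDevice/FakeCache are stand-ins for torch.device and a transformers
# cache object; only the attributes the gate reads are modeled.
from dataclasses import dataclass


@dataclass
class FakeDevice:
    type: str  # e.g. "cuda", "xpu", "neuron", "cpu"


@dataclass
class FakeCache:
    is_compileable: bool  # StaticCache sets this to True


def valid_auto_compile_criteria(device: FakeDevice, cache: FakeCache) -> bool:
    # After this PR, "neuron" joins "cuda" and "xpu" as compile-capable devices.
    valid_hardware = device.type in ["cuda", "xpu", "neuron"]
    # Auto-compilation additionally requires a compileable (static-shaped) cache,
    # which is why a non-static cache "leaves you out of luck either way".
    return valid_hardware and cache.is_compileable


print(valid_auto_compile_criteria(FakeDevice("neuron"), FakeCache(True)))   # True
print(valid_auto_compile_criteria(FakeDevice("neuron"), FakeCache(False)))  # False
print(valid_auto_compile_criteria(FakeDevice("cpu"), FakeCache(True)))      # False
```

Note that both conditions must hold: device type alone never triggers compilation without a compileable cache.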
isn't it dependent on adding a full static-shape generation loop first?
Hmm not sure what you mean? I think this is a general list of devices that support compile OOB - you can also hack CPU etc with some private flag iirc
The static shapes etc come later in the input preparation
will it not auto-compile, and then error out down the line due to dynamic inputs? From what I understood this device cannot support full compile without complete static shapes
See the line below: the condition combines valid hardware + cache --> if you don't set a static cache (and hence all the static prep), you are out of luck either way
There is no real dynamic thing going on
Ok, discussed internally, now I understand it: with this we enable compile for Neuron when we set static caches, but there are still dynamic traces within the whole generate loop, so it potentially doesn't make sense to add yet - we should rather wait for feature completeness before adding this. That's at least what I understood now.
For testing purposes, we can enable via the private flags within the compile config
Summary
_valid_auto_compile_criteria() gates auto-compilation on device.type in ["cuda", "xpu"], excluding Neuron devices. This means torch.compile never triggers automatically on Neuron even when StaticCache is used (which sets is_compileable = True). Add "neuron" to the valid hardware list so that Neuron devices benefit from auto-compilation like CUDA and XPU.

Addresses the "Auto-compilation gate missing Neuron" item in #44742.
🤖 Generated with Claude Code
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>