
Processing Utils: honor pre-built sub-processor kwargs in from_pretrained #45627

Open
javierdejesusda wants to merge 1 commit into huggingface:main from javierdejesusda:fix/autoprocessor-honor-tokenizer-kwarg-44987

Conversation

@javierdejesusda
Contributor

What does this PR do?

Addresses @ArthurZucker's follow-up on #44987. When a caller passes a pre-built sub-processor to AutoProcessor.from_pretrained — e.g. tokenizer=tok or bpe_tokenizer=tok — the instance is now used directly instead of being silently forwarded into the sub-loader calls.

Exact attribute names always take precedence (bpe_tokenizer=). For processors with a single sub-processor of a given modality, the canonical modality name (tokenizer=) is also accepted as an alias — this matches the reproducer in the issue comment, where the remote UniversalActionProcessor takes bpe_tokenizer but the caller passes tokenizer=.
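
For illustration, a minimal sketch of the two call shapes this is meant to honor; `tok` is assumed to be an already-built tokenizer instance obtained elsewhere, and the repo id is the one from the linked issue:

from transformers import AutoProcessor

# Exact attribute name always wins (the remote UniversalActionProcessor stores it as `bpe_tokenizer`).
p = AutoProcessor.from_pretrained("physical-intelligence/fast", bpe_tokenizer=tok, trust_remote_code=True)

# Canonical modality name, accepted because the processor has a single tokenizer-modality attribute.
p = AutoProcessor.from_pretrained("physical-intelligence/fast", tokenizer=tok, trust_remote_code=True)

assert p.bpe_tokenizer is tok  # the pre-built instance is used directly, not reloaded from disk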

The OP's zero-kwarg reproducer (AutoProcessor.from_pretrained("physical-intelligence/fast", trust_remote_code=True)) is a separate hub repo layout issue and is not addressed here.

Before submitting

Who can review?

cc @ArthurZucker @itazap

Processing Utils: honor pre-built sub-processor kwargs in from_pretrained

When a caller passes a pre-built sub-processor via kwargs to
`AutoProcessor.from_pretrained` (e.g. `tokenizer=tok` or `bpe_tokenizer=tok`),
use the instance directly instead of silently forwarding it into the
sub-loader calls. Exact attribute names take precedence; the canonical
modality name is also accepted as an alias when a single sub-processor has
that modality.
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

Comment on lines +1438 to +1452
def _pop_prebuilt_subprocessors(cls, kwargs: dict) -> dict:
"""Pop pre-built sub-processors from `kwargs` by exact attribute name, or by modality
alias (e.g. `tokenizer=` → `bpe_tokenizer`) when that modality is unambiguous.
"""
sub_processors = cls.get_attributes()
modality_counts = Counter(_get_modality_for_attribute(s) for s in sub_processors)
prebuilt = {}
for sub_processor_type in sub_processors:
modality = _get_modality_for_attribute(sub_processor_type)
instance = kwargs.pop(sub_processor_type, None)
if instance is None and modality != sub_processor_type and modality_counts[modality] == 1:
instance = kwargs.pop(modality, None)
if instance is not None:
prebuilt[sub_processor_type] = instance
return prebuilt
Member

@zucchini-nlp Apr 24, 2026


Not really sure about this. The error from the GH issue is due to old remote code, and we don't yet support Pi0-FAST natively in transformers. Also cc @yonigozlan, I guess you might have seen a similar issue when refactoring processor loading.

We're planning native support though, and waiting for the lerobot team to test and convert the configs correctly.

@javierdejesusda
Contributor Author


Thanks for taking a look, @zucchini-nlp!

Quick scope note: this PR isn't targeting the OP's zero-kwarg traceback (that's the hub layout / old remote code path you mentioned, which I agree is out of scope here and will be obsoleted by native support). It's targeting @ArthurZucker's follow-up comment on the issue:

p = AutoProcessor.from_pretrained("physical-intelligence/fast", tokenizer=tokenizer, trust_remote_code=True, use_fast=False)

"this does not work and it should!"

The underlying behavior is general to ProcessorMixin: when a caller supplies a pre-built sub-processor via kwargs (whether tokenizer= or the exact attribute name like bpe_tokenizer=), the instance is silently dropped and the loader tries to reload from disk anyway. Any processor with a non-primary tokenizer attribute runs into this, so native Pi0-FAST support wouldn't fix it on its own; it would just mean one fewer processor hitting it.
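
To make the "non-primary tokenizer attribute" case concrete, here's a minimal sketch; the class name, the commented-out repo id, and `tok` are placeholders, and it uses the classic `attributes` class-variable convention:

from transformers import ProcessorMixin

class MyActionProcessor(ProcessorMixin):
    # The only tokenizer-modality attribute is not named `tokenizer`.
    attributes = ["bpe_tokenizer"]
    bpe_tokenizer_class = "AutoTokenizer"

    def __init__(self, bpe_tokenizer):
        super().__init__(bpe_tokenizer=bpe_tokenizer)

# With this change, both calls would keep the pre-built instance instead of reloading it:
#   MyActionProcessor.from_pretrained("org/repo", bpe_tokenizer=tok)  # exact attribute name
#   MyActionProcessor.from_pretrained("org/repo", tokenizer=tok)      # modality alias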

That said, happy to defer fully. If you and @yonigozlan / @ArthurZucker feel this should wait (or be folded into the native support work, or handled differently), I'm glad to close or rescope; just let me know.

@zucchini-nlp
Member


Yeah, totally get it. Personally, I think we can deliberately not support it, since it's remote code and not v5-compatible, unless Arthur/Yoni have a different opinion.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

