fix: prevent input side-effects in processor text args #36866

Open
nph4rd wants to merge 4 commits into main

Conversation

@nph4rd commented Mar 20, 2025

What does this PR do?

Fixes input side-effects in multiple processor classes when the text input is a list.

Fixes #36865
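
For illustration, here is a minimal sketch of the side effect and of the proposed fix (simplified stand-in code, not the actual processor implementation; the function names, the "<image>" token, and the expansion factor are placeholders):

# A processor that expands image tokens by writing back into `text`
# mutates the caller's list whenever `text` is passed in as a list.
def expand_prompts(text, image_token="<image>", num_tokens=4):
    for i in range(len(text)):
        text[i] = text[i].replace(image_token, image_token * num_tokens)
    return text

prompts = ["describe <image>", "caption <image>"]
expand_prompts(prompts)
print(prompts)  # the caller's original list has been modified

# The fix proposed here copies the list before rewriting it:
def expand_prompts_fixed(text, image_token="<image>", num_tokens=4):
    text = text.copy()  # prevents the input side effect
    for i in range(len(text)):
        text[i] = text[i].replace(image_token, image_token * num_tokens)
    return text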

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @qubvel


@github-actions github-actions bot marked this pull request as draft March 20, 2025 17:57

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

@nph4rd nph4rd marked this pull request as ready for review March 20, 2025 18:04
@github-actions github-actions bot requested review from molbap and yonigozlan March 20, 2025 18:04
@zucchini-nlp (Member) commented:

To me it seems the problem is in how Qwen2-VL is written. Other VLMs do not overwrite the text; they save the expanded prompts in a new list. So rewriting Qwen2-VL is preferable to adding a copy() in all processors.

@nph4rd (Author) commented Mar 20, 2025

@zucchini-nlp - thanks for the comment! 😃

Just to be clear: I added the copy() in all the processors that have a pattern like:

for i, sample in enumerate(text):
    text[i] = sample.replace(...)

This isn't limited to Qwen-based processors. One example: Pixtral.

So, if I understand correctly, you're saying it'd be preferable to change all these processors to something like:

expanded_samples = []
for sample in text:
    expanded_sample = sample.replace(...)
    expanded_samples.append(expanded_sample)

Correct?

I must say, I don't find that cleaner 🤔, and if it's for the sake of consistency then I'd suggest instead adding the .copy() to the rest of the VLM processors. That would mean fewer changed lines and better readability IMO -- e.g. no variable name change (see the rough sketch below). However, if what you're saying is the consensus, I'd be happy to make the changes.
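
For concreteness, a rough sketch of the two options being discussed (placeholder names; the image token and the expansion factor are illustrative, not the real processor code):

# Option A: keep the in-place loop, but copy the input first (this PR's approach).
def preprocess_with_copy(text, image_token="<image>"):
    text = text.copy()  # the only added line; protects the caller's list
    for i, sample in enumerate(text):
        text[i] = sample.replace(image_token, image_token * 4)
    return text

# Option B: build a new list of expanded prompts (the pattern other VLM processors use).
def preprocess_with_new_list(text, image_token="<image>"):
    expanded_samples = []
    for sample in text:
        expanded_samples.append(sample.replace(image_token, image_token * 4))
    return expanded_samples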

@nph4rd (Author) commented Mar 20, 2025

That would mean fewer changed lines

Ha! 🤔 On second thought, I don't think that's true. My bad.

Still, it seems a bit less readable to me.

Again, happy to go either way if that's your suggestion.

Successfully merging this pull request may close these issues.

Multiple processor classes have input side-effects