You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I noticed a bug in the new chunked prefill code: the implementation does not check whether or not the forward method is compilable, as is done elsewhere in the code.
An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)
Reproduction
Any model that should not be compiled will be compiled when using GenerationConfig that contains prefill_chunk_size > 0, even if using disable_compile=True
Expected behavior
disable_compile=True should cause the model NOT to get compiled, even when using prefill_chunk_size
The text was updated successfully, but these errors were encountered:
System Info
Hi, I noticed a bug in the new chunked prefill code: the implementation does not check whether or not the forward method is compilable, as is done elsewhere in the code.
Specifically,
transformers/src/transformers/generation/utils.py
Line 4910 in d231f5a
should include compilability checking code like found at
transformers/src/transformers/generation/utils.py
Line 3437 in d231f5a
This error was encountered on Version: 4.51.3 but the code links above reference the main branch. Thanks!
Who can help?
@gante
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Any model that should not be compiled will be compiled when using GenerationConfig that contains prefill_chunk_size > 0, even if using disable_compile=True
Expected behavior
disable_compile=True should cause the model NOT to get compiled, even when using prefill_chunk_size
The text was updated successfully, but these errors were encountered: