🚨 [generate] update paligemma mask updates (and other assisted generation-related fixes) #40917
Merged
32 commits
0484787 tmp (gante)
4990c46 fix modular inheritance (gante)
5424b99 nit (gante)
e2f6550 paligemma 1 doesn't have swa (gante)
d8d02ff use same pattern as in models with hybrid layers (gante)
93ff456 PR comments (gante)
132e35f helium also needs layer_typed (bc it relies on gemma) (gante)
e28a3ed paligemma/gemma3: same mask creation fn in fwd and generate (gante)
76613f0 propagate changes to helium (gemma-based) (gante)
2c188f9 tmp commit (gante)
5beb976 slow paligemma tests passing, let's see what breaks (gante)
6f2d326 fix test_left_padding_compatibility (gante)
bef1beb tmp commit (gante)
8b0f34d tmp commit (gante)
90f165a rebase error (gante)
8979531 docs (gante)
aac2956 reduce diff (gante)
15d99a2 like this? (gante)
46362b4 t5gemma (gante)
a9d71e1 better comment (gante)
8686d39 shorter diff (gante)
a3ac80c exception (gante)
40eed3d ffs type (gante)
c725120 optional (gante)
a74fb93 shorter modular_gemma.py (gante)
b79e312 helium model actually needs no changes -- the tester is the issue (gante)
f916d7c t5gemma modular config (gante)
5e518c8 a few more modular; paligemma BC (gante)
b0a9d50 fix processor issues? (gante)
c0c89b2 rm config exception (gante)
3fcf7a7 lift warning in gemma (gante)
9bf860d Merge branch 'main' into flaky_assisted_gen_tests (gante)
wow, this makes so much sense. I wonder how gemma3 worked prev, afair we didn't have a flag for defining bidirectional attention at release time
I actually took it from gemma3 🤗 Most of the changes here are gemma3-inspired
looks like it was added recently. Prev it used `is_causal = True` 🙈
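For context on this thread, here is a minimal sketch of the difference between a purely causal mask and one that also allows bidirectional attention over a leading block of tokens (for example, image tokens). It is a plain-Python illustration only, not the code from this PR or from the library; `build_mask` and `prefix_len` are hypothetical names chosen for the example.

```python
def build_mask(seq_len, prefix_len=0):
    """Return a seq_len x seq_len boolean mask.

    mask[q][k] is True when query position q may attend to key position k.
    Hypothetical helper for illustration; not a transformers API.
    """
    mask = []
    for q in range(seq_len):
        row = []
        for k in range(seq_len):
            # Standard causal rule: a token sees itself and earlier tokens.
            causal = k <= q
            # Assumed bidirectional rule: tokens inside the prefix block
            # see every other token in that block, including later ones.
            in_prefix = q < prefix_len and k < prefix_len
            row.append(causal or in_prefix)
        mask.append(row)
    return mask


causal_only = build_mask(4)                 # lower-triangular mask
with_prefix = build_mask(4, prefix_len=2)   # first 2 tokens attend both ways
```

With `prefix_len=0` the result is the ordinary lower-triangular mask, which is roughly what a hard-coded causal setting gives; with a non-zero prefix, the leading block becomes fully visible to itself while the remaining tokens stay causal.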