cyyever
Contributor

@cyyever cyyever commented Sep 17, 2025

What does this PR do?

The dropout argument is not used in eager_paged_attention_forward.

@cyyever cyyever force-pushed the dropout_atten branch 2 times, most recently from c940b1b to e170575, on September 17, 2025 02:23
@Rocketknight1
Member

cc @Cyrilvallez for attention change

@Cyrilvallez
Member

Hey! This change does not make sense: paged attention is inference-only (it is tightly coupled to the Cache), and we never use dropout during inference.
cc @ArthurZucker or @remi-or in case I'm mistaken, but I don't think we ever want to use paged attention for training!
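
To illustrate the point about inference, here is a minimal, library-free sketch of inverted dropout. It is not the transformers implementation: the `dropout` helper, its parameters, and the `weights` list are hypothetical names chosen for this example. It only shows that with `training=False` the dropout rate has no effect, which is why an inference-only attention path has no use for a dropout argument.

```python
import random


def dropout(values, p=0.5, training=True, rng=random):
    """Inverted dropout over a flat list of floats (illustrative sketch only)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    # Inference: dropout is the identity, so the rate p is irrelevant.
    if not training or p == 0.0:
        return list(values)
    scale = 1.0 / (1.0 - p)
    # Training: zero each value with probability p and rescale the survivors
    # so the expected value of each element is unchanged.
    return [0.0 if rng.random() < p else v * scale for v in values]


# Hypothetical attention weights for a single query position.
weights = [0.1, 0.2, 0.3, 0.4]

# At inference time the output equals the input for any dropout rate.
assert dropout(weights, p=0.5, training=False) == weights
assert dropout(weights, p=0.9, training=False) == weights
```

Since the inference branch returns the input unchanged, accepting a dropout rate in an inference-only function would only add a parameter that is never applied.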

@cyyever cyyever marked this pull request as draft September 17, 2025 16:31
@cyyever cyyever marked this pull request as ready for review September 17, 2025 16:42
@cyyever
Contributor Author

cyyever commented Sep 17, 2025

@Cyrilvallez That is quite reasonable, so we should ignore dropout.

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>
@cyyever cyyever changed the title from "Use dropout in eager_paged_attention_forward" to "Don't list dropout in eager_paged_attention_forward" Sep 17, 2025
Collaborator

@ArthurZucker ArthurZucker left a comment

sure!

@ArthurZucker ArthurZucker enabled auto-merge (squash) September 18, 2025 08:57
@ArthurZucker ArthurZucker merged commit 438343d into huggingface:main Sep 18, 2025
23 checks passed
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@cyyever cyyever deleted the dropout_atten branch September 18, 2025 10:22
ErfanBaghaei pushed a commit to ErfanBaghaei/transformers that referenced this pull request Sep 25, 2025
Remove dropout argument

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>
vijayabhaskar-ev pushed a commit to vijayabhaskar-ev/transformers that referenced this pull request Oct 2, 2025
Remove dropout argument

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>
yuchenxie4645 pushed a commit to yuchenxie4645/transformers that referenced this pull request Oct 4, 2025
Remove dropout argument

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>
5 participants