Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS device #29439
Conversation
That is indeed a problem. I was not aware that `autocast` is not available for `mps`. We probably need to do a patch for this! I think we can use the `cpu` device even if the tensors are not on CPU, no?
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
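To illustrate the point in the comment above, here is a small sketch (not taken from the PR itself): `torch.autocast`'s `device_type` names the autocast context, not the device the tensors live on, so entering a `"cpu"` context with `enabled=False` is safe even around MPS tensors.

```python
import torch

# Sketch illustrating the reviewer's suggestion: a "cpu" autocast context
# with enabled=False merely opts this region out of autocasting; it does
# not move or convert the tensors, which can live on MPS.
device = "mps" if torch.backends.mps.is_available() else "cpu"
x = torch.randn(4, 8, device=device)

with torch.autocast(device_type="cpu", enabled=False):
    # Computed in full precision on whatever device x is on.
    y = x @ x.T
```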
LGTM, thank you for the prompt fix. I have not tested this with …
Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS device (huggingface#29439)

* Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS devices

* Update src/transformers/models/gemma/modeling_gemma.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update llama and gemma rope use cpu in mps device

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
What does this PR do?
Fixes #29431
The regression on MPS devices was introduced by the merge of #29285 in version 4.38.2: `torch.autocast` does not support the `mps` device type, so wrapping the rotary embedding computation in an autocast context crashes on Apple Silicon.
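For context, below is a minimal sketch of the approach, paraphrased from the PR: when the input tensor lives on MPS, fall back to the `"cpu"` autocast device type (with autocast disabled), which keeps the computation on the original device. The helper name `rotary_forward` and its signature are illustrative, not the actual transformers API; the real change lives in the `RotaryEmbedding` classes of `modeling_llama.py` and `modeling_gemma.py`.

```python
import torch

# Sketch of the fix described in this PR (not a verbatim copy of the
# transformers source). torch.autocast rejects "mps" as a device_type,
# so we substitute "cpu"; since enabled=False, this only disables
# autocasting and does not move any tensors off the MPS device.
def rotary_forward(inv_freq_expanded, position_ids_expanded, x):
    device_type = x.device.type
    device_type = device_type if isinstance(device_type, str) and device_type != "mps" else "cpu"
    with torch.autocast(device_type=device_type, enabled=False):
        # Force fp32 so cos/sin are computed in full precision.
        freqs = (inv_freq_expanded.float() @ position_ids_expanded.float()).transpose(1, 2)
        emb = torch.cat((freqs, freqs), dim=-1)
        return emb.cos(), emb.sin()
```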
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.