@DenisVieriu97
Collaborator

  • Fix the FP16 clamp issue (FP32 and FP16 are not broadcast compatible)
  • Fix clamp (cached graph nodes were previously replaced with the cast version)
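The mismatch described in the bullets above can be reproduced with a short check. This is a minimal sketch, not the test added by this PR: the tensor values and variable names are illustrative assumptions. It clamps an FP32 input with FP16 `min`/`max` tensors on CPU, and compares against MPS only when that backend is available.

```python
import torch

# FP32 input with FP16 bounds: the dtype mismatch this PR addresses.
# (Values are illustrative; they are not taken from the PR's own tests.)
x = torch.tensor([-2.0, 2.5, 3.0, 20.0], dtype=torch.float32)
lo = torch.full((4,), 1.0, dtype=torch.float16)
hi = torch.full((4,), 10.0, dtype=torch.float16)

# CPU reference: type promotion is expected to yield an FP32 result.
cpu_out = torch.clamp(x, min=lo, max=hi)

# Compare against MPS only when the backend is present on this machine.
if torch.backends.mps.is_available():
    mps_out = torch.clamp(x.to("mps"), min=lo.to("mps"), max=hi.to("mps"))
    torch.testing.assert_close(mps_out.cpu(), cpu_out)
```

On machines without MPS the comparison is skipped, so the sketch only exercises the CPU reference path there.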

@DenisVieriu97 DenisVieriu97 added the ciflow/mps Run MPS tests (subset of trunk) label Jul 21, 2023
@DenisVieriu97 DenisVieriu97 requested a review from razarmehr July 21, 2023 16:07
@DenisVieriu97 DenisVieriu97 requested a review from kulinseth as a code owner July 21, 2023 16:07
@pytorch-bot

pytorch-bot bot commented Jul 21, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105747

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 1eae4c3:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: mps Release notes category label Jul 21, 2023
@DenisVieriu97 DenisVieriu97 changed the title Fix MPS clamp issue with FP16 max/min and FP32 input [MPS] Fix MPS clamp issue with FP16 max/min and FP32 input Jul 21, 2023
@DenisVieriu97 DenisVieriu97 changed the title [MPS] Fix MPS clamp issue with FP16 max/min and FP32 input [MPS] Fix MPS clamp issue with different dtypes between input and min/max tensors Jul 21, 2023
@janeyx99 janeyx99 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jul 24, 2023
@DenisVieriu97
Collaborator Author

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

kulinseth and others added 5 commits August 1, 2023 23:53
* Fix the FP16 clamp issue.

* Fix clamp (cached graph nodes were previously replaced with the cast version)

---------

Co-authored-by: Denis Vieriu <dvieriu@apple.com>
@pytorchmergebot
Collaborator

Successfully rebased clamp_fix onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout clamp_fix && git pull --rebase)

@DenisVieriu97
Collaborator Author

@pytorchbot merge -f "MPS tests & lint are green"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@malfet
Contributor

malfet commented Aug 16, 2023

@kulinseth, @DenisVieriu97 just curious, have you checked whether this matches CPU/CUDA behavior? I.e., I assume it should do something like that only if AMP is enabled, shouldn't it?

@kulinseth
Collaborator

@kulinseth, @DenisVieriu97 just curious, have you checked whether this matches CPU/CUDA behavior? I.e., I assume it should do something like that only if AMP is enabled, shouldn't it?

We checked against CPU, @malfet. The issue also affects the correctness of a few language models.

Labels

ciflow/mps Run MPS tests (subset of trunk) Merged open source release notes: mps Release notes category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

6 participants