MPS: Add amax and amin Ops with tests #79682

kulinseth · 2022-06-16T04:59:23Z

Add amax and amin with tests

Fixes #ISSUE_NUMBER

* Add amax and amin with tests

facebook-github-bot · 2022-06-16T04:59:32Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/79682
📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓Need help or want to give feedback on the CI? Visit our office hours

❌ 3 New Failures

As of commit 7a740ea (more details on the Dr. CI page):

Expand to see more

3/3 failures introduced in this PR

🕵️ 3 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu) (1/3)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

2022-06-16T05:54:07.1120232Z RuntimeError: CUDA error: an illegal memory access was encountered

2022-06-16T05:54:07.1116247Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_utils.py", line 1184, in set_rng_seed
2022-06-16T05:54:07.1116612Z     torch.manual_seed(seed)
2022-06-16T05:54:07.1117072Z   File "/opt/conda/lib/python3.7/site-packages/torch/random.py", line 40, in manual_seed
2022-06-16T05:54:07.1117449Z     torch.cuda.manual_seed_all(seed)
2022-06-16T05:54:07.1117937Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 113, in manual_seed_all
2022-06-16T05:54:07.1118291Z     _lazy_call(cb, seed_all=True)
2022-06-16T05:54:07.1118765Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 156, in _lazy_call
2022-06-16T05:54:07.1119103Z     callable()
2022-06-16T05:54:07.1119508Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 111, in cb
2022-06-16T05:54:07.1119876Z     default_generator.manual_seed(seed)
2022-06-16T05:54:07.1120232Z RuntimeError: CUDA error: an illegal memory access was encountered
2022-06-16T05:54:07.1120709Z CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
2022-06-16T05:54:07.1121136Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
2022-06-16T05:54:07.1121354Z 
2022-06-16T05:54:07.1121624Z ----------------------------------------------------------------------
2022-06-16T05:54:07.1122101Z Ran 125 tests in 27.992s
2022-06-16T05:54:07.1122290Z 
2022-06-16T05:54:07.1122444Z FAILED (errors=1, skipped=8, expected failures=3)
2022-06-16T05:54:07.1122646Z 
2022-06-16T05:54:07.1122773Z Generating XML reports...
2022-06-16T05:54:07.1280087Z Generated XML report: test-reports/python-unittest/test_ops/TEST-TestCommonCUDA-20220616055338.xml

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 3, 4, linux.4xlarge.nvidia.gpu) (2/3)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

2022-06-16T05:52:01.2918366Z RuntimeError: CUDA error: an illegal memory access was encountered

2022-06-16T05:52:01.2913635Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_utils.py", line 1184, in set_rng_seed
2022-06-16T05:52:01.2914090Z     torch.manual_seed(seed)
2022-06-16T05:52:01.2914597Z   File "/opt/conda/lib/python3.7/site-packages/torch/random.py", line 40, in manual_seed
2022-06-16T05:52:01.2915009Z     torch.cuda.manual_seed_all(seed)
2022-06-16T05:52:01.2915551Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 113, in manual_seed_all
2022-06-16T05:52:01.2915981Z     _lazy_call(cb, seed_all=True)
2022-06-16T05:52:01.2916596Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 156, in _lazy_call
2022-06-16T05:52:01.2917010Z     callable()
2022-06-16T05:52:01.2917491Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 111, in cb
2022-06-16T05:52:01.2917968Z     default_generator.manual_seed(seed)
2022-06-16T05:52:01.2918366Z RuntimeError: CUDA error: an illegal memory access was encountered
2022-06-16T05:52:01.2918900Z CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
2022-06-16T05:52:01.2919418Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
2022-06-16T05:52:01.2919666Z 
2022-06-16T05:52:01.2919949Z ----------------------------------------------------------------------
2022-06-16T05:52:01.2920343Z Ran 150 tests in 17.629s
2022-06-16T05:52:01.2920535Z 
2022-06-16T05:52:01.2920695Z FAILED (errors=1, expected failures=3)
2022-06-16T05:52:01.2920912Z 
2022-06-16T05:52:01.2921054Z Generating XML reports...
2022-06-16T05:52:01.3097733Z Generated XML report: test-reports/python-unittest/test_meta/TEST-TestMetaCUDA-20220616055143.xml

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu) (3/3)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

2022-06-16T05:55:02.2204251Z RuntimeError: CUDA error: an illegal memory access was encountered

2022-06-16T05:55:02.2200174Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_utils.py", line 1184, in set_rng_seed
2022-06-16T05:55:02.2200572Z     torch.manual_seed(seed)
2022-06-16T05:55:02.2201033Z   File "/opt/conda/lib/python3.7/site-packages/torch/random.py", line 40, in manual_seed
2022-06-16T05:55:02.2201393Z     torch.cuda.manual_seed_all(seed)
2022-06-16T05:55:02.2201892Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 113, in manual_seed_all
2022-06-16T05:55:02.2202272Z     _lazy_call(cb, seed_all=True)
2022-06-16T05:55:02.2202730Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 156, in _lazy_call
2022-06-16T05:55:02.2203082Z     callable()
2022-06-16T05:55:02.2203522Z   File "/opt/conda/lib/python3.7/site-packages/torch/cuda/random.py", line 111, in cb
2022-06-16T05:55:02.2203880Z     default_generator.manual_seed(seed)
2022-06-16T05:55:02.2204251Z RuntimeError: CUDA error: an illegal memory access was encountered
2022-06-16T05:55:02.2204733Z CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
2022-06-16T05:55:02.2205190Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
2022-06-16T05:55:02.2205394Z 
2022-06-16T05:55:02.2205669Z ----------------------------------------------------------------------
2022-06-16T05:55:02.2206013Z Ran 150 tests in 35.871s
2022-06-16T05:55:02.2206187Z 
2022-06-16T05:55:02.2206335Z FAILED (errors=1, expected failures=3)
2022-06-16T05:55:02.2206529Z 
2022-06-16T05:55:02.2206636Z Generating XML reports...
2022-06-16T05:55:02.2394693Z Generated XML report: test-reports/python-unittest/test_decomp/TEST-TestDecompCUDA-20220616055426.xml

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

kulinseth · 2022-06-16T14:13:26Z

======================================================================
[912](https://github.com/pytorch/pytorch/runs/6912900173?check_suite_focus=true#step:9:913)
ERROR [0.007s]: test_dtypes__masked_amin_cuda (__main__.TestCommonCUDA)
[913](https://github.com/pytorch/pytorch/runs/6912900173?check_suite_focus=true#step:9:914)
----------------------------------------------------------------------
[914](https://github.com/pytorch/pytorch/runs/6912900173?check_suite_focus=true#step:9:915)
Traceback (most recent call last):

The failure seems unrelated to this change.

albanD

Sounds good.
But we should move to generic testing to avoid these one-off version that don't cover most usecases

kulinseth · 2022-06-18T00:11:05Z

@pytorchbot merge

kulinseth · 2022-06-18T00:11:41Z

Sounds good.
But we should move to generic testing to avoid these one-off version that don't cover most usecases

Makes sense.

pytorchmergebot · 2022-06-18T00:14:00Z

@pytorchbot successfully started a merge job. Check the current status here

github-actions · 2022-06-18T00:14:43Z

Hey @kulinseth.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

Summary: * Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: #79682 Approved by: https://github.com/albanD Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/074dc7465e4626eb10dffe8fd44312c4637b9e99 Reviewed By: malfet Differential Revision: D37278931 fbshipit-source-id: ad62ef18b8cda07dd7563a12f1e05c354d3465a7

* Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#79682 Approved by: https://github.com/albanD

* MPS: Fixes (#78930) Cast integer to float in UnaryOps Add tensor dtype in key generation Enable FP16 scalars and use placeholder for alpha tensor in add/sum ops Fixes #ISSUE_NUMBER Pull Request resolved: #78930 Approved by: https://github.com/albanD * MPS: Binary cast fix by proper type promotion and remove spurious copy warning (#79185) Fixes #78019, #78020 Fixes #79185 Pull Request resolved: #79185 Approved by: https://github.com/albanD, https://github.com/razarmehr * MPS: add exponential op (#79188) Add exponential distribution Fixes #ISSUE_NUMBER Pull Request resolved: #79188 Approved by: https://github.com/razarmehr, https://github.com/albanD * [MPS] Delete unused vars from OperationUtils.mm Pull Request resolved: #79514 Approved by: https://github.com/kulinseth, https://github.com/albanD * [MPS] Fix getDefaultGenerator and copy_kernel_mps Returning reference to stack memory is really bad Pull Request resolved: #79515 Approved by: https://github.com/albanD * [MPS][BE]Do not use `new/delete[]` in `chainViewOperation` `std::array` will do just fine Pull Request resolved: #79516 Approved by: https://github.com/albanD * [MPS] Support stride of stride Fixes #79181 Pull Request resolved: #79521 Approved by: https://github.com/kulinseth * MPS: TopK raise an error if K>16 (#79677) * Error out in TopK when k>16. * Add a test case too. Fixes #78915 Pull Request resolved: #79677 Approved by: https://github.com/albanD * [MPS]: Add fix for squeezed input axes handling in BCE loss (#79676) Fixes #79527 Pull Request resolved: #79676 Approved by: https://github.com/razarmehr, https://github.com/albanD * MPS: Add amax and amin Ops with tests (#79682) * Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: #79682 Approved by: https://github.com/albanD * [MPS] Fix torch.uint8 support (#80049) `ScalarType.Byte` should be cast to `MPSDataTypeUInt8` And support for `torch.int8` as well as test those conversions in `TestMPS.test_to` Fixes #80006 Pull Request resolved: #80049 Approved by: https://github.com/albanD * [MPS] Fix binary ops between int32 tensor with int64 scalar (#80220) For some reason, tensor *op* scalar does not follow the normal binary promotion rules So cast output tensor to expected type if needed It seems that one should have casted input tensors to expected output tensor type, but it does not really work for boolean binary ops, so... Add output tensor type/shape to cached graph key Extend `TestMPS. test_add_scalars` to test for this regression Fixes #79835 Pull Request resolved: #80220 Approved by: https://github.com/albanD * [MPS] Add equal operator (#80195) Which is, in essence is composite of `eq`->`all`->`item` `native/mps/operators/Equal.cpp` is an almost verbatim copy of `native/cuda/Equal.cpp` Fix codegen by generating MPSFunctions headers Pull Request resolved: #80195 Approved by: https://github.com/albanD * [MPS] add `aten::normal.Tensor_float` `aten::normal.float_Tensor` `aten::normal.Tensor_Tensor` (#80297) Fixes #ISSUE_NUMBER Pull Request resolved: #80297 Approved by: https://github.com/albanD, https://github.com/kulinseth * [MPS] Add flip (#80214) Fixes #ISSUE_NUMBER Pull Request resolved: #80214 Approved by: https://github.com/DenisVieriu97, https://github.com/albanD * [MPS] Add logical ops (#80216) This PR adds `logical_not`, `logical_and`, `logical_or`, `logical_xor`. Pull Request resolved: #80216 Approved by: https://github.com/albanD, https://github.com/kulinseth * [MPS] Add glu (#79866) Adds mps op for `aten::glu.out`. Pull Request resolved: #79866 Approved by: https://github.com/kulinseth, https://github.com/albanD * [MPS] Fix std/var cache issue (#80502) Use `getTensorsStringKey` which has tensor shape info added as part of the key to prevent cache lookup issue when the shape of input tensor is changed. Fixes #80499 Pull Request resolved: #80502 Approved by: https://github.com/malfet, https://github.com/kulinseth * Add scatter support for view operations (#79939) * Add scatter support for view operations; #78074, #78886, #79672 * Update test_slicing_replace_column to properly test different sizes * Handle in-place changes for binary ops; add new testcase * Add new view ops testing scatter; add MPSDebugConfig.h config file for debugging purposes * Merge gatherViewTensor and scatterViewTensor into a generic function * Add scatter on demand in scatterViewOperation instead of caching it into a generic graph * Create separate graphs for scatter and gather; * Create scatter graph at scatter time Fixes #ISSUE_NUMBER Pull Request resolved: #79939 Approved by: https://github.com/razarmehr * MPS: Fix handling of 1D tensors in linear backward (#80759) Fixes ##79784 Pull Request resolved: #80759 Approved by: https://github.com/ezyang * [MPS] Move the View ops to a separate file and reduce the number of graphs created (#80491) This is dependent on the PR to go in first: #79939 Remove the data_ptr from the View Graph key which reduces the number of graphs created significantly. Don't wait when copying from MPS to MPS tensors Pull Request resolved: #80491 Approved by: https://github.com/malfet * [MPS] Add softplus backward (#79873) Fixes #ISSUE_NUMBER Pull Request resolved: #79873 Approved by: https://github.com/malfet * [MPS] Add argmin (#80828) This PR 1. adds argmin 2. refactors `reduction_type` in `ReduceOps.mm` with enum. Co-authored by Kulin Seth <kulinseth@gmail.com> Pull Request resolved: #80828 Approved by: https://github.com/malfet * [MPS] Fix LSTM batch_first output transposed (#80597) The output of LSTM with `batch_first` should be transposed back to batch first format. Fixes #80306 Pull Request resolved: #80597 Approved by: https://github.com/kulinseth * [MPS][BE] Introduce MPSUnaryCachedGraph (#81033) I.e. CachedGraph that has input and output tensors Also, add `MPSGraphCache::LookUpAs` template, which combines LookUp with static_cast to target type Pull Request resolved: #81033 Approved by: https://github.com/kulinseth * [MPS] Add test consistency from OpInfo based tests from PR 78504 (#79532) Pull Request resolved: #79532 Approved by: https://github.com/albanD, https://github.com/malfet * [MPS] Add huber loss (#80163) Fixes #ISSUE_NUMBER Pull Request resolved: #80163 Approved by: https://github.com/kulinseth, https://github.com/malfet * Remove two tests dependent on the MPS serialization checkin. * Fix lint error (FLAKE8) F401 * Remove the serialization test from test_mps as its support is not there in 1.12.1. Co-authored-by: Kulin Seth <kulinseth@gmail.com> Co-authored-by: Nikita Shulga <nikita.shulga@gmail.com> Co-authored-by: Kulin Seth <kulin_seth@apple.com> Co-authored-by: Abhishek Pathak <abhipathak97@gmail.com> Co-authored-by: Nikita Shulga <nshulga@fb.com> Co-authored-by: qqaatw <qqaatw@gmail.com> Co-authored-by: Ramin Azarmehr <razarmehr@apple.com>

Add amax and amin with tests (#33)

7a740ea

* Add amax and amin with tests

kulinseth requested a review from albanD June 16, 2022 04:59

kulinseth requested a review from bdhirsh as a code owner June 16, 2022 04:59

facebook-github-bot added the cla signed label Jun 16, 2022

kulinseth added the ciflow/trunk Trigger trunk jobs on your pull request label Jun 16, 2022

pytorchbot added the open source label Jun 16, 2022

albanD approved these changes Jun 17, 2022

View reviewed changes

pytorchmergebot added the Merged label Jun 18, 2022

pytorchmergebot closed this in 074dc74 Jun 18, 2022

miladm pushed a commit to miladm/pytorch that referenced this pull request Jun 27, 2022

MPS: Add amax and amin Ops with tests (pytorch#79682)

52c9034

* Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#79682 Approved by: https://github.com/albanD

kulinseth mentioned this pull request Jul 7, 2022

General MPS op coverage tracking issue #77764

Open

kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Jul 9, 2022

MPS: Add amax and amin Ops with tests (pytorch#79682)

04aacfb

* Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#79682 Approved by: https://github.com/albanD

atalman pushed a commit to atalman/pytorch that referenced this pull request Jul 22, 2022

MPS: Add amax and amin Ops with tests (pytorch#79682)

5cbf6fe

* Add amax and amin with tests Fixes #ISSUE_NUMBER Pull Request resolved: pytorch#79682 Approved by: https://github.com/albanD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPS: Add amax and amin Ops with tests #79682

MPS: Add amax and amin Ops with tests #79682

kulinseth commented Jun 16, 2022

facebook-github-bot commented Jun 16, 2022 •

edited

Loading

🕵️ 3 new failures recognized by patterns

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu) (1/3)

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 3, 4, linux.4xlarge.nvidia.gpu) (2/3)

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu) (3/3)

kulinseth commented Jun 16, 2022

albanD left a comment

kulinseth commented Jun 18, 2022

kulinseth commented Jun 18, 2022

pytorchmergebot commented Jun 18, 2022

github-actions bot commented Jun 18, 2022

MPS: Add amax and amin Ops with tests #79682

MPS: Add amax and amin Ops with tests #79682

Conversation

kulinseth commented Jun 16, 2022

facebook-github-bot commented Jun 16, 2022 • edited Loading

🔗 Helpful links

❌ 3 New Failures

🕵️ 3 new failures recognized by patterns

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu) (1/3)

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 3, 4, linux.4xlarge.nvidia.gpu) (2/3)

pull / linux-xenial-cuda11.3-py3.7-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu) (3/3)

kulinseth commented Jun 16, 2022

albanD left a comment

Choose a reason for hiding this comment

kulinseth commented Jun 18, 2022

kulinseth commented Jun 18, 2022

pytorchmergebot commented Jun 18, 2022

github-actions bot commented Jun 18, 2022

facebook-github-bot commented Jun 16, 2022 •

edited

Loading