Reduction cast f16 to f32 only on macOS 12 #332

Merged: 1 commit into master on Feb 15, 2023

Conversation

Ronian526
Collaborator

  • unblock rdiv float16


@razarmehr razarmehr left a comment


Looks good.

@razarmehr razarmehr merged commit c65b823 into master Feb 15, 2023
@razarmehr razarmehr deleted the ronian/reduction_cast branch February 15, 2023 23:56
razarmehr added a commit that referenced this pull request Feb 16, 2023
@razarmehr razarmehr added the Upstreamed Change has been upstreamed to PyTorch master label Feb 16, 2023
kulinseth pushed a commit that referenced this pull request Feb 22, 2023
kulinseth added a commit that referenced this pull request Feb 28, 2023
Remove torch._six from test_mps (#326)

Fix test_zero_grad() (#330)

Fix bilinear backward pass (#331)

* Fix bilinear backward pass

* Remove comment

Update macOS 12 blocklist (#323)

* Update macOS 12 blocklist
- move sum, masked.var, mul to low precision list
- unblock them from running

* - mark __rdiv__ failures as accumulated error exceeds atol/rtol

Fix nn.functional.embedding grad (#335)

- cast the input tensor to float32 and cast the output tensor back
- unblock the test
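The embedding fix above follows a common MPS workaround: run the half-precision computation in float32, then cast the result back. A minimal NumPy sketch of that pattern (the function name is hypothetical; the actual change lives in the MPS embedding backward kernel):

```python
import numpy as np

def embedding_grad_via_f32(grad_output):
    """Sketch of the cast-up/cast-down workaround: accumulate the
    embedding gradient in float32, then cast back to the input dtype."""
    orig_dtype = grad_output.dtype
    grad32 = grad_output.astype(np.float32)
    # ... the real backward pass would scatter-add grad32 here ...
    return grad32.astype(orig_dtype)
```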

Fix prelu backward (#334)

Reduction cast f16 to f32 only on macOS 12 (#332)

- unblock rdiv float16
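The idea in this PR's title, casting half-precision reductions up to float32 only when running on macOS 12, can be sketched as below. The `is_macos12` flag and function name are illustrative, not the backend's actual API:

```python
import numpy as np

def reduce_sum(x, is_macos12=True):
    """Sketch: on macOS 12, float16 reductions are computed in float32
    to avoid accumulated rounding error, then cast back; elsewhere the
    native float16 path is used."""
    if is_macos12 and x.dtype == np.float16:
        return x.astype(np.float32).sum().astype(np.float16)
    return x.sum()
```

Accumulating in float32 keeps long float16 sums within atol/rtol of the reference result, which is why the rdiv float16 tests could be unblocked.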

Fix trace op (#340)

- warn when converting int64 for reduction ops
- use cast tensor for reduction sum on trace
- unblock trace from running
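The trace fix can be pictured as reducing over the diagonal with a cast when the dtype is problematic for reductions. A hedged NumPy sketch (the helper name and the int32 target are illustrative):

```python
import warnings
import numpy as np

def mps_like_trace(x):
    """Sketch: warn and downcast int64 before the reduction sum, since
    int64 reductions were problematic on the MPS backend, then compute
    trace as the sum of the main diagonal."""
    if x.dtype == np.int64:
        warnings.warn("int64 reduction is computed after a cast to int32")
        x = x.astype(np.int32)
    return np.diagonal(x).sum()
```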

Update random result list (#339)

* - move nn.functional.feature_alpha_dropoutwith_train, normalnumber_mean, new_empty_strided to expected failures

* - update new_empty_strided

---------

Co-authored-by: Kulin Seth <kulin_seth@apple.com>

Enable int8 in TestConsistency (#347)

Dev/skotapati/copy broadcasting (#350)

* Handle broadcasting by expanding src tensor in Copy.mm

* Unblock linalg_matrix_power

* Improved formatting
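Expanding the source tensor before the copy, as the Copy.mm change describes, amounts to broadcasting src to dst's shape. A minimal NumPy sketch of the same semantics (the function name is hypothetical):

```python
import numpy as np

def copy_with_broadcast(dst, src):
    """Sketch of a broadcasting copy: expand src to dst's shape
    (a read-only view, no data movement) and then copy element-wise."""
    expanded = np.broadcast_to(src, dst.shape)
    dst[...] = expanded
    return dst
```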

Add the functionality to dump MPS ops.

1. Add DUMP_MPS_OPS, which uses LoggingTensor to dump out the ATen ops.
2. Skip running the EXPECTTEST list, as some tests are still
   seg-faulting
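The dump switch described above can be sketched as an environment-variable gate. `DUMP_MPS_OPS` is the variable named in the commit, while the helper itself is hypothetical:

```python
import os

def should_dump_mps_ops():
    """Return True when the DUMP_MPS_OPS environment variable is set,
    gating whether ATen op names get logged via a LoggingTensor-style
    wrapper."""
    return bool(os.environ.get("DUMP_MPS_OPS"))
```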

Fix lintrunner errors (#353)

* Fix lintrunner errors

* - move normal_in_place to random result list

Fixed test_mps.

test_mps is updated.
DenisVieriu97 pushed a commit that referenced this pull request Feb 28, 2023
skotapati pushed a commit that referenced this pull request Apr 7, 2023
jhavukainen pushed a commit that referenced this pull request Mar 15, 2024