Workaround arm64 gcc error in `std::copysign` #51900

malfet · 2021-02-08T20:46:52Z

Move definition of copysign template and specialization for
bfloat16/half types before first use of copysign in that file

Add comment explaining why this is necessary

Fixes #51889

Move definition of copysign template and specialization for bfloat16/half types before first use of copysign in that file Add comment explaining why this is necessary Fixes pytorch#51889

facebook-github-bot · 2021-02-08T20:57:43Z

💊 CI failures summary and remediations

As of commit 2c4c41e (more details on the Dr. CI page):

1/1 failures possibly* introduced in this PR
- 1/1 non-CircleCI failure(s)

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

facebook-github-bot

@malfet has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-02-09T13:13:25Z

@malfet merged this pull request in 5603463.

Summary: Move definition of copysign template and specialization for bfloat16/half types before first use of copysign in that file Add comment explaining why this is necessary Fixes pytorch#51889 Pull Request resolved: pytorch#51900 Reviewed By: walterddr Differential Revision: D26321741 Pulled By: malfet fbshipit-source-id: 888858b11d9708fa140fe9c0570cc5a24599205b

Summary: Move definition of copysign template and specialization for bfloat16/half types before first use of copysign in that file Add comment explaining why this is necessary Fixes #51889 Pull Request resolved: #51900 Reviewed By: walterddr Differential Revision: D26321741 Pulled By: malfet fbshipit-source-id: 888858b11d9708fa140fe9c0570cc5a24599205b

… / 8 for CUDA (#51834) Summary: It seems that the std::copysign code introduced in #51706 is too much for gcc 7.5 / 8 when compiled on arm64 (e.g. on Jetson with latest Jetpack) and causes it to produce an internal compiler error with segfault during compilation. This avoids the compiler bug it by not using std::copysign. A very kind person sent a Jetson Xavier NX {emoji:1f381} thank you {emoji:2764}. After #51900 fixed this for CPU-only arm64 (eg Raspberry), this fixes it for CUDA-using arm64 (e.g. Jetson). CUDA device lambdas must also be present as host functions for technical reasons but they are never used, so we just assert in the CPU variant instead of actually doing the operation. Pull Request resolved: #51834 Reviewed By: mrshenli Differential Revision: D27622277 Pulled By: malfet fbshipit-source-id: a1dc4c3a67f925019782e24b796919e17339749f

… / 8 for CUDA (pytorch#51834) Summary: It seems that the std::copysign code introduced in pytorch#51706 is too much for gcc 7.5 / 8 when compiled on arm64 (e.g. on Jetson with latest Jetpack) and causes it to produce an internal compiler error with segfault during compilation. This avoids the compiler bug it by not using std::copysign. A very kind person sent a Jetson Xavier NX {emoji:1f381} thank you {emoji:2764}. After pytorch#51900 fixed this for CPU-only arm64 (eg Raspberry), this fixes it for CUDA-using arm64 (e.g. Jetson). CUDA device lambdas must also be present as host functions for technical reasons but they are never used, so we just assert in the CPU variant instead of actually doing the operation. Pull Request resolved: pytorch#51834 Reviewed By: mrshenli Differential Revision: D27622277 Pulled By: malfet fbshipit-source-id: a1dc4c3a67f925019782e24b796919e17339749f

Workaround arm64 gcc error in std::copysign

2c4c41e

Move definition of copysign template and specialization for bfloat16/half types before first use of copysign in that file Add comment explaining why this is necessary Fixes pytorch#51889

malfet requested review from t-vi, mruberry and a team February 8, 2021 20:46

malfet mentioned this pull request Feb 8, 2021

avoid CPU std::copysign segfault when compiling on arm64 with gcc 7.5 / 8 for CUDA #51834

Closed

facebook-github-bot added the cla signed label Feb 8, 2021

facebook-github-bot reviewed Feb 8, 2021

View reviewed changes

walterddr approved these changes Feb 8, 2021

View reviewed changes

facebook-github-bot closed this in 5603463 Feb 9, 2021

facebook-github-bot added the Merged label Feb 9, 2021

malfet deleted the malfet/fix-copysign-ice branch February 9, 2021 17:35

This was referenced Feb 10, 2021

[1.8] Workaround arm64 gcc error in std::copysign (#51900) #52049

Merged

[v.1.8.0] Release Tracker #51886

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workaround arm64 gcc error in `std::copysign` #51900

Workaround arm64 gcc error in `std::copysign` #51900

malfet commented Feb 8, 2021

facebook-github-bot commented Feb 8, 2021 •

edited

facebook-github-bot left a comment

facebook-github-bot commented Feb 9, 2021

Workaround arm64 gcc error in std::copysign #51900

Workaround arm64 gcc error in std::copysign #51900

Conversation

malfet commented Feb 8, 2021

facebook-github-bot commented Feb 8, 2021 • edited

💊 CI failures summary and remediations

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Feb 9, 2021

Workaround arm64 gcc error in `std::copysign` #51900

Workaround arm64 gcc error in `std::copysign` #51900

facebook-github-bot commented Feb 8, 2021 •

edited