[ROCm] Enable deterministic rocBLAS mode #48654

ashishfarmer · 2020-12-01T17:48:57Z

The PR adds a feature to disable atomics in rocblas calls thereby making the output deterministic when it is expected in pyTorch. This mode of rocBLAS can be exercised using the global setting torch.set_deterministic(True)

cc: @ezyang @jeffdaily @sunway513

dr-ci · 2020-12-01T18:57:25Z

💊 CI failures summary and remediations

As of commit f2afa4c (more details on the Dr. CI page):

✅ None of the CI failures appear to be your fault 💚

1/1 broken upstream at merge base 98fddc1 since Nov 30

🚧 1 ongoing upstream failure:

These were probably caused by upstream breakages that are not fixed yet:

pytorch_linux_xenial_py3_clang7_onnx_ort_test2 since Nov 30
- 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group.

See how this bot performed.

This comment has been revised 4 times.

jeffdaily

LGTM

zasdfgbnm

What if I call rocBLAS with deterministic=True, then set it to False, and call rocBLAS again? Would the performance of the second rocBLAS hurt by this?

This handle is not just used once and discarded. It will be returned to the pool when the operation finishes, so later the rocblas_atomics_not_allowed will remain in the handle until the end of PyTorch process.

ashishfarmer · 2020-12-01T23:22:15Z

Thanks @zasdfgbnm for the catch. In the case when the global setting is toggled after the handle is returned to the pool, if it is used again, it will still use no_atomics mode. Just added the check to query the mode each time getCurrentCUDABlasHandle() is called.

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-12-02T19:18:59Z

@ezyang merged this pull request in b2ec21a.

Summary: The PR adds a feature to disable atomics in rocblas calls thereby making the output deterministic when it is expected in pyTorch. This mode of rocBLAS can be exercised using the global setting `torch.set_deterministic(True)` cc: ezyang jeffdaily sunway513 Pull Request resolved: pytorch#48654 Reviewed By: bdhirsh Differential Revision: D25272296 Pulled By: ezyang fbshipit-source-id: 70400572b0ab37c6db52636584de0ae61bb5270a

allow disabling atomics in rocblas

5729353

facebook-github-bot added the cla signed label Dec 1, 2020

jeffdaily added the module: rocm AMD GPU support for Pytorch label Dec 1, 2020

ashishfarmer mentioned this pull request Dec 1, 2020

Enable deterministic rocBLAS calls in PyTorch ROCm/pytorch#774

Closed

pytorchbot added the open source label Dec 1, 2020

zhangguanheng66 requested a review from zasdfgbnm December 1, 2020 22:22

zhangguanheng66 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Dec 1, 2020

zhangguanheng66 requested a review from jeffdaily December 1, 2020 22:22

jeffdaily approved these changes Dec 1, 2020

View reviewed changes

zasdfgbnm reviewed Dec 1, 2020

View reviewed changes

more robust deterministic mode checking

f2afa4c

zasdfgbnm approved these changes Dec 1, 2020

View reviewed changes

facebook-github-bot reviewed Dec 2, 2020

View reviewed changes

facebook-github-bot closed this in b2ec21a Dec 2, 2020

facebook-github-bot added the Merged label Dec 2, 2020

ashishfarmer mentioned this pull request Dec 2, 2020

Fix test_linear_transformation for ROCm pytorch/vision#3099

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ROCm] Enable deterministic rocBLAS mode #48654

[ROCm] Enable deterministic rocBLAS mode #48654

ashishfarmer commented Dec 1, 2020

dr-ci bot commented Dec 1, 2020 •

edited

jeffdaily left a comment

zasdfgbnm left a comment •

edited

ashishfarmer commented Dec 1, 2020

facebook-github-bot left a comment

facebook-github-bot commented Dec 2, 2020

[ROCm] Enable deterministic rocBLAS mode #48654

[ROCm] Enable deterministic rocBLAS mode #48654

Conversation

ashishfarmer commented Dec 1, 2020

dr-ci bot commented Dec 1, 2020 • edited

💊 CI failures summary and remediations

🚧 1 ongoing upstream failure:

jeffdaily left a comment

Choose a reason for hiding this comment

zasdfgbnm left a comment • edited

Choose a reason for hiding this comment

ashishfarmer commented Dec 1, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Dec 2, 2020

dr-ci bot commented Dec 1, 2020 •

edited

zasdfgbnm left a comment •

edited