Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workaround for cublas bug for 45724 #46001

Closed
wants to merge 7 commits into from
Closed

Workaround for cublas bug for 45724 #46001

wants to merge 7 commits into from

Conversation

zasdfgbnm
Copy link
Collaborator

Fixes #45724

@zasdfgbnm zasdfgbnm marked this pull request as ready for review October 8, 2020 00:05
return cublasGemmStridedBatchedEx(handle, transa, transb, m, n, k, alpha, A, Atype, lda, strideA, B, Btype, ldb, strideB, beta, C, Ctype, ldc, strideC, batchCount, computeType, algo);
}
cublasStatus_t result;
for(int64_t i = 0; i < batchCount; i += 65535) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hm, can you make it something rounder than 65535 here? You have a good chance of having horrible alignment

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed to 63 * 1024

Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@codecov
Copy link

codecov bot commented Oct 8, 2020

Codecov Report

Merging #46001 into master will not change coverage.
The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master   #46001   +/-   ##
=======================================
  Coverage   68.25%   68.25%           
=======================================
  Files         410      410           
  Lines       53246    53246           
=======================================
  Hits        36343    36343           
  Misses      16903    16903           

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c8d76ff...e0fb315. Read the comment docs.

@facebook-github-bot
Copy link
Contributor

@ngimel merged this pull request in b2bff9e.

zasdfgbnm added a commit that referenced this pull request Oct 8, 2020
Summary:
Fixes #45724

Pull Request resolved: #46001

Reviewed By: mruberry

Differential Revision: D24184058

Pulled By: ngimel

fbshipit-source-id: 7d2bab3206ddbc10a7cae3efd9b5e253f38400a9
malfet pushed a commit that referenced this pull request Oct 8, 2020
Summary:
Fixes #45724

Pull Request resolved: #46001

Reviewed By: mruberry

Differential Revision: D24184058

Pulled By: ngimel

fbshipit-source-id: 7d2bab3206ddbc10a7cae3efd9b5e253f38400a9
@facebook-github-bot facebook-github-bot deleted the cublas65535 branch January 27, 2021 18:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

torch.matmul output contains some nan value for large size fp16 tensors in V100 GPU
4 participants