Updates for Volta architecture #414
Comments
It appears that it also has difficulty even running in a standard configuration on the Volta series. Attempts to run on a V100 cause these errors:
AWS has Volta GPUs: https://aws.amazon.com/ec2/instance-types/p3/
I'm interested in understanding exactly what has to change here. Would merely moving to the synchronizing warp intrinsics do the job (with some sort of performance impact?), or does something more fundamental have to happen? What, if any, changes to
I think it is just a matter of adding warp synchronisation at the right points. I have done this in the obvious places already (e.g. here) but haven't found a Volta machine to actually test this on yet. Performance-wise, it could be beneficial to look at other implementations which don't require the extra synchronisation points, but, again, I'm not exactly sure yet...
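For anyone following along, here is a minimal CUDA sketch of what "adding warp synchronisation" usually means at the source level. It is not the code Accelerate generates (that is emitted as LLVM/PTX), and the names `warpReduceSum`, `sumOneWarp` and `runSumOneWarp` are made up for illustration. It assumes CUDA 9 or later and that all 32 lanes of the warp take part in the reduction.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Sum the values held by the 32 lanes of one warp (hypothetical helper).
// Assumption: every lane of the warp calls this together, hence the full mask.
__device__ float warpReduceSum(float val)
{
    const unsigned fullMask = 0xffffffffu;
    for (int offset = warpSize / 2; offset > 0; offset /= 2) {
        // Pre-Volta code would typically use __shfl_down(val, offset) and rely
        // on the warp running in lockstep. The _sync variant names the
        // participating lanes and synchronises them before exchanging data.
        val += __shfl_down_sync(fullMask, val, offset);
    }
    return val;  // lane 0 now holds the total
}

__global__ void sumOneWarp(const float *in, float *out)
{
    float total = warpReduceSum(in[threadIdx.x]);
    if (threadIdx.x == 0) *out = total;
}

// Host-side wrapper: sums exactly 32 floats using a single warp.
float runSumOneWarp(const float *h_in)
{
    float h_out = 0.0f;
    float *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, 32 * sizeof(float));
    cudaMalloc(&d_out, sizeof(float));
    cudaMemcpy(d_in, h_in, 32 * sizeof(float), cudaMemcpyHostToDevice);

    sumOneWarp<<<1, 32>>>(d_in, d_out);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(&h_out, d_out, sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}

int main()
{
    float h_in[32];
    for (int i = 0; i < 32; ++i) h_in[i] = 1.0f;
    printf("%f\n", runSumOneWarp(h_in));
    return 0;
}
```

The point of the sketch is only the shuffle line: the data flow is the same as in the old warp-synchronous version, but the synchronisation is now explicit instead of assumed. Whether that costs anything measurable on Volta is exactly the open question above.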
IIRC we've got some Titan Vs around at work somewhere; I'll see if I can grab one. If so I'd be happy to run whatever tests/benchmarks you think are relevant.
If you could run just the standard test suite on it that would be amazing; if it works (🤞) that is a decent sanity check:
stack test accelerate-llvm-ptx
Managed to get my hands on one. The test suite has been running for almost 24 hours... I see that there's an issue for speeding up the test suite.
That seems like it might have hung? If I get it to just run one test each it will complete in under 2 minutes on my machine:
Hah, for some reason
tmcdonell/cuda@9351c1f includes device information for compute 7.x, so you shouldn't get those errors anymore.
Fixed in v1.3
The Volta architecture (compute capability 7.0) makes some changes which we'll need to update for; in particular they've given up on the warp-synchronous programming model.
https://devblogs.nvidia.com/inside-volta/