-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
High Performance Computing: CUDA and GCP #1
Comments
-tc tensor cores on dual 4090
13900K - 160Gb ram, two MSI 4090 suprim liquid X
|
RTX-A4000
|
NVIDIA TX-A4500 single PCIe-x16 13900K 1600W PSU - 128G DDR5
|
dual RTX-A4500 and RTX-A4000
|
add dual RTX-4500 add single RTX-3500 on lenovo P1 Gen 6
running with -d
running with -tc tensor cores
Hardware issues to fix |
Use Cases
Tensor cores have 3.5x the performance on NVidia GPUs than cuda cores
LLM and Generative AI
Collatz
Implementation
see https://github.com/obrienlabs/CUDA-Programs/tree/main/Chapter01/gpusum as part of the book from Richard Ansorge of University of Cambridge https://www.cambridge.org/core/books/programming-in-parallel-with-cuda/C43652A69033C25AD6933368CDBE084C
The text was updated successfully, but these errors were encountered: