-
Notifications
You must be signed in to change notification settings - Fork 15
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #8 from jade-hpc-gpu/dev__mihai
update to readthedocs (using JADE + cuda)
- Loading branch information
Showing
7 changed files
with
172 additions
and
31 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
.. _software: | ||
|
||
CUDA | ||
==== | ||
|
||
.. sidebar:: CUDA | ||
|
||
:URL: http://www.nvidia.co.uk/object/cuda-parallel-computing-uk.html | ||
|
||
CUDA is a parallel computing platform and API model created and developed by Nvidia, which enables dramatic increases in computing performance by harnessing the power of GPUs | ||
|
||
|
||
Versions | ||
-------- | ||
Multiple CUDA versions are available through the module system | ||
|
||
|
||
Environment | ||
----------- | ||
The CUDA environment is managed through the modules, which set all the environment variables needed. The availability of different versions can be checked with :: | ||
|
||
module avail cuda | ||
|
||
The environment set by a particular module can be inspected, *e.g.* :: | ||
|
||
module show cuda/9.0 | ||
|
||
|
||
Learn more | ||
---------- | ||
To learn more about CUDA programming, either talk to your local RSE | ||
support, or visit Mike Giles' CUDA Programming course page at | ||
|
||
http://people.maths.ox.ac.uk/gilesm/cuda/ | ||
|
||
This one-week course is taught in Oxford at the end of July each year, | ||
but all of the lecture notes and practicals are provided online for | ||
self-study at other times. | ||
|
||
|
||
|
||
|
||
.. toctree:: | ||
:maxdepth: 2 | ||
:glob: | ||
|
||
learn/index |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,58 @@ | ||
.. _learn: | ||
|
||
CUDA documentation | ||
================== | ||
|
||
NVIDIA provides lots of documentation, both online and in downloadable form: | ||
|
||
* `Online CUDA documentation <http://docs.nvidia.com/cuda/index.html>`_ | ||
* `CUDA homepage <http://www.nvidia.com/object/cuda_home.html>`_ | ||
* `CUDA Runtime API <http://docs.nvidia.com/cuda/pdf/CUDA_Runtime_API.pdf>`_ | ||
* `CUDA C Best Practices Guide <http://docs.nvidia.com/cuda/pdf/CUDA_C_Best_Practices_Guide.pdf>`_ | ||
* `CUDA Compiler Driver NVCC <http://docs.nvidia.com/cuda/pdf/CUDA_Compiler_Driver_NVCC.pdf>`_ | ||
* `CUDA Visual Profiler <http://docs.nvidia.com/cuda/pdf/CUDA_Profiler_Users_Guide.pdf>`_ | ||
* `CUDA-gdb debugger <http://docs.nvidia.com/cuda/pdf/CUDA_GDB.pdf>`_ | ||
* `CUDA-memcheck memory checker <http://docs.nvidia.com/cuda/pdf/CUDA_Memcheck.pdf>`_ | ||
* `CUDA maths library <http://docs.nvidia.com/pdf/CUDA_Math_API.pdf>`_ | ||
* `CUBLAS library <http://docs.nvidia.com/cuda/pdf/CUDA_CUBLAS_Users_Guide.pdf>`_ | ||
* `CUFFT library <http://docs.nvidia.com/cuda/pdf/CUDA_CUFFT_Users_Guide.pdf>`_ | ||
* `CUSPARSE library <http://docs.nvidia.com/cuda/pdf/CUDA_CUSPARSE_Users_Guide.pdf>`_ | ||
* `CURAND library <http://docs.nvidia.com/cuda/pdf/CURAND_Library.pdf>`_ | ||
* `NCCL multi-GPU communications library <https://developer.nvidia.com/nccl>`_ | ||
* `NVIDIA blog article <https://devblogs.nvidia.com/parallelforall/fast-multi-gpu-collectives-nccl/>`_ | ||
* `GTC 2015 presentation on NCCL <http://images.nvidia.com/events/sc15/pdfs/NCCL-Woolley.pdf>`_ | ||
* `PTX (low-level instructions) <http://docs.nvidia.com/cuda/pdf/ptx_isa_4.1.pdf>`_ | ||
|
||
|
||
Nsight is NVIDIA's integrated development environment: | ||
|
||
* `Nsight Visual Studio <https://developer.nvidia.com/nvidia-nsight-visual-studio-edition>`_ | ||
* `Nsight Eclipse <https://developer.nvidia.com/nsight-eclipse-edition>`_ | ||
* `Nsight Eclipse -- Getting Started <http://docs.nvidia.com/cuda/nsight-eclipse-edition-getting-started-guide/index.html>`_ | ||
|
||
|
||
NVIDIA also provide helpful guides on the Pascal architecture: | ||
|
||
* `Floating point accuracy on NVIDIA GPUs <http://docs.nvidia.com/cuda/pdf/Floating_Point_on_NVIDIA_GPU_White_Paper.pdf>`_ | ||
* `CUDA SDK examples <http://developer.nvidia.com/object/cuda_sdk_samples.html">`_ | ||
* `OpenACC <http://www.openacc.org>`_ | ||
* `OpenMP 4.5 <http://on-demand.gputechconf.com/gtc/2016/presentation/s6510-jeff-larkin-targeting-gpus-openmp.pdf>`_ | ||
|
||
|
||
NVIDIA also provide helpful guides on the Pascal architecture: | ||
|
||
* `Pascal Tuning Guide <http://docs.nvidia.com/cuda/pascal-tuning-guide/>`_ | ||
* `Pascal P100 White Paper <https://images.nvidia.com/content/pdf/tesla/whitepaper/pascal-architecture-whitepaper.pdf>`_ | ||
|
||
|
||
Useful presentations at NVIDIA's 2017 GTC conference contain: | ||
|
||
* `Cooperative Groups <http://on-demand.gputechconf.com/gtc/2017/presentation/s7622-Kyrylo-perelygin-robust-and-scalable-cuda.pdf>`_ | ||
* `NCCL 2.0 <http://on-demand.gputechconf.com/gtc/2017/presentation/s7155-jeaugey-nccl.pdf>`_ | ||
* `Multi-GPU Programming <http://on-demand.gputechconf.com/gtc/2017/presentation/s7142-jiri-kraus-multi-gpu-programming-models.pdf>`_ | ||
* `The Making of Saturn-V <http://on-demand.gputechconf.com/gtc/2017/presentation/s7750-louis-capps-making-of-dgx-saturnv.pdf>`_ | ||
|
||
|
||
.. toctree:: | ||
:maxdepth: 1 | ||
:glob: |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -55,5 +55,6 @@ JADE hardware consists of: | |
|
||
jade/index | ||
software/index | ||
cuda/index | ||
more_info | ||
troubleshooting |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters