Your job is to
- Edit
vec_add.cuto actually use CUDA with a grid-stride loop (see notes or https://developer.nvidia.com/blog/even-easier-introduction-cuda/) - Adapt the submission script to run with
nvprofon Perlmutter - Submit the
vec_add.cuand the output ofnvprof