use nsight system inside containers #112

Open
Fizzbb opened this issue Mar 23, 2022 · 1 comment

Comments

@Fizzbb
Collaborator

Fizzbb commented Mar 23, 2022

Since all workloads are deployed in containers, Nsight Systems has to be launched inside the container.
The procedure is as follows:

  1. In the Dockerfile, add a command to install the CUDA toolkit. Nsight Systems and Nsight Compute will be installed under the /usr/local/cuda-11.4 folder.
    RUN apt-get install -y cuda-toolkit-11-4
  2. Mount a host path into the application container for passing the Nsight report file out. Using kubectl cp instead sometimes causes an EOF error and produces a corrupted file that Nsight cannot open.
  3. Run the profiling:
    nsys profile -o report --stats=true --force-overwrite true python XXX.py
    If nsys is not found, use the full path /usr/local/cuda-11.4/bin/nsys.
  4. Copy the report to the mounted folder.
  5. Use SCP to download the report.qdrep file to a local machine with the Nsight GUI installed, and inspect the timeline.
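
Steps 3 and 4 could be wrapped in a small shell helper along these lines. This is only a sketch: `REPORT_DIR` (defaulting to `/mnt/nsight-reports`) is a hypothetical mount point standing in for whatever host path is mounted in step 2, and the function names `find_nsys` and `profile_to_mount` are made up for illustration. The nsys flags and the fallback path are the ones from the steps above.

```shell
#!/bin/sh
# Sketch only: profile a command with nsys and copy the report to a mounted folder.
# REPORT_DIR is a hypothetical mount point; change it to the path mounted in step 2.
REPORT_DIR="${REPORT_DIR:-/mnt/nsight-reports}"
# Fallback location of nsys when it is not on PATH (path taken from step 3).
NSYS_FALLBACK="${NSYS_FALLBACK:-/usr/local/cuda-11.4/bin/nsys}"

# Print the nsys binary to use: the one on PATH if present, else the fallback path.
find_nsys() {
    if command -v nsys >/dev/null 2>&1; then
        command -v nsys
    elif [ -x "$NSYS_FALLBACK" ]; then
        printf '%s\n' "$NSYS_FALLBACK"
    else
        echo "nsys not found on PATH or at $NSYS_FALLBACK" >&2
        return 1
    fi
}

# Profile the given command in the current directory, then copy every
# report.* file (the report itself plus any stats export) into REPORT_DIR.
profile_to_mount() {
    nsys_bin=$(find_nsys) || return 1
    "$nsys_bin" profile -o report --stats=true --force-overwrite true "$@" || return 1
    mkdir -p "$REPORT_DIR" || return 1
    cp report.* "$REPORT_DIR"/
}
```

After sourcing the file inside the container, `profile_to_mount python XXX.py` would run the profile and leave the report in the mounted folder, ready for the SCP download in step 5.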

Limitation: this approach cannot profile across containers.

@pint1022
Collaborator

Hope it works across multiple containers.
