-
Notifications
You must be signed in to change notification settings - Fork 25.3k
Closed
pytorch/tutorials
#2869Labels
oncall: profilerprofiler-related issues (cpu, gpu, kineto)profiler-related issues (cpu, gpu, kineto)
Description
🐛 Describe the bug
I'm attempting to profile a bit of PyTorch code, and the stack traces as exported by profiler.export_stacks()
are not accepted by flamegraph.pl. In particular I'm getting errors of the form
[nix-shell:~/dev/cifar10-fast]$ flamegraph.pl --title "pytorch single epoch" --countname "us." profiling/stacks_5.txt
ERROR: No stack counts found
<?xml version="1.0" standalone="no"?>
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN" "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
<svg version="1.1" width="1200" height="60" onload="init(evt)" viewBox="0 0 1200 60" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
<!-- Flame graph stack visualization. See https://github.com/brendangregg/FlameGraph for latest version, and http://www.brendangregg.com/flamegraphs.html for examples. -->
<!-- NOTES: -->
<text x="600.00" y="24" >ERROR: No valid input provided to flamegraph.pl.</text>
</svg>
You can see the operative bit of profiling code here:
For convenience I have also checked in the stack trace output files here.
To reproduce,
- Checkout https://github.com/samuela/cifar10-fast at samuela/cifar10-fast@a660ec0.
- Run
nix-shell
. - Run
nixGLNvidia-510.47.03 python dawn.py
.
Versions
[nix-shell:~/dev/cifar10-fast]$ python collect_env.py
PyTorch version: 1.10.0+cu113
Is debug build: False
CUDA used to build PyTorch: 11.3
ROCM used to build PyTorch: N/A
OS: Ubuntu 20.04.4 LTS (x86_64)
GCC version: (GCC) 10.3.0
Clang version: Could not collect
CMake version: Could not collect
Libc version: glibc-2.33
Python version: 3.9.10 (main, Jan 13 2022, 23:32:03) [GCC 10.3.0] (64-bit runtime)
Python platform: Linux-5.11.0-1028-aws-x86_64-with-glibc2.33
Is CUDA available: False
CUDA runtime version: Could not collect
GPU models and configuration: GPU 0: Tesla V100-SXM2-16GB
Nvidia driver version: 510.47.03
cuDNN version: Could not collect
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True
Versions of relevant libraries:
[pip3] Could not collect
[conda] Could not collect
cc @ilia-cher @robieta @chaekit @gdankel @bitfort @ngimel @orionr @nbcsm @guotuofeng @guyang3532 @gaoteng-git
Metadata
Metadata
Assignees
Labels
oncall: profilerprofiler-related issues (cpu, gpu, kineto)profiler-related issues (cpu, gpu, kineto)