tblextr.py : bad kfd record / IndexError #42

Open
yoann-heitz opened this issue Mar 19, 2021 · 4 comments

@yoann-heitz

When running the following command:

./rocprofiler/bin/rocprof --hsa-trace --hip-trace --kfd-trace -d traces python3 ./test.py

I sometimes get one of the following errors:

  File "/home/yoann/rocprofiler/bin/tblextr.py", line 710, in <module>
    hip_trace_found = fill_api_db('HIP', db, indir, 'hip', HIP_PID, OPS_PID, [], {}, 1)
  File "/home/yoann/rocprofiler/bin/tblextr.py", line 441, in fill_api_db
    copy_data = list(copy_raws[copy_index])
IndexError: list index out of range
Profiling data corrupted: ' traces/rpl_data_210319_075021_860663/input_results_210319_075021/results.txt'

or

scan kfd API data 2803664:2803665                                                                                                    /home/yoann/rocprofiler/bin/tblextr.py: kfd bad record: ''
Profiling data corrupted: ' traces/rpl_data_210319_074742_860200/input_results_210319_074742/results.txt'

I am using rocm-4.0.0, and I cloned and built rocprofiler and roctracer from the GitHub repositories. I tried both the amd-master and rocm-4.0.x branches. The IndexError occurred with both. The kfd bad record error only occurred with the rocm-4.0.0 branch, but since the errors don't occur on every run, it might also occur with the amd-master branch.

index_error.txt
bad_record_error.txt
test.py.zip
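
For illustration only, here is a minimal sketch of how the lookup that fails in the first traceback could be guarded. The names `copy_raws` and `copy_index` come from the traceback; the helper `safe_copy_record` is hypothetical and is not part of tblextr.py. Whether skipping a missing record is the right behaviour is an assumption, since a missing record may point to incomplete trace data upstream.

```python
def safe_copy_record(copy_raws, copy_index):
    """Return the record at copy_index as a list, or None if it is missing.

    Hypothetical helper: mirrors `list(copy_raws[copy_index])` from the
    traceback, but returns None instead of raising IndexError.
    """
    if 0 <= copy_index < len(copy_raws):
        return list(copy_raws[copy_index])
    return None


# Example with made-up records: the second lookup is out of range.
records = [(1, 2), (3, 4)]
print(safe_copy_record(records, 1))
print(safe_copy_record(records, 2))
```

A caller would still have to decide what to do with `None`, for example skip the row and report how many records were missing.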

@arfio

arfio commented Apr 1, 2021

I get the same error using the latest 4.1 ROCm release.

@eshcherb
Contributor

eshcherb commented Apr 8, 2021

Could you check with '--sys-trace', which should enable HIP + HSA tracing?

@arfio

arfio commented Apr 9, 2021

Tracing with '--sys-trace' alone works correctly.
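
Since the failures are intermittent, comparing flag sets over several runs may help. The sketch below is one possible way to do that, not an official tool: the helper names (`build_command`, `find_failures`, `run_once`) are hypothetical, the paths are the ones from the original report, and the failure markers are copied from the error output above.

```python
import subprocess

# Substrings taken from the error output quoted in this issue.
FAILURE_MARKERS = (
    "IndexError: list index out of range",
    "kfd bad record",
    "Profiling data corrupted",
)


def build_command(trace_flags, rocprof="./rocprofiler/bin/rocprof",
                  outdir="traces", app=("python3", "./test.py")):
    """Assemble the rocprof command line for a given list of trace flags."""
    return [rocprof, *trace_flags, "-d", outdir, *app]


def find_failures(output):
    """Return the known failure markers that appear in the captured output."""
    return [marker for marker in FAILURE_MARKERS if marker in output]


def run_once(trace_flags):
    """Run rocprof once and report which failure markers showed up."""
    proc = subprocess.run(build_command(trace_flags),
                          capture_output=True, text=True)
    return find_failures(proc.stdout + proc.stderr)


if __name__ == "__main__":
    # Print the two command lines discussed in this thread.
    for flags in (["--hsa-trace", "--hip-trace", "--kfd-trace"], ["--sys-trace"]):
        print(" ".join(build_command(flags)))
```

Calling `run_once(flags)` in a loop for each flag set and counting the non-empty results would give a rough failure rate per configuration.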

@arfio

arfio commented Mar 21, 2022

I am not able to reproduce this issue with the latest ROCm (5.0.0).
