Skip to content
This repository was archived by the owner on Apr 2, 2025. It is now read-only.

CUDA-11.6-fix#509

Merged
jmellorcrummey merged 4 commits intoHPCToolkit:masterfrom
Jokeren:cuda-11.6-fix
Jan 21, 2022
Merged

CUDA-11.6-fix#509
jmellorcrummey merged 4 commits intoHPCToolkit:masterfrom
Jokeren:cuda-11.6-fix

Conversation

@Jokeren
Copy link
Copy Markdown
Member

@Jokeren Jokeren commented Jan 21, 2022

Fix a cuda cfg parsing problem for CUDA >= 11.5: target labels are changed from .L_ to .L_x_, so new code is added to extract name using the new pattern.

Interestingly, using cuda-11.6 collects 13% less samples than cuda-11.4 for quicksilver, but it has nothing to do with postmortem analysis. I confirmed this issue using hpctoolkit's log file.

…ged from .L_<number> to .L_x_<number>, so new code is added to extract name using the new pattern
Copy link
Copy Markdown
Collaborator

@mxz297 mxz297 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jmellorcrummey
Copy link
Copy Markdown
Member

Keren and I had a discussion over zoom about making the code a bit less fragile. Rather than matching anything after .L, he could skip past non-digits and just grab the digits afterwards. There also is no need for two induction variables label_length and digit_pos, which both specify the length of the digit string. After the .L, the string position can be advanced to digit_start, digit_end can be found by skipping all digits, and the length can be computed as the difference.

@Jokeren
Copy link
Copy Markdown
Member Author

Jokeren commented Jan 21, 2022

@jmellorcrummey Thanks for the suggestion. Please check the simplified code.

Copy link
Copy Markdown
Member

@jmellorcrummey jmellorcrummey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The second version looks great. I'll merge it immediately.

@jmellorcrummey jmellorcrummey merged commit e06dd34 into HPCToolkit:master Jan 21, 2022
Jokeren added a commit that referenced this pull request Jan 29, 2022
* Fix cuda CFG parsing problem for CUDA >= 11.5: target label names were changed from .L_<number> to .L_x_<number>. Code was adjusted to be less sensitive to the form of the label name.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants