CUDA-11.6-fix by Jokeren · Pull Request #509 · HPCToolkit/hpctoolkit

Jokeren · 2022-01-21T22:25:47Z

Fix a cuda cfg parsing problem for CUDA >= 11.5: target labels are changed from .L_ to .L_x_, so new code is added to extract name using the new pattern.

Interestingly, using cuda-11.6 collects 13% less samples than cuda-11.4 for quicksilver, but it has nothing to do with postmortem analysis. I confirmed this issue using hpctoolkit's log file.

…ged from .L_<number> to .L_x_<number>, so new code is added to extract name using the new pattern

mxz297

LGTM

jmellorcrummey · 2022-01-21T23:27:26Z

Keren and I had a discussion over zoom about making the code a bit less fragile. Rather than matching anything after .L, he could skip past non-digits and just grab the digits afterwards. There also is no need for two induction variables label_length and digit_pos, which both specify the length of the digit string. After the .L, the string position can be advanced to digit_start, digit_end can be found by skipping all digits, and the length can be computed as the difference.

Jokeren · 2022-01-21T23:34:12Z

@jmellorcrummey Thanks for the suggestion. Please check the simplified code.

jmellorcrummey

The second version looks great. I'll merge it immediately.

* Fix cuda CFG parsing problem for CUDA >= 11.5: target label names were changed from .L_<number> to .L_x_<number>. Code was adjusted to be less sensitive to the form of the label name.

Fix cuda cfg parsing problem for CUDA >= 11.5: target labels are chan…

a071ecb

…ged from .L_<number> to .L_x_<number>, so new code is added to extract name using the new pattern

Jokeren mentioned this pull request Jan 21, 2022

hpcstruct fails for quicksilver GPU binary if using nvdisasm 11.5 or 11.6 #508

Closed

Jokeren requested review from jmellorcrummey and mxz297 January 21, 2022 22:26

Fix lint

76745e8

mxz297 approved these changes Jan 21, 2022

View reviewed changes

Jokeren added 2 commits January 21, 2022 17:31

Simplify the pattern matching logic

9fa8138

Protect corruption for corner cases

fd49c3e

jmellorcrummey approved these changes Jan 21, 2022

View reviewed changes

jmellorcrummey merged commit e06dd34 into HPCToolkit:master Jan 21, 2022

Jokeren added a commit that referenced this pull request Jan 29, 2022

CUDA-11.6-fix (#509)

f1d4734

* Fix cuda CFG parsing problem for CUDA >= 11.5: target label names were changed from .L_<number> to .L_x_<number>. Code was adjusted to be less sensitive to the form of the label name.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA-11.6-fix#509

CUDA-11.6-fix#509
jmellorcrummey merged 4 commits intoHPCToolkit:masterfrom
Jokeren:cuda-11.6-fix

Jokeren commented Jan 21, 2022

Uh oh!

mxz297 left a comment

Uh oh!

jmellorcrummey commented Jan 21, 2022

Uh oh!

Jokeren commented Jan 21, 2022

Uh oh!

jmellorcrummey left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Jokeren commented Jan 21, 2022

Uh oh!

mxz297 left a comment

Choose a reason for hiding this comment

Uh oh!

jmellorcrummey commented Jan 21, 2022

Uh oh!

Jokeren commented Jan 21, 2022

Uh oh!

jmellorcrummey left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants