Skip to content
This repository was archived by the owner on Apr 2, 2025. It is now read-only.

Conversation

@mxz297
Copy link
Collaborator

@mxz297 mxz297 commented Mar 24, 2022

This PR was originally merged to the develop branch and it is now cherry-picked to master so that we can do a release for AMD platforms. This seems to work reasonably well on Crusher using rocm-5.0.2 with the matrix multiplication test. Original PR message below:

This is a huge merge to develop . It includes three main components:

  1. Preliminary rocprofiler support for getting AMD GPU hardware counters.
  2. Prelminary support for AOMP (AMD's openmp implementation).
  3. Breakdown of hpcrun initialization and reordering.

(cherry picked from commit 627d4da)

This is a huge merge to develop. It includes three main components:

1. Preliminary rocprofiler support for getting AMD GPU hardware counters.
2. Prelminary support for AOMP (AMD's openmp implementation).
3. Breakdown of hpcrun initialization and reordering.

(cherry picked from commit 627d4da)
int verbose
);

#if defined(__cplusplus)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This if defined(__cplusplus) should be deleted. It is an unmatched duplicate inside an if 0. The whole if 0 can be deleted if you choose.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AC_SUBST([OPT_GTPIN_LIBDIR])


#-------------------------------------------------
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove redundant copies of --with-igc, with-metrics-discovery, with-gtpin

Copy link
Member

@jmellorcrummey jmellorcrummey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The PR is OK. There is a bit of redundant code (e.g. in configure.ac). There is some unused code (the gpu_monitor layer) and the hip_api layer. However, our plan is to clean this up in develop rather than worry about it here. Master will soon be replaced.

@jmellorcrummey jmellorcrummey merged commit 419c1f9 into master Mar 28, 2022
@jmellorcrummey jmellorcrummey deleted the rocprofiler-master branch March 28, 2022 19:10
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants