-
Notifications
You must be signed in to change notification settings - Fork 12k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[OpenMP] Improve omp offload profiler #68016
Conversation
✅ With the latest revision this PR passed the C/C++ code formatter. |
…mprove LIBOMPTARGET profiling, as it allows to measure more events. It was part of PR llvm#68016
openmp/libomptarget/src/private.h
Outdated
std::string RTM = RegionTypeMsg; \ | ||
llvm::TimeTraceScope TimeScope(__FUNCTION__, ProfileLocation + RTM) | ||
// llvm::TimeTraceScope TimeScope(RegionTypeMsg, ProfileLocation + RTM) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
minor: commented-out code
I have prepared a presentation to better explain the proposed changes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LG. Please rebase and merge.
…ntime calls and different ones for kernel lunches and memory transfers.
…xtra info in kernels and device
…n that the profile added the colons to make the total more clear in the trace
…it will be a separate PR
…sary for the changes to the llvm/lib/Support/TimeProfiler.cpp, which was removed from this PR
53bcadb
to
d56e4b2
Compare
Summary:
Adding information to the LIBOMPTARGET profiler runtime kernel and API calls.
Key changes:
is executing. For example teams requested by the user, size of memory transfers.
to register some important details like key kernel duration
developers.