CUDA profiler hooks #81

jaredhoberock opened this Issue May 7, 2012 · 0 comments

1 participant

A Parallel Algorithms Library member

When profiling Thrust applications with tools like the CUDA Visual Profiler or Parallel Nsight it would be useful to have algorithms reported in a more straightforward way. Specifically, rather than having each individual kernel, with their lengthly mangled names, appear in the profile it would be preferable to have simple expressions like thrust::sort or perhaps thrust::sort<FooIterator>.

Ideally the tools above would support some sort of stack-based mechanism to aggregate and nest kernels into logical algorithms.

Forwarded from

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment