Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ring-buffer Inter-Process Interface #302

Open
cvonelm opened this issue Sep 13, 2023 · 0 comments
Open

Ring-buffer Inter-Process Interface #302

cvonelm opened this issue Sep 13, 2023 · 0 comments
Assignees

Comments

@cvonelm
Copy link
Member

cvonelm commented Sep 13, 2023

CUPTI PC Sampling (see #294) can only be done from the program that executes the CUDA Kernels itself.

This means that implementing CUPTI support in lo2s is only possible by creating a separate CUPTI sampling support library and using LD_PRELOAD to inject it into the application under measure.

This of course needs some mechanism for the injected library to communicate with lo2s itself, most likely using a ring buffer over shared-memory.

As such a foreign interface might be useful outside of the CUPTI directly, i think this inter-process interface warrants its own discussion.

There are two direct questions:

  1. How should the technical solution look like? shm_open+mmap+own ring buffer implementation, or is there already a turnkey solution for it?
  2. How much genericity should we bake into the design?
@cvonelm cvonelm self-assigned this May 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant