Shared memory in HOMI by NaddiNadja · Pull Request #35 · xnvme/aisio

NaddiNadja · 2026-03-19T12:57:59Z

This PR changes the current IPC setup for HOMI.

From: using only sockets, both for connection/disconnection and for runtime requests and responses between the clients and the daemon. The main thread listens for requests and spawns a worker thread for each new request. Threads were used so that one request would not block requests from other processes from completing, and as each client had its own socket_fd, there were no data races when completing requests concurrently.
To: using a socket to establish a connection from a client to the daemon. The daemon then maps out a piece of shared memory for that specific client, and starts a worker thread, which listens to and handles requests in that piece of shared memory. There is a segment of shared memory for each client, such that we can still use threads to complete requests from different processes concurrently without running into data races.

This PR does not change the client interface, and any program written with the HOMI client before this commit (e.g. the test program from #27) still works.
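To make the new flow concrete, here is a minimal sketch of one request/response round trip through a per-client SysV segment. It is not the PR's code: the `demo_shm` layout, the `demo_*` names, and the "add one" request are all assumptions made for illustration, `fork()` stands in for a separate client process, and the socket handshake and the daemon's worker thread are left out.

```c
#include <assert.h>
#include <errno.h>
#include <semaphore.h>
#include <sys/ipc.h>
#include <sys/shm.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical per-client segment layout; the PR's real struct may differ. */
struct demo_shm {
	sem_t request_ready;  /* posted by the client once a request is written */
	sem_t response_ready; /* posted by the daemon once the response is written */
	int value;            /* stands in for the request/response payload */
};

/* Wait on a semaphore, retrying if a signal interrupts the wait. */
static void demo_wait(sem_t *sem)
{
	while (sem_wait(sem) == -1 && errno == EINTR)
		;
}

/* One round trip: the "client" sends a number, the "daemon" replies with number + 1. */
static int demo_roundtrip(int request)
{
	/* IPC_PRIVATE lets the kernel pick a unique id for this client's segment. */
	int shmid = shmget(IPC_PRIVATE, sizeof(struct demo_shm), IPC_CREAT | 0600);
	if (shmid < 0)
		return -1;

	struct demo_shm *daemon_view = shmat(shmid, NULL, 0);
	/* Second attachment: stands in for the client's shmat() on the id it was sent. */
	struct demo_shm *client_view = shmat(shmid, NULL, 0);
	/* Mark for removal; the segment is destroyed once the last attachment is gone. */
	shmctl(shmid, IPC_RMID, NULL);

	int result = -1;
	if (daemon_view == (void *)-1 || client_view == (void *)-1)
		goto detach;
	/* pshared = 1: the semaphores are used across processes. */
	if (sem_init(&daemon_view->request_ready, 1, 0) != 0 ||
	    sem_init(&daemon_view->response_ready, 1, 0) != 0)
		goto detach;

	pid_t pid = fork();
	if (pid < 0)
		goto detach;
	if (pid == 0) {
		/* Client: write the request, signal, wait for the response. */
		client_view->value = request;
		sem_post(&client_view->request_ready);
		demo_wait(&client_view->response_ready);
		_exit(client_view->value == request + 1 ? 0 : 1);
	}

	/* Daemon worker: wait for the request, handle it in place, signal. */
	demo_wait(&daemon_view->request_ready);
	daemon_view->value += 1;
	sem_post(&daemon_view->response_ready);

	int status = 0;
	pid_t waited;
	while ((waited = waitpid(pid, &status, 0)) == -1 && errno == EINTR)
		;
	if (waited == pid && WIFEXITED(status) && WEXITSTATUS(status) == 0)
		result = daemon_view->value;

detach:
	if (daemon_view != (void *)-1)
		shmdt(daemon_view);
	if (client_view != (void *)-1)
		shmdt(client_view);
	return result;
}
```

Calling `demo_roundtrip(41)` should return 42 when the segment and semaphores can be created. Because each client gets its own segment and its own pair of semaphores, two workers never touch the same memory, which is the property the description relies on.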

safl

I initially suggested using the uPCIe hugepage helpers on Discord, but looking at the code more closely they're not a good fit here. That said, consider POSIX shared memory (shm_open/mmap) over SysV (shmget/shmat). It's consistent with the POSIX semaphores already used and segments are visible in /dev/shm/ for debugging.
Then again, this is probably mostly my bias after fiddling with mmap for hugepages in uPCIe. Are there advantages to the SysV approach here that I'm missing?
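For comparison, a minimal sketch of the POSIX variant suggested above, using `shm_open`, `ftruncate` and `mmap`. The segment name `/homi-demo-posix` and the `demo_*` helpers are made up for this example; both mappings live in one process only to keep the sketch short, whereas in HOMI the second mapping would be made by the client.

```c
#include <assert.h>
#include <fcntl.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/* Create (create != 0) or open a named POSIX segment and map it; NULL on failure. */
static void *demo_posix_map(const char *name, size_t size, int create)
{
	int flags = create ? (O_CREAT | O_EXCL | O_RDWR) : O_RDWR;
	int fd = shm_open(name, flags, 0600);
	if (fd < 0)
		return NULL;

	/* A new segment has size zero and must be sized before it is mapped. */
	if (create && ftruncate(fd, (off_t)size) != 0) {
		close(fd);
		shm_unlink(name);
		return NULL;
	}

	void *addr = mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
	close(fd); /* the mapping stays valid after the descriptor is closed */
	if (addr == MAP_FAILED) {
		if (create)
			shm_unlink(name);
		return NULL;
	}
	return addr;
}

/* Write through one mapping, read through the other. Returns 0 on success. */
static int demo_posix_roundtrip(void)
{
	const char *name = "/homi-demo-posix"; /* hypothetical; on Linux this shows up as /dev/shm/homi-demo-posix */
	const size_t size = 4096;

	shm_unlink(name); /* drop a stale segment from an earlier run, if any */

	char *daemon_view = demo_posix_map(name, size, 1);
	if (!daemon_view)
		return -1;
	char *client_view = demo_posix_map(name, size, 0);
	if (!client_view) {
		munmap(daemon_view, size);
		shm_unlink(name);
		return -1;
	}

	strcpy(daemon_view, "extent-info");
	int same = strcmp(client_view, "extent-info") == 0;

	munmap(client_view, size);
	munmap(daemon_view, size);
	shm_unlink(name);
	return same ? 0 : -1;
}
```

While the segment exists, `ls /dev/shm` on Linux lists it by name, which is the debugging convenience mentioned in the comment.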

safl · 2026-03-19T22:25:02Z

+
+	*hdr = shm->hdr;
+
+	if (hdr->payload_len > 0) {


Could you just return a pointer to shm->payload instead of allocating payload buffers? I am thinking that a benefit of shared memory is exactly that the memory is shared, so as long as you are only reading the memory, you do not need to "copy it out of shared memory"?

On the client side, I will need to copy it out of shared memory if I want to be able to use the information after sending another request, right? I can however change it, such that it's only in the client code that the data is copied out, since the daemon does not need to persist the data read.
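The split discussed here (the daemon reads the payload in place, the client copies what it wants to keep) could look roughly like the sketch below. The field names `hdr`, `payload_len` and `payload` follow the diff; the struct layout, the fixed `DEMO_PAYLOAD_MAX`, and the `demo_*` functions are assumptions for illustration.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define DEMO_PAYLOAD_MAX 256

/* Hypothetical message layout inside the shared segment. */
struct demo_hdr {
	uint32_t type;
	uint32_t payload_len;
};

struct demo_msg {
	struct demo_hdr hdr;
	unsigned char payload[DEMO_PAYLOAD_MAX];
};

/*
 * Daemon side: copy out only the small header and borrow the payload in place.
 * The returned pointer is valid only until the slot is reused for the next message.
 */
static const void *demo_payload_borrow(const struct demo_msg *shm, struct demo_hdr *hdr)
{
	*hdr = shm->hdr;
	/* Validate the local copy, so the length cannot change after the check. */
	if (hdr->payload_len == 0 || hdr->payload_len > DEMO_PAYLOAD_MAX)
		return NULL;
	return shm->payload;
}

/*
 * Client side: take a private copy, so the data survives the next request.
 * The caller owns the returned buffer and must free() it.
 */
static void *demo_payload_copy(const struct demo_msg *shm, struct demo_hdr *hdr)
{
	const void *src = demo_payload_borrow(shm, hdr);
	if (!src)
		return NULL;

	void *dst = malloc(hdr->payload_len);
	if (dst)
		memcpy(dst, src, hdr->payload_len);
	return dst;
}
```

A borrowed pointer sees whatever is written into the slot next, while a copy keeps the old bytes. That difference is why the client still needs the copy if it wants to use a response after sending another request.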

In my current implementation, I have a small section of shared memory for each of the clients, and when they query something from the daemon, it copies the information into shared memory - e.g. if a client wanted extent info, the daemon would find it with xallib and copy it into shared memory. This was my "direct" translation of socket->shm.
However, yesterday I realised that I might be able to put the whole xal tree in shared memory, since shmat supports giving it an address, so maybe I could just give it the pointer to the xallib pool of extents? That way, xallib is responsible for creating the whole tree etc., and HOMI then just makes the memory pool shared. Do you think that would work? Because then I would not have to worry about copying to/from shared memory, I could just return the pointer to where the memory is (but I would maybe still have to copy the pointer into shared memory, so that the client knows where to read from)

Okay, I don't think that approach would work. It would require changes to xallib directly.

Okay, I was maybe on the right track anyways, and I understand now why you want to switch to the POSIX-style of shared memory: for example to be compatible with xallib. The pools are already mmapped, so I could make this shared, and in that way make it accessible to the client processes. I will fiddle around with this a bit more
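One thing to watch when sharing a whole pool: a client will usually map it at a different address than the daemon, so raw pointers stored inside the pool would not be valid on the client side. The sketch below shows the common workaround of linking entries by index instead. How xallib actually lays out and links its pools is not shown in this thread, so the `demo_extent` struct and its `next` index are purely hypothetical; two mappings in one process stand in for the daemon and a client.

```c
#include <assert.h>
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

#define DEMO_NIL UINT32_MAX

/* Hypothetical pool entry: links are pool indices, not pointers. */
struct demo_extent {
	uint64_t start;
	uint64_t nblocks;
	uint32_t next; /* index of the next extent in the pool, or DEMO_NIL */
};

/* Walk a chain relative to whatever base address this process mapped the pool at. */
static uint64_t demo_total_blocks(const struct demo_extent *pool, uint32_t head)
{
	uint64_t total = 0;

	for (uint32_t i = head; i != DEMO_NIL; i = pool[i].next)
		total += pool[i].nblocks;
	return total;
}

/* Build a two-entry chain through one mapping and read it through another. */
static int demo_shared_pool(void)
{
	const size_t size = 4096;
	char name[64];

	snprintf(name, sizeof(name), "/homi-demo-pool-%ld", (long)getpid());
	shm_unlink(name);
	int fd = shm_open(name, O_CREAT | O_EXCL | O_RDWR, 0600);
	if (fd < 0)
		return -1;
	shm_unlink(name); /* the open descriptor keeps the object alive */
	if (ftruncate(fd, (off_t)size) != 0) {
		close(fd);
		return -1;
	}

	/* Two mappings of the same object land at two different addresses. */
	struct demo_extent *writer = mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
	const struct demo_extent *reader = mmap(NULL, size, PROT_READ, MAP_SHARED, fd, 0);
	close(fd);

	int ok = 0;
	if (writer != MAP_FAILED && reader != MAP_FAILED) {
		writer[0] = (struct demo_extent){ .start = 0, .nblocks = 8, .next = 1 };
		writer[1] = (struct demo_extent){ .start = 100, .nblocks = 4, .next = DEMO_NIL };
		ok = reader != writer && demo_total_blocks(reader, 0) == 12;
	}

	if (writer != MAP_FAILED)
		munmap(writer, size);
	if (reader != MAP_FAILED)
		munmap((void *)reader, size);
	return ok ? 0 : -1;
}
```

If the pool does store real pointers, the alternatives are to map it at the same fixed address in every process or to translate pointers on access, both of which would probably touch xallib itself.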

safl · 2026-03-19T22:35:13Z

+		goto failed;
+	}
+
+	wargs->shm->done = 0;


potential-race-on-done -- make it atomic?

I mention this since the semaphore guarantees the semaphore value, not the done value, so I suspect that there might be a race. To avoid it, possibly drop in an atomic store here: atomic_store(&wargs->shm->done, 0);

This would of course also require defining done as an atomic.

I don't understand this. If the semaphore does not guarantee that the rest of the shared memory is ready to be read, how can I ever make sure that the data I read from it is accurate? The done field is not updated across multiple threads in the daemon, it's only set to 1 by the client, when it exits - it could be called something like client_exited to make this more clear.

safl · 2026-03-19T22:36:29Z

-		if (!request) {
-			homid_log(LOG_ERR, "Error: Payload required for HELLOWORLD request");
-			goto exit;
+		if (wargs->shm->done) {


potential-race-on-done -- make it atomic?

I mention this since the semaphore guarantees the semaphore value, not the done value, so I suspect that there might be a race. To avoid it, possibly drop in an atomic load here:

if (atomic_load(&wargs->shm->done)) { break; }
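Put together, the two suggestions on `done` might look like the sketch below. The `demo_ctl` struct and helper names are hypothetical; only the `done` field and the `atomic_store`/`atomic_load` calls come from the review. As background for the question above: POSIX lists `sem_post` and `sem_wait` among the functions that synchronize memory, so data written before a post and read after the matching wait should be visible. The atomic only matters if `done` can be written or read outside that ordering, for example when the client sets it while the worker is between waits.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Atomics shared between processes should be lock-free (address-free). */
_Static_assert(ATOMIC_INT_LOCK_FREE == 2, "atomic_int must be lock-free for cross-process use");

/* Hypothetical control block at the start of a client's shared segment. */
struct demo_ctl {
	atomic_int done; /* set to 1 by the client when it exits */
};

/* Daemon: reset the flag when the segment is handed to a new client. */
static void demo_ctl_init(struct demo_ctl *ctl)
{
	atomic_store(&ctl->done, 0);
}

/* Client: announce that no further requests will follow. */
static void demo_client_exit(struct demo_ctl *ctl)
{
	atomic_store(&ctl->done, 1);
}

/* Worker: checked at the top of each loop iteration, before handling a request. */
static bool demo_worker_should_stop(struct demo_ctl *ctl)
{
	return atomic_load(&ctl->done) != 0;
}
```

In the worker loop this would read `if (demo_worker_should_stop(ctl)) break;`. Renaming the field to something like `client_exited`, as proposed in the reply, would work the same way.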

karlowich

I agree with @safl's comments. Otherwise it looks clean to me.

NaddiNadja · 2026-03-23T09:51:27Z

Are there advantages to the SysV approach here that I'm missing?

The only one I can think of is that I generate a shared memory segment for each client with a unique ID, and by using IPC_PRIVATE I get a kernel-assigned unique ID, so I don't have to worry about generating it myself. There does not seem to be an equivalent in the POSIX approach, so if I move to that, I will have to do some bookkeeping for this. I find it difficult to estimate whether that "inconvenience" is outweighed by the benefits of switching to POSIX.
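The bookkeeping for POSIX names can be fairly small. One possible approach, sketched below, derives the name from the daemon's pid and a counter, and lets `O_CREAT | O_EXCL` detect a collision so the next candidate can be tried. The `/homi-demo-` prefix, the retry limit, and the `demo_create_unique` helper are assumptions, not part of HOMI.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/*
 * Create a segment with a name unique to this daemon process.
 * On success, returns an open descriptor and leaves the chosen name in `name`
 * (to be sent to the client over the socket). Returns -1 on failure.
 * Not thread-safe: assumes only the accepting thread creates segments.
 */
static int demo_create_unique(char *name, size_t name_len)
{
	static unsigned long counter;

	for (int attempt = 0; attempt < 16; attempt++) {
		int n = snprintf(name, name_len, "/homi-demo-%ld-%lu",
				 (long)getpid(), counter++);
		if (n < 0 || (size_t)n >= name_len)
			return -1; /* name buffer too small */

		/* O_EXCL makes creation fail with EEXIST if the name is already taken. */
		int fd = shm_open(name, O_CREAT | O_EXCL | O_RDWR, 0600);
		if (fd >= 0)
			return fd;
		if (errno != EEXIST)
			return -1;
	}
	return -1;
}
```

The caller would then `ftruncate` and `mmap` the descriptor, and `shm_unlink` the name once the client has disconnected.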

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

NaddiNadja · 2026-03-24T09:39:34Z

Hold off from reviewing this, I have to try some things out now that I have rebased onto Siu's commit with the xallib caching :)

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

The homid struct holds information that the worker threads handling client requests may need. The argument of homid_ipc_accept() is changed from the connection struct to the whole homid struct, so that it can be passed on to the worker arguments. Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

A reader and writer function for shared memory, as using this for IPC is more performant than using sockets. Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

This commit changes the current IPC setup for HOMI.
- From: using only sockets, both for connection/disconnection and for runtime requests and responses between the clients and the daemon. The main thread listens for requests and spawns a worker thread for each new request. Threads were used so that one request would not block requests from other processes from completing, and as each client had its own socket_fd, there were no data races when completing requests concurrently.
- To: using a socket to establish a connection from a client to the daemon. The daemon then maps out a piece of shared memory for that specific client, and starts a worker thread, which listens to and handles requests in that piece of shared memory. There is a segment of shared memory for each client, such that we can still use threads to complete requests from different processes concurrently without running into data races.
This commit does not change the client interface, and any program written with the HOMI client before this commit still works. Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

With the move to shared memory, the read and write function for the socket is no longer used or necessary in the shared protocol between the HOMI client and daemon. Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

NaddiNadja · 2026-05-12T14:06:44Z

Replaced by #65

NaddiNadja requested a review from safl March 19, 2026 13:58

safl reviewed Mar 19, 2026


karlowich reviewed Mar 20, 2026


NaddiNadja force-pushed the homi-shm branch from c51eb38 to 4ded4d3 Compare March 23, 2026 14:15

fix(homid): add error handling to homi_ipc_connect

6022380

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

NaddiNadja force-pushed the homi-shm branch from 4ded4d3 to d28c02f Compare March 24, 2026 08:28

NaddiNadja marked this pull request as draft March 24, 2026 09:39

NaddiNadja added 5 commits March 25, 2026 12:57

fix(homi/headers): add include guards to headers

ee703e6

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

feat(homi/proto): add interface for shared memory

f627d8a


fix(homi/proto): remove unused socket_read/write

3124739


NaddiNadja force-pushed the homi-shm branch from d28c02f to ff3979b Compare March 25, 2026 12:16

NaddiNadja added 3 commits March 26, 2026 14:27

fix(homi/xal): use struct instead of ptr size for calloc

e533c42

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

feat(homi/xal): add device getter

d467271

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

feat(homi/ipc): add ipc for xal struct

5296e9b

Signed-off-by: Nadja Brix Koch <n.koch@samsung.com>

NaddiNadja force-pushed the homi-shm branch from ff3979b to 5296e9b Compare March 26, 2026 13:30

safl force-pushed the main branch 2 times, most recently from 549d5c4 to 1f12ab7 Compare April 27, 2026 23:10

safl mentioned this pull request May 12, 2026

IPC for xal extent tree #65

Open

NaddiNadja closed this May 12, 2026

NaddiNadja wants to merge 9 commits into main from homi-shm

3 participants
