Add Multicore Guide explaining the new memory model #145

talex5 · 2022-01-13T14:51:12Z

This is my understanding of OCaml's memory model. Someone more knowledgable should check it for accuracy.

doc/multicore.md

jmid · 2022-01-14T10:58:39Z

doc/multicore.md

+
+We can solve this by relying on a useful feature of atomics: every atomic also has a frontier of its own (a location on every non-atomic location's timeline).
+Writing to an atomic updates its frontier with information from the writing CPU's frontier
+(so it's at least as up-to-date as the writer).


I initially read this as updating only the frontier of the atomic.
Looking at the PLDI'18-paper https://kcsrk.info/papers/pldi18-memory.pdf
I understand rule (While-AT) in Fig.1c as updating both frontiers.
This is also how I understand KC's example of an atomic write on sl.21-22
https://speakerdeck.com/kayceesrk/bounding-data-races-in-space-and-time?slide=78
which updates both the red frontier for the atomic A and thread 2's frontier for variable b.

I'm fairly new to the memory-model though, so take this with a grain of salt.
If my understanding is correct, you could consider rephrasing to something like:

"Writing to an atomic updates both the atomic's frontier and the writing CPU's frontier
to contain the most up-to-date entries for each."

For the example it shouldn't change anything though.

I think you're right. I've fixed the text. I wonder why it's like that, though?

jmid · 2022-01-14T11:04:16Z

Thanks! This is a very nice explanation IMO - well done! 😀🙏‍

While reading it I found two small nits (pointed out inline). Thanks again!

ctk21

This is a good introduction to the memory model and I like the use of 'frontiers'.

There's one thing that's not quite right about the garbage collector. Multicore as upstreamed is using a 'parallel minor collector' in which there is no read barrier, private minor heaps or changes to the C-API. However the parallel minor collector does have synchronised minor heap promotion which means there are periods when no domain can execute OCaml code as all minor heaps are being promoted.

It isn't clear this document gains from discussing minor heap collection, so a fix could be to remove those lines.

For those interested https://arxiv.org/abs/2004.11663 has more on the parallel minor collector vs the concurrent minor collector.

ctk21 · 2022-01-14T11:41:22Z

doc/multicore.md

+The way this works on a real system is interesting.
+Neither `x` nor the list items are atomic, so the system is free to optimise things as it pleases.
+In particular, it might update `x` to point at the new list's address before writing the list.
+However, the new list will be allocated in the writing domain's minor heap, which is private to that domain.


In the parallel minor GC, all minor heaps can be read by all domains and links can occur between minor heaps.
To make this work, all mutators must stop to perform minor heap promotion across all domains; we call these periods where all domains are stopped "stop-the-world" sections.

ctk21 · 2022-01-14T11:42:39Z

doc/multicore.md

+In particular, it might update `x` to point at the new list's address before writing the list.
+However, the new list will be allocated in the writing domain's minor heap, which is private to that domain.
+If the second branch sees the new pointer value, it will notice that it points into another domain's minor heap.
+Instead of accessing it directly, it will send a message to that domain asking for the value to be promoted to the major heap.


In the 'concurrent minor collector' this is how things worked. However the 'parallel minor collector' does not impose read barriers and promotion. Instead it enforces stop-the-world sections where all minor heaps across all domains are promoted in parallel.

talex5 · 2022-01-14T16:35:22Z

@ctk21: thanks - I've removed that section.

But now I'm curious how it works in the new system. I suppose mutating a field creates some kind of barrier to ensure the thing it now points to is written first?

ctk21 · 2022-01-17T09:42:07Z

caml_modify contains the required barrier for mutable writes. Notice this is independent of the minor collection scheme and if the mutable field is in the minor heap, because we always have to deal with programs mutating blocks in the major heap.

talex5 force-pushed the multicore-guide branch 6 times, most recently from 9f024de to c372bcd Compare January 13, 2022 15:42

jmid reviewed Jan 14, 2022

View reviewed changes

doc/multicore.md Outdated Show resolved Hide resolved

jmid reviewed Jan 14, 2022

View reviewed changes

ctk21 reviewed Jan 14, 2022

View reviewed changes

Add Multicore Guide explaining the new memory model

1307c96

talex5 force-pushed the multicore-guide branch from c372bcd to d1acac7 Compare January 14, 2022 16:23

Apply corrections from Tom Kelly and Jan Midtgaard

6d72268

talex5 force-pushed the multicore-guide branch from d1acac7 to 6d72268 Compare January 14, 2022 16:24

talex5 merged commit 50a6b59 into ocaml-multicore:main Jan 17, 2022

talex5 deleted the multicore-guide branch January 17, 2022 08:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Multicore Guide explaining the new memory model #145

Add Multicore Guide explaining the new memory model #145

talex5 commented Jan 13, 2022

jmid Jan 14, 2022

talex5 Jan 14, 2022

jmid commented Jan 14, 2022

ctk21 left a comment

ctk21 Jan 14, 2022

ctk21 Jan 14, 2022

talex5 commented Jan 14, 2022

ctk21 commented Jan 17, 2022

Add Multicore Guide explaining the new memory model #145

Add Multicore Guide explaining the new memory model #145

Conversation

talex5 commented Jan 13, 2022

jmid Jan 14, 2022

Choose a reason for hiding this comment

talex5 Jan 14, 2022

Choose a reason for hiding this comment

jmid commented Jan 14, 2022

ctk21 left a comment

Choose a reason for hiding this comment

ctk21 Jan 14, 2022

Choose a reason for hiding this comment

ctk21 Jan 14, 2022

Choose a reason for hiding this comment

talex5 commented Jan 14, 2022

ctk21 commented Jan 17, 2022