event_monitor: refactor the implementation to support concurrent access #5633

brkp · 2023-07-29T23:02:58Z

This patch modifies event_monitor to ensure that concurrent access to event_log from multiple threads is safe. Previously, the event_log function would acquire a reference to the event log file and write to it without doing any synchronization, which made it prone to data races. This issue likely went under the radar because the relevant SAFETY comment on the unsafe block was incomplete.

The new implementation spawns a dedicated thread named event-monitor solely for writing to the file. It uses the MPMC channel exposed by flume to pass messages to the event-monitor thread. Since flume::Sender<T> implements Sync, it is safe for multiple threads to share it and send messages to the event-monitor thread.

I looked into doing this with the unbounded MPSC channel in the standard library, but unfortunately its Sender is !Sync, which is actually considered an API mistake. This means the following snippet is unsound if tx is a std::sync::mpsc::Sender<T>:

[...]
    if let Some(monitor_handle) = unsafe { MONITOR.as_ref() } {
        [...]
        if let Ok(event) = serde_json::to_string_pretty(&event) {
            // calling this from multiple threads is not safe
            // if `tx` is `!Sync`
            monitor_handle.tx.send(event).ok();
        }
    }
[...]

If anyone is aware of a workaround/better pattern for implementing this with the MPSC in the standard library, I'd love to hear about it.
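One possible workaround with the standard library, offered only as a sketch: wrap the sender in a `Mutex`, since `Mutex<T>` is `Sync` whenever `T` is `Send`. The `MonitorHandle` struct, `event_log` method, and `run_demo` helper below are hypothetical names for illustration and are not the code in this PR.

```rust
use std::sync::mpsc::{self, Sender};
use std::sync::Mutex;
use std::thread;

/// Hypothetical handle; the Mutex makes `tx` usable through `&self`
/// from several threads without relying on `Sender` being `Sync`.
struct MonitorHandle {
    tx: Mutex<Sender<String>>,
}

impl MonitorHandle {
    /// Sends one event; the lock serializes access to the sender.
    fn event_log(&self, event: String) {
        if let Ok(tx) = self.tx.lock() {
            tx.send(event).ok();
        }
    }
}

/// Logs one event from each of `producers` threads and returns what the
/// dedicated thread received, sorted for a stable result.
fn run_demo(producers: usize) -> Vec<String> {
    let (tx, rx) = mpsc::channel();
    let handle = MonitorHandle { tx: Mutex::new(tx) };

    // Dedicated consumer thread, standing in for the file writer.
    let writer = thread::Builder::new()
        .name("event-monitor".into())
        .spawn(move || rx.iter().collect::<Vec<String>>())
        .expect("failed to spawn thread");

    // Scoped threads share `&handle`, which requires `MonitorHandle: Sync`.
    thread::scope(|s| {
        for i in 0..producers {
            let handle = &handle;
            s.spawn(move || handle.event_log(format!("event-{i}")));
        }
    });

    // Dropping the only sender closes the channel and ends the consumer loop.
    drop(handle);
    let mut events = writer.join().expect("writer thread panicked");
    events.sort();
    events
}

fn main() {
    for event in run_demo(3) {
        println!("{event}");
    }
}
```

The lock adds contention on every send. Handing each producer thread its own clone of the `Sender` avoids the lock, but does not fit a single global handle accessed by shared reference.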

Here are some links that can provide more context for this PR:

likebreath

The current implementation looks good to me. How does the "event-monitor" thread exit gracefully? Also, it looks like this thread won't terminate together with the other threads upon exit_evt.

rbradford · 2023-07-31T09:42:15Z

> The current implementation looks good to me. How does the "event-monitor" thread exit gracefully? Also, it looks like this thread won't terminate together with the other threads upon exit_evt.

exit_evt is used for VM-specific threads. This thread is a VMM thread and, like the HTTP/D-Bus API threads, it will exit when the VMM process terminates. Unless it needs to do some specific cleanup (FDs are closed automatically by the OS), there is no need for any special handling.

event_monitor/src/lib.rs

likebreath · 2023-07-31T20:03:30Z

> > The current implementation looks good to me. How does the "event-monitor" thread exit gracefully? Also, it looks like this thread won't terminate together with the other threads upon exit_evt.
>
> exit_evt is used for VM-specific threads. This thread is a VMM thread and, like the HTTP/D-Bus API threads, it will exit when the VMM process terminates. Unless it needs to do some specific cleanup (FDs are closed automatically by the OS), there is no need for any special handling.

Thanks for the explanation, Rob.

brkp · 2023-08-01T05:57:27Z

Do you think that we should write to exit_evt when the event-monitor thread panics? This is the behavior implemented by other VMM threads as well, e.g. HTTP/D-Bus API threads. @likebreath @rbradford

rbradford · 2023-08-01T06:33:37Z

> > > The current implementation looks good to me. How does the "event-monitor" thread exit gracefully? Also, it looks like this thread won't terminate together with the other threads upon exit_evt.
> >
> > exit_evt is used for VM-specific threads. This thread is a VMM thread and, like the HTTP/D-Bus API threads, it will exit when the VMM process terminates. Unless it needs to do some specific cleanup (FDs are closed automatically by the OS), there is no need for any special handling.
>
> Thanks for the explanation, Rob.

Thanks to @brkp's point, I realize my reply was slightly off. There is an exit_evt in Vmm and it is used by support threads in the VMM (like the HTTP API server), but only to signal that they have panicked or exited prematurely; they don't listen for the event and react to it like the VM support threads do.

rbradford · 2023-08-01T06:34:27Z

> Do you think that we should write to exit_evt when the event-monitor thread panics? This is the behavior implemented by other VMM threads as well, e.g. HTTP/D-Bus API threads. @likebreath @rbradford

Yes - please make it consistent with the other VMM support threads. I think there should also be some new seccomp rules too?
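A minimal sketch of the "signal on panic" behavior discussed here, assuming the thread body is wrapped in `std::panic::catch_unwind`. The `exit_tx` channel is a hypothetical stand-in for writing to the VMM's exit_evt, and `spawn_monitored` and `exit_signalled` are illustrative names, not functions from the code base.

```rust
use std::panic::{self, AssertUnwindSafe};
use std::sync::mpsc::{self, Sender};
use std::thread;

/// Runs `body` on a thread named "event-monitor". If `body` panics, a
/// notification is sent on `exit_tx` (stand-in for a write to exit_evt).
fn spawn_monitored<F>(exit_tx: Sender<()>, body: F) -> thread::JoinHandle<()>
where
    F: FnOnce() + Send + 'static,
{
    thread::Builder::new()
        .name("event-monitor".into())
        .spawn(move || {
            // catch_unwind turns a panic in `body` into an Err value,
            // so the wrapper itself keeps running and can report it.
            if panic::catch_unwind(AssertUnwindSafe(body)).is_err() {
                exit_tx.send(()).ok();
            }
        })
        .expect("failed to spawn thread")
}

/// Returns true if the thread reported a premature exit.
fn exit_signalled(should_panic: bool) -> bool {
    let (exit_tx, exit_rx) = mpsc::channel();
    let handle = spawn_monitored(exit_tx, move || {
        if should_panic {
            panic!("simulated event-monitor failure");
        }
    });
    handle.join().expect("wrapper thread should not panic");
    exit_rx.try_recv().is_ok()
}

fn main() {
    println!("panicking body signalled exit: {}", exit_signalled(true));
    println!("clean body signalled exit: {}", exit_signalled(false));
}
```

This only works when panics unwind; with `panic = "abort"` the process terminates before the notification can be sent.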

rbradford · 2023-08-03T14:59:57Z

@brkp I've drafted this as I think there are still some more bits to do...?

brkp · 2023-08-03T15:25:53Z

> @brkp I've drafted this as I think there are still some more bits to do...?

Thanks -- yeah, sorry I haven't had the time to continue working on this. I also want to improve the error handling in here, alongside the previously mentioned things (seccomp rules, making the thread behavior more consistent with the rest of the VMM threads, etc.).

This patch modifies `event_monitor` to ensure that concurrent access to `event_log` from multiple threads is safe. Previously, the `event_log` function would acquire a reference to the event log file and write to it without doing any synchronization, which made it prone to data races. This issue likely went under the radar because the relevant `SAFETY` comment on the unsafe block was incomplete.

The new implementation spawns a dedicated thread named `event-monitor` solely for writing to the file. It uses the MPMC channel exposed by `flume` to pass messages to the `event-monitor` thread. Since `flume::Sender<T>` implements `Sync`, it is safe for multiple threads to share it and send messages to the `event-monitor` thread. This is not possible with `std::sync::mpsc::Sender<T>` since it's `!Sync`, meaning it is not safe for it to be shared between different threads.

The `event_monitor::set_monitor` function now only initializes the required global state and returns an instance of the `Monitor` struct. This decouples the actual logging logic from the `event_monitor` crate. The `event-monitor` thread is then spawned by the `vmm` crate.

Signed-off-by: Omer Faruk Bayram <omer.faruk@sartura.hr>

brkp · 2023-08-06T20:53:03Z

I've modified the event_monitor::set_monitor function so that now it only initializes the necessary global state and hands out a Monitor struct instance.

This change separates the logging logic from event_monitor, giving the caller more flexibility. vmm then uses this Monitor struct to spawn a thread called event-monitor and implements its own logging logic, which currently only writes to a file.

@rbradford @likebreath
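The split described in this comment could look roughly like the sketch below. It makes several assumptions: it uses `std::sync::mpsc` instead of `flume`, it returns the sending half explicitly instead of storing it in global state, and `Monitor::recv`, `spawn_event_monitor`, and `collect_events` are hypothetical names chosen for illustration.

```rust
use std::io::Write;
use std::sync::mpsc::{self, Receiver, Sender};
use std::thread;

/// Hypothetical receiving half handed to the caller.
pub struct Monitor {
    rx: Receiver<String>,
}

impl Monitor {
    /// Blocks for the next event; returns None once every sender is gone.
    pub fn recv(&self) -> Option<String> {
        self.rx.recv().ok()
    }
}

/// Sketch of the library side: only sets up the channel, no logging logic.
/// (Assumption: the real function keeps the sender in global state.)
pub fn set_monitor() -> (Sender<String>, Monitor) {
    let (tx, rx) = mpsc::channel();
    (tx, Monitor { rx })
}

/// Caller-side logging logic: drain events into any writer and hand the
/// writer back when the channel closes.
fn spawn_event_monitor<W>(monitor: Monitor, mut sink: W) -> thread::JoinHandle<W>
where
    W: Write + Send + 'static,
{
    thread::Builder::new()
        .name("event-monitor".into())
        .spawn(move || {
            while let Some(event) = monitor.recv() {
                // Write errors are ignored to keep the sketch short.
                let _ = writeln!(sink, "{event}");
            }
            sink
        })
        .expect("failed to spawn thread")
}

/// Sends `events` through the channel and returns what the sink received.
fn collect_events(events: &[&str]) -> String {
    let (tx, monitor) = set_monitor();
    let handle = spawn_event_monitor(monitor, Vec::<u8>::new());
    for event in events {
        tx.send(event.to_string()).expect("monitor thread is gone");
    }
    drop(tx); // close the channel so the thread finishes
    String::from_utf8(handle.join().expect("monitor thread panicked")).expect("valid UTF-8")
}

fn main() {
    print!("{}", collect_events(&["booted", "shutdown"]));
}
```

Because the sink is any `Write` implementation, the caller can swap the in-memory buffer used here for a `File` without touching the library side.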

likebreath

Changes look good to me. Just one thing about the seccomp filter. I think it is better to include brk and mmap to avoid potential violations for allocating memory on the event_monitor thread.

new changes

brkp · 2023-08-07T05:23:00Z

Thanks for catching that! @likebreath Added brk and mmap to the list of allowed calls as well.

vmm/src/seccomp_filters.rs

Signed-off-by: Omer Faruk Bayram <omer.faruk@sartura.hr>

peng6662001 · 2023-09-21T08:37:26Z

@brkp Could you please share a way to verify this patch on aarch64?

brkp · 2023-09-21T17:53:13Z

@peng6662001 Would you mind providing a bit more context? I'm not quite sure what you mean by "verify".

peng6662001 · 2023-09-22T02:13:19Z

@brkp Have you ever encountered a "concurrent access" bug? How can it be reproduced?
I use --event-monitor path=/tmp/event.json to start the VM, and the file /tmp/event.json looks fine with the old version of Cloud Hypervisor.

brkp · 2023-09-29T22:38:33Z

@peng6662001 Hey, sorry for the late reply.

The previous implementation of event-monitor allows users to call the event_log function, which writes to a log file, from multiple threads without doing any synchronization.

This becomes problematic when two threads race to write to the log file at the same time. While this may not have been a significant issue in the past due to the scarce use of event-monitor throughout the code base, it is still an incorrect implementation going forward.

Since the 'write()' to the event file was moved to its own thread (see cloud-hypervisor#5633), we have no reliable way to read the latest contents of the event file from our integration tests, since we can't ensure the 'read()' from our test always happens after the 'write()' from Cloud Hypervisor has completed. This is also why we started to see random failures on snapshot_restore tests (particularly when the system workload is high).

This patch adds a 1s sleep before reading the event file to mitigate the random failures.

Signed-off-by: Bo Chen <chen.bo@intel.com>
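The patch above uses a fixed 1s sleep. As an illustration of the underlying race only, the sketch below shows a different technique: polling the file until the expected text appears or a deadline passes. The helper names, the file name, and the JSON content are all hypothetical and are not taken from the integration tests.

```rust
use std::fs;
use std::path::Path;
use std::thread;
use std::time::{Duration, Instant};

/// Re-reads `path` until it contains `needle` or `timeout` elapses.
fn wait_for_event(path: &Path, needle: &str, timeout: Duration) -> bool {
    let deadline = Instant::now() + timeout;
    loop {
        // A missing or not-yet-written file simply means "not yet".
        if fs::read_to_string(path).map_or(false, |s| s.contains(needle)) {
            return true;
        }
        if Instant::now() >= deadline {
            return false;
        }
        thread::sleep(Duration::from_millis(10));
    }
}

/// Simulates a writer thread that lags behind the reader.
fn demo() -> bool {
    let path = std::env::temp_dir().join(format!("event-demo-{}.json", std::process::id()));
    let _ = fs::remove_file(&path);

    let writer_path = path.clone();
    let writer = thread::spawn(move || {
        // The write lands some time after the reader starts looking.
        thread::sleep(Duration::from_millis(50));
        fs::write(&writer_path, "{\"event\": \"restored\"}\n").expect("write failed");
    });

    let seen = wait_for_event(&path, "restored", Duration::from_secs(5));
    writer.join().expect("writer thread panicked");
    let _ = fs::remove_file(&path);
    seen
}

fn main() {
    println!("event observed: {}", demo());
}
```

Polling returns as soon as the event is visible and tolerates slower writers up to the timeout, at the cost of a little more test code than a single sleep.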

brkp requested a review from a team as a code owner July 29, 2023 23:02

brkp mentioned this pull request Jul 29, 2023

D-Bus API and event-monitor integration #5517

Merged

brkp force-pushed the event-monitor-thread-safety branch from 22be4e3 to 992aae7 Compare July 29, 2023 23:06

likebreath reviewed Jul 30, 2023

rbradford previously approved these changes Jul 31, 2023

likebreath previously approved these changes Jul 31, 2023

rbradford marked this pull request as draft August 3, 2023 14:59

brkp force-pushed the event-monitor-thread-safety branch from 992aae7 to fea89bd Compare August 6, 2023 19:27

brkp marked this pull request as ready for review August 6, 2023 20:53

likebreath reviewed Aug 7, 2023

brkp force-pushed the event-monitor-thread-safety branch from fea89bd to 9c647cd Compare August 7, 2023 05:21

likebreath reviewed Aug 8, 2023

vmm: seccomp: implement seccomp filtering for the event-monitor thread

8d104b2

Signed-off-by: Omer Faruk Bayram <omer.faruk@sartura.hr>

brkp force-pushed the event-monitor-thread-safety branch from 9c647cd to 8d104b2 Compare August 8, 2023 05:46

likebreath approved these changes Aug 9, 2023

rbradford approved these changes Aug 9, 2023

rbradford merged commit a0c8bf4 into cloud-hypervisor:main Aug 9, 2023
22 checks passed

brkp deleted the event-monitor-thread-safety branch August 9, 2023 18:01

likebreath mentioned this pull request Nov 9, 2023

tests: Stabilize snapshot_restore tests #5938

Merged
