[core, ios, qt, android] Close race condition in RunLoop (issue #9620) #10537

ChrisLoer · 2017-11-22T21:26:26Z

Because a message we queue from the foreground may cause the background to complete, exit, and tear down the AsyncTask, we have to block queue processing until we've finished our call to AsyncTask::send().

Broadening the scope of a mutex is scary, but I audited the code of our four implementations of AsyncTask and I don't see any way this could cause a deadlock.

If we wanted to keep the scope of the mutex as limited as possible, we could move the mutex scope-broadening to RunLoop::stop(), with a custom implementation that bypasses invoke/push, but I'm not sure it would be worth the extra complexity. I suppose the scenario to worry about is that since we're holding the mutex while we call async->send we could cause the background to wake up and immediately be blocked on the mutex... but it doesn't seem like it would be that common or that costly when it did happen?

/cc @tmpsantos @jfirebaugh @kkaefer @ivovandongen

ChrisLoer · 2017-11-22T21:32:54Z

I audited the code of our four implementations of AsyncTask and I don't see any way this could cause a deadlock.

😅 My in-depth audit missed that the Android RunLoop doesn't use AsyncTask... The Android 'wake' looks like write(fds[PIPE_IN], "\n", 1)... which is... I dunno, is it OK to hold a mutex while writing to a file descriptor?

kkaefer · 2017-11-23T10:42:18Z

platform/android/src/run_loop.cpp

+    withMutex([&] {
+        queue.push(std::move(task));
+        impl->wake();
+    });


Now that we lock the entire function, we can also remove the withMutex function altogether and just move the std::lock_guard into every function.

👍 on @kkaefer point here

withMutex is also used in process, and I think there's a readability win in using the same syntax to mark all locked sections in the code. If we were to stop using withMutex in push, I think it would also make sense to change the call in process to something like:

... { std::lock_guard<std::mutex> lock(mutex); queue_.swap(queue); } ...

Between those two options... 🤷‍♂️ ? Doesn't make a big difference to me, but at some point we thought withMutex was a clearer way to mark the locked section of code.

tmpsantos · 2017-11-23T16:34:24Z

is it OK to hold a mutex while writing to a file descriptor?

Should be fine.

Because a message we queue from the foreground may cause the background to complete, exit, and tear down the AsyncTask, we have to block queue processing until we've finished our call to AsyncTask::send(). Broadening the scope of a mutex is scary, but I audited the code of our four implementations of AsyncTask and I don't see any way this could cause a deadlock.

tmpsantos · 2017-11-28T16:00:29Z

@ChrisLoer thanks! This apparently fixed a hang I could only reproduce on Qt + Windows.

ChrisLoer requested review from jfirebaugh and tmpsantos November 22, 2017 21:26

ChrisLoer force-pushed the cloer_9620 branch from b9e7ce1 to 6c0182d Compare November 22, 2017 21:29

jfirebaugh mentioned this pull request Nov 22, 2017

ThreadSanitizer error on API.RepeatedRender test #9620

Closed

kkaefer reviewed Nov 23, 2017

View reviewed changes

kkaefer added the Core The cross-platform C++ core, aka mbgl label Nov 24, 2017

ChrisLoer force-pushed the cloer_9620 branch from 6c0182d to a492834 Compare November 27, 2017 22:19

ChrisLoer force-pushed the cloer_9620 branch from a492834 to 30e7cbe Compare November 27, 2017 22:20

kkaefer approved these changes Nov 27, 2017

View reviewed changes

ChrisLoer merged commit 5da5ba7 into master Nov 27, 2017

tmpsantos deleted the cloer_9620 branch November 28, 2017 15:59

This was referenced Dec 5, 2017

Cherry-pick to release agua #10633

Merged

Release Android v5.2.1 #10646

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core, ios, qt, android] Close race condition in RunLoop (issue #9620) #10537

[core, ios, qt, android] Close race condition in RunLoop (issue #9620) #10537

ChrisLoer commented Nov 22, 2017

ChrisLoer commented Nov 22, 2017

kkaefer Nov 23, 2017

tmpsantos Nov 23, 2017 •

edited

Loading

ChrisLoer Nov 24, 2017

tmpsantos commented Nov 23, 2017 •

edited

Loading

tmpsantos commented Nov 28, 2017

[core, ios, qt, android] Close race condition in RunLoop (issue #9620) #10537

[core, ios, qt, android] Close race condition in RunLoop (issue #9620) #10537

Conversation

ChrisLoer commented Nov 22, 2017

ChrisLoer commented Nov 22, 2017

kkaefer Nov 23, 2017

Choose a reason for hiding this comment

tmpsantos Nov 23, 2017 • edited Loading

Choose a reason for hiding this comment

ChrisLoer Nov 24, 2017

Choose a reason for hiding this comment

tmpsantos commented Nov 23, 2017 • edited Loading

tmpsantos commented Nov 28, 2017

tmpsantos Nov 23, 2017 •

edited

Loading

tmpsantos commented Nov 23, 2017 •

edited

Loading