
Guard max_latency_ with latencies_mutex_ #878

Merged
merged 3 commits into mlcommons:master on Mar 16, 2021

Conversation

@tjablin (Contributor) commented Mar 16, 2021

There was a race in the computation of the atomic max, and std::memory_order_release
is not a valid memory order for atomic loads. The code isn't performance critical, so
it is easier and safer to use a mutex instead.

@github-actions bot commented Mar 16, 2021

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

@@ -372,7 +372,7 @@ class AsyncLog {
   std::condition_variable all_latencies_recorded_;
   uint64_t latencies_first_sample_sequence_id_ = 0;
   std::vector<QuerySampleLatency> latencies_;
-  std::atomic<QuerySampleLatency> max_latency_{0};
+  QuerySampleLatency max_latency_;
Contributor:
Should we initialize this to 0?

Contributor (author):
Done.

@@ -421,7 +417,8 @@ PerfClock::time_point AsyncLog::GetMaxCompletionTime() {
 }

 QuerySampleLatency AsyncLog::GetMaxLatencySoFar() {
-  return max_latency_.load(std::memory_order_release);
+  std::unique_lock<std::mutex> lock(latencies_mutex_);
Contributor:
Just curious: why does this need to hold mutex?

Contributor (author):
If GetMaxLatencySoFar were called in one thread and RecordSampleCompletion were called concurrently in another thread, there would be a race. Looking at the surrounding code, I don't think that is possible, but if a variable is guarded by a mutex in one context, I think it ought to have an equivalent guard in all contexts to be safe. This function isn't performance critical, so I'm not worried about grabbing a lock here, particularly since I am pretty certain there will be no contention.

Contributor:
makes sense

@guschmue guschmue merged commit 63c7df5 into mlcommons:master Mar 16, 2021
@github-actions github-actions bot locked and limited conversation to collaborators Mar 16, 2021