sync objects on stack cause both false positives and false negatives #494

ramosian-glider · 2015-09-01T11:19:51Z

Originally reported on Google Code with ID 87

Tsan does not reset state of sync object on stack when they go out of scope. A brand
new sync object created at the same address in new frame inherits all state from the
previous one.

This can cause false negatives for race detector, because tsan sees fake HB edges between
the old and the new sync objects.

This can also cause false positives for deadlock detector, because tsan effectively
merges two unrelated mutexes and all edges between them.

Several aspects to consider:
- the same actually applies to heap, if the program recreates a new sync object on
the same place by explicitly calling dtor/ctor
- we can't remove sync objects from meta map if there can be concurrent accesses, because
linked lists in the meta map are append-only
- sync objects can be recreated on stack/globals in the same way as for heap
- we can't reset state of sync objects in ctor (e.g. pthread_mutex_ctor) in all cases
because atomics do not have ctors and because of global LINKER_INITIALIZED mutexes
- we can't reset remove sync objects from meta map in dtors even for stack, because
if it is an explicit call to dtor, there can be concurrent accesses to the same 8-bytes
from a different thread and the meta map is append-only

There are several solutions of varying complexity depending on what from the above
we want to fix.
We can remember SP in func_entry and reset sync objects in func_exit. Care must be
taken to not slow down execution in common case.
We can reset some sync objects in ctors, e.g. pthread_mutex_init should be safe.
We can reset some sync objects in dtors (but not remove them from meta map), e.g. pthread_mutex_destroy
should be safe.

Reported by dvyukov@google.com on 2014-12-17 09:51:48

The text was updated successfully, but these errors were encountered:

ramosian-glider · 2015-09-01T11:19:59Z

Will it help to move the local vars to a fake stack?

Does the problem apply to the case when several mutexes created in local scopes share
a single stack slot?

Reported by glider@google.com on 2014-12-17 09:58:28

ramosian-glider · 2015-09-01T11:20:08Z

> Will it help to move the local vars to a fake stack?

I don't see how it changes anything.

> Does the problem apply to the case when several mutexes created in local scopes share
a single stack slot?

Yes, and it is even more difficult to handle. However, better handling of thread_mutex_init/destroy
will help to some degree.

Reported by dvyukov@google.com on 2014-12-17 10:00:46

ramosian-glider · 2015-09-01T11:20:17Z

Submitted a test case for one of the possible reincarnation of the issue:
http://llvm.org/viewvc/llvm-project?view=revision&revision=224422
Other possible reincarnations are: reuse of globals/heap, explicit dtor/ctor, reuse
of stack space in the same stack frame.

Reported by dvyukov@google.com on 2014-12-17 10:21:17

dvyukov · 2016-06-01T05:43:49Z

Another manifestation of the issue: if a stack-allocated object if destroyed locked, then another mutex is re-created on the same address, tsan can report "read lock of a write-locked mutex or similar" false reports.

morehouse · 2018-06-06T22:56:59Z

@dvyukov: Any progress since last update?

dvyukov · 2018-06-07T05:35:08Z

No progress. Still relevant.

ramosian-glider assigned google Sep 1, 2015

ramosian-glider added Type-Defect Priority-Medium Status-Accepted ProjectThreadSanitizer labels Sep 1, 2015

dvyukov mentioned this issue Dec 1, 2015

Cleanup sync objects in mmaps and stacks #477

Closed

morehouse unassigned google Jun 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync objects on stack cause both false positives and false negatives #494

sync objects on stack cause both false positives and false negatives #494

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

dvyukov commented Jun 1, 2016

morehouse commented Jun 6, 2018

dvyukov commented Jun 7, 2018

sync objects on stack cause both false positives and false negatives #494

sync objects on stack cause both false positives and false negatives #494

Comments

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

ramosian-glider commented Sep 1, 2015

dvyukov commented Jun 1, 2016

morehouse commented Jun 6, 2018

dvyukov commented Jun 7, 2018