Fix lib-pthread issues #2410

wenyongh · 2023-07-31T09:19:22Z

Avoid destroying module instance repeatedly in pthread_exit_wrapper and
wasm_thread_cluster_exit.
Wait enough time in pthread_join_wrapper for target thread to exit and
destroy its resources.

Zzzabiyaka · 2023-07-31T13:40:12Z

Hi Wenyong,

recently we observed a situation when test was running 4 threads but thread manager reported up to 5-6 of them because after pthread_join it still was in thread_manager list for some short time (that's how I got it). So we added retries on the test side.

Is the second change fixing it?

wenyongh · 2023-08-01T01:39:38Z

Hi Wenyong,

recently we observed a situation when test was running 4 threads but thread manager reported up to 5-6 of them because after pthread_join it still was in thread_manager list for some short time (that's how I got it). So we added retries on the test side.

Is the second change fixing it?

Yes, in lib-pthread mode (not wasi-threads mode), I found that when thread A is exiting in pthread_join_wrapper, it may change its node's status to THREAD_EXIT before calling wasm_cluster_exit_thread to actually exit, and if thread B is joining it and detects the change of thread A node's status, thread B will just return successfully. But at this time thread A may hasn't actually exited, this may cause some unexpected behavior, for example, if thread B is main thread, it may think that all other thread have exited, and it may destroy all resources (module instance, module, runtime, etc.) and exit, but the resources may be still needed by thread A. I just let thread B wait some time after joining to fix it.

wenyongh · 2023-08-01T01:42:41Z

@Zzzabiyaka Maybe you can check whether this PR fixes your test issue and don't add the retries?

loganek · 2023-08-01T08:37:39Z

I don't think the issue mentioned by @Zzzabiyaka can actually be fixed. Both pthread_join and pthread_create are implemented in the userspace, and they have no information about the status of the underlying threads in a thread cluster. There's no guarantee that at the time when pthread_join() completes, the underlying thread actually finishes (it has a few more things to do in the host before it really ends). So I think the recommendation for users should be to:

either implement retries (just like we did in tests)
increase number of allowed threads and accept the fact that there will be some short periods of time where there will be more than expected threads running.

The thing is different for lib-pthread, because it's fully implemented in host, therefore we can confidently wait for the native thread to complete.

yamt · 2023-08-01T08:40:13Z

core/iwasm/libraries/lib-pthread/lib_pthread_wrapper.c

+           they are actually destroyed to avoid unexpected behavior. */
+        os_mutex_lock(&exec_env->wait_lock);
+        os_cond_reltimedwait(&exec_env->wait_cond, &exec_env->wait_lock, 1000);
+        os_mutex_unlock(&exec_env->wait_lock);


isn't this still racy?

The exec_env->wait_lock is used singly or together with exec_env->wait_cond, it won't be occupied by a thread for a long time: when a thread locks exec_env->wait_lock, it will unlock it after short time operations, or call os_cond_wait/reltimedwait, note that the lock will be unlocked firstly inside os_cond_wait/reltimedwait. My understanding is that here the current thread should be able to acquire the lock.

- Avoid destroying module instance repeatedly in pthread_exit_wrapper and wasm_thread_cluster_exit. - Wait enough time in pthread_join_wrapper for target thread to exit and destroy its resources.

Fix lib-pthread issues

8cbcdba

yamt reviewed Aug 1, 2023

View reviewed changes

wenyongh mentioned this pull request Aug 1, 2023

Propose a new (1.2.3) release #2378

Merged

fix typo

d45a03a

wenyongh merged commit cb6d850 into bytecodealliance:main Aug 1, 2023
368 checks passed

wenyongh deleted the fix_lib_pthread branch August 1, 2023 11:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix lib-pthread issues #2410

Fix lib-pthread issues #2410

wenyongh commented Jul 31, 2023

Zzzabiyaka commented Jul 31, 2023 •

edited

Loading

wenyongh commented Aug 1, 2023

wenyongh commented Aug 1, 2023

loganek commented Aug 1, 2023

yamt Aug 1, 2023

wenyongh Aug 1, 2023

Fix lib-pthread issues #2410

Fix lib-pthread issues #2410

Conversation

wenyongh commented Jul 31, 2023

Zzzabiyaka commented Jul 31, 2023 • edited Loading

wenyongh commented Aug 1, 2023

wenyongh commented Aug 1, 2023

loganek commented Aug 1, 2023

yamt Aug 1, 2023

Choose a reason for hiding this comment

wenyongh Aug 1, 2023

Choose a reason for hiding this comment

Zzzabiyaka commented Jul 31, 2023 •

edited

Loading