add stream callback #373
force-pushed from fe54008 to 8a6d892
note: I am not available for two weeks and will check all PRs after my holidays.
force-pushed from 685e468 to 22a84a9
force-pushed from 08a7a3b to c9006ab
force-pushed from c9006ab to f1cfe26
Could you please add a short description of which concepts are new in this pull request, and maybe a code snippet (or a pointer to one) showing how to use the callbacks? How do the callbacks work (in which context is a callback executed)? Is a new thread opened to execute the callback code?
There are no new concepts.
A usage example can also be found in the new unit test. Callbacks are always executed in an independent thread. The CPU streams have always supported this (unintentionally ;-) ) because they use a thread pool to execute their tasks. Only for CUDA streams is there new code. CUDA itself calls the callback from within its own thread, which does not allow calling CUDA runtime API functions from inside the callback. This limitation is lifted by starting a new thread per callback for async CUDA streams, and by using the waiting thread itself for sync CUDA streams.
ready for review ;-)
I have a performance question
```cpp
pCallbackSynchronizationData.get(),
0u));
// ...
std::thread t(
```
Creating a thread per callback can be very expensive in time. E.g., PIConGPU spawns over 2000 kernels/memcpys per second and can have over 100 tasks waiting in streams. This means we would need to spawn 2k threads/s and keep over 100 threads active.
Is it possible to use one thread for all callbacks (waiting in the background), add callbacks to a list, and then always execute them from that callback thread?
We can also move this to a later pull request if it is not currently easy to do.
Currently, I would like to merge this as is. I will create a follow-up issue for optimizing this.
The most flexible solution would be a thread pool with a queue of ready callbacks. If there is only one thread, the latency would be highest but it would equal your single-thread solution; with multiple threads, the latency/resource trade-off can be adapted per use case.
Can we merge this and create the follow-up ticket? My upcoming event unit tests are based on some of these stream test helpers.
Sorry, I was busy. Yes, I will merge it.
possible solution for #368