
Enable PaRSEC profiling system #227

Status: Closed · wants to merge 2 commits

Conversation

therault (Contributor):

In order to have a working profiling system, the parsec_task_class
of each new TT needs to allocate 2 integers in an array that belongs
to the parsec_taskpool_t. The profiling system then assumes that the
task class can compute an identifier that helps match the end
of a task with its beginning when the two do not happen on the same
thread (e.g. for GPU or asynchronous execution). The profiling system
also uses a few more fields that need to be set.

therault marked this pull request as draft on March 25, 2022 04:23
therault (Contributor, Author):

@devreal I added a register_new_tt, but I'm sure we should use the existing infrastructure that keeps track of the TT lifecycle (world_impl.register_op()). I didn't want to mess with that without talking to you, though, so I did it this way for review.

@evaleev @robertjharrison What is our TT creation model? Do we want to allow multithreaded TT creation? That would need to be protected (and it's not the only thing that will be fragile if multiple threads can create TTs, I believe).

evaleev (Contributor) commented Mar 25, 2022:

In MAD backend TT creation must be done from a single thread, I believe.

robertjharrison (Contributor) commented Mar 25, 2022 via email

@@ -273,6 +282,21 @@ namespace ttg_parsec {
#endif // TTG_USE_USER_TERMDET
}

template <typename keyT, typename output_terminalsT, typename derivedT, typename input_valueTs = ttg::typelist<>>
void register_new_tt(const TT<keyT, output_terminalsT, derivedT, input_valueTs> *t) {
Review comment (Contributor):

I think it'd be sufficient to take TTBase* here since you only need the name.

@@ -2206,6 +2240,10 @@ namespace ttg_parsec {
}
}

static parsec_key_t make_key(const parsec_taskpool_t *tp, const parsec_assignment_t *as) {
Review comment (Contributor):

I'm trying to understand what make_key does... parsec_assignment_t is a struct with one integer member. How is it safe to cast it to a uintptr_t below?

Reply (Contributor, Author):

self.locals is an array of assignment_t with more than 4 entries, so there is room in self.locals to store a pointer-sized integer. Not clean, but I'm abusing the only space I have to store stack information.

@@ -161,7 +164,12 @@ namespace ttg_parsec {

public:
static constexpr const int PARSEC_TTG_MAX_AM_SIZE = 1024 * 1024;
WorldImpl(int *argc, char **argv[], int ncores) : WorldImplBase(query_comm_size(), query_comm_rank()) {
WorldImpl(int *argc, char **argv[], int ncores) : WorldImplBase(query_comm_size(), query_comm_rank())
#if defined(PARSEC_PROF_TRACE)
Review comment (Contributor):
I would make that initialization independent of PARSEC_PROF_TRACE, but turn it into an assignment here: https://github.com/TESSEorg/ttg/pull/227/files#diff-b16b2b248d6db2366353f63da94be820a8905343d7452acc0639ffcaf3100068R330

therault (Contributor, Author):

Stale code, superseded by PR #222.

therault closed this Apr 25, 2022

4 participants