[DPL Analysis] Simple workflow suffix solution by saganatt · Pull Request #5558 · AliceO2Group/AliceO2

saganatt · 2021-02-25T14:44:50Z

turns out the workflow suffix needs to be appended early enough. The task names and hashes based on them are propagated to objects already inside the adaptAnalysisTask(). Any following task name changes prevent proper matching of output objects to tasks in sink.
I tried to do some workaround on the Framework side but we don't have access to the objects after tasks creation and before the sink. We could e.g. record task names in object headers, then in the sink function append the suffix and re-generate the hashes but this actually defies the purpose of having task hashes...

jgrosseo · 2021-02-25T15:01:43Z

Thanks! This means at present every task would need to add the suffix by hand?
(I changed this PR to draft as it is just an illustration.)

saganatt · 2021-02-25T15:19:40Z

Yes :/

ktf · 2021-03-01T14:45:57Z

Analysis/Tutorials/src/histogramRegistry.cxx

Why not simply passing the ctx to the adaptAnalysisTask?

One possible solution we could have is to make the adaptAnalysisTask() function be a member of (an object returned by) the ConfigContext. Something like:

ctx.taskMaker().makeAnalysisTask<ETask>();

or alternatively have an helper method for the workflow, rather than for the simple task:

return adaptAnalysisWorkflow(ctx, TaskSpec<ETask>{"output-obj-test"}, TaskSpec<ATask>{"eta-and-phi-histograms"}});

What do you think?

Another solution would be that defineDataProcessing does not return a std::vector<DataProcessorSpec>, but a ConfigureDataProcessorAction which then can be used to do the lazy configuration of the DataProcessor. I think we can even guarantee backward compatibility by having a constructor for ConfigureDataProcessorAction(DataProcessorSpec).

Hi,
true, we can just pass the ctx and append the suffix inside the adaptAnalysisTask().

ConfigureDataProcessorAction - hm, still we need to pass the ctx somewhere, so it'll still mean a change in analysis workflows. I don't see how it could be much different from just passing the ctx to adaptAnalysisTask().

adaptAnalysisWorkflow looks also good to me since ctx would be used just once. Though it is a bigger change on the user side ;)

Regarding ConfigureDataProcessorAction: my idea would be that the context is passed outside the user code.

Basically we would go from std::vector<DataProcessorSpec> runDataProcessing(ConfigContext...) to std::vector<ConfigureDataProcessorAction> runDataProcessing(ConfigContext...), where ConfigureDataProcessorAction would be something like:

enum struct DataProcessorAction : char { CopyFullFromProtoToTarget, // The default action, copy the whole DataProcessorSpec to the one labelled as target AppendInputsFromProtoToTarget, // Take the inputs in proto and replace them in the one labelled. AppendOutputsFromProtoToTarget }; struct ConfigureDataProcessorAction { std::string target; DataProcessorSpec *proto; DataProcessorAction action; };

It seems to me it'd be then the same solution as your primary one, with appending the suffix after the tasks creation in overrideSuffix().

We have direct access to output object only inside adaptAnalysisTask(). So, later it is not possible to change the task hashes inside objects and copying the spec doesn't help.

Maybe I am wrong and changing something in DataProcessorSpec will be enough to match the hashes properly in sinks...? Though I haven't suceeded with it previously. In that case, wouldn't it be simplier to stay with overrideSuffix()?

saganatt · 2021-03-02T10:50:07Z

Ciao @ktf , what do you think about the current solution?

ktf · 2021-03-03T10:52:18Z

So we discussed this a bit with @jgrosseo. In the end I think the easiest is to pass the context to the adaptAnalysisTask, since there are places were tasks are added programmatically and adapting to adaptAnalysisWorkflow would require extra manipulations.

The more complex solution I was trying to propose is to use the adaptAnalysisTask to capture outputs and manipulate them afterwards, but after having a look at the current adaptAnalysisTask that is indeed quite of a development so I would just go for the modified adaptAnalysisTask, at least for now.

saganatt · 2021-03-03T13:16:32Z

Ok, I adjusted the code.

jgrosseo · 2021-03-03T13:30:18Z

Looks very good to me. I guess now we have to adjust all existing tasks, right? @saganatt do you feel like grepping for them..? ;-)

saganatt · 2021-03-03T13:30:56Z

ok, I will do ;-)

ktf · 2021-03-03T16:08:01Z

Analysis/Tutorials/src/histograms.cxx

-    adaptAnalysisTask<BTask>("etaphi-histogram"),
-    adaptAnalysisTask<CTask>("pt-histogram"),
-    adaptAnalysisTask<DTask>("output-wrapper"),
+    adaptAnalysisTask<ATask>("eta-and-phi-histograms", ctx),


Make it first argument please.

jgrosseo · 2021-03-03T18:27:58Z

Thanks a lot for this tedious work!

jgrosseo · 2021-03-04T07:36:53Z

@ktf the errors seem only in the tests and unrelated. Is this due to the not running tests earlier? Can we merge?

saganatt requested review from a team, iarsene and jgrosseo as code owners February 25, 2021 14:44

jgrosseo marked this pull request as draft February 25, 2021 15:00

ktf reviewed Mar 1, 2021

View reviewed changes

saganatt force-pushed the object-suffix branch from 4f51a60 to c4a0ce8 Compare March 1, 2021 19:34

saganatt added 3 commits March 3, 2021 14:13

Simple suffix solution

8fe385b

Added analysis workflow creation helper

27ac5d7

Appending output inside adaptAnalysisTask

39bdb09

saganatt force-pushed the object-suffix branch from c4a0ce8 to 39bdb09 Compare March 3, 2021 13:15

ktf reviewed Mar 3, 2021

View reviewed changes

Added config context to all analysis tasks

8faa4fa

saganatt marked this pull request as ready for review March 3, 2021 16:25

saganatt requested a review from ginnocen as a code owner March 3, 2021 16:25

Fixes

f72b259

ktf self-requested a review March 3, 2021 17:01

ktf previously approved these changes Mar 3, 2021

View reviewed changes

More nitpick fixes

03f5faa

saganatt dismissed ktf’s stale review via 03f5faa March 3, 2021 17:16

Nitpick: Added missing config

9e8f053

jgrosseo approved these changes Mar 3, 2021

View reviewed changes

ktf merged commit 4e73f99 into AliceO2Group:dev Mar 4, 2021

EmilGorm pushed a commit to EmilGorm/AliceO2 that referenced this pull request Nov 22, 2021

[DPL Analysis] Simple workflow suffix solution (AliceO2Group#5558)

e51ba15

Conversation

saganatt commented Feb 25, 2021

Uh oh!

jgrosseo commented Feb 25, 2021

Uh oh!

saganatt commented Feb 25, 2021

Uh oh!

ktf Mar 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ktf Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

saganatt Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

ktf Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

saganatt Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

saganatt commented Mar 2, 2021

Uh oh!

ktf commented Mar 3, 2021

Uh oh!

saganatt commented Mar 3, 2021

Uh oh!

jgrosseo commented Mar 3, 2021

Uh oh!

saganatt commented Mar 3, 2021

Uh oh!

ktf Mar 3, 2021

Choose a reason for hiding this comment

Uh oh!

jgrosseo commented Mar 3, 2021

Uh oh!

jgrosseo commented Mar 4, 2021

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

ktf Mar 1, 2021 •

edited

Loading