Multiple outputs and restructured ParallelSourcePlugin #190

JelleAalbers · 2019-06-10T20:36:14Z

This allows strax plugins to have multiple outputs. If you want to return incompatible kinds of rows, e.g. both records and hits, or peaks and some metadata for each chunk, multi-output support is essential.

To return multiple outputs from a plugin:

The provides tuple should have multiple elements, listing the provided data type names
The dtype and data_kind attributes should be dictionaries mapping data type names to dtypes and data kind names, respectively. The values of these dicts can be specified in the same way as the entire attribute would be for a single-output plugin
The compute method must return a dictionary mapping data types to results (structured numpy arrays or field/array dictionaries).
For more info, see the updated docs, and tests/test_multi_output.py provides an example.

This also rewrites ParallelSourcePlugin. We should no longer subclass these plugins directy (as e.g. straxen's DAQReader does); instead, they are automatically created by strax when a parallel='process' plugin is followed by other plugins that can become inlined. Again, further description is in the docs. This should speed up processing from records onwards on machines with many cores, since records is then picked up by ParallelSourcePlugin just like raw_records usually is. See also the new docs and #150.

I hope... Previously the "kill" command to other mailboxes got skipped for non-MailboxKilled exceptions.

JelleAalbers added 12 commits June 9, 2019 10:55

Let's do this TDD thing

fa54bf6

Change provides to tuple

d843535

Multi-output plugin and saver initialization

84f96c7

Multi-output tests pass, remove alias support

01e7675

Misc cleanup, comments

d1d97ec

Autocreate ParallelSourcePlugin, other misc

7792394

Fix exception handling

be5bb94

I hope... Previously the "kill" command to other mailboxes got skipped for non-MailboxKilled exceptions.

Multi-output inside ParallelSourcePlugin

15d20e6

Update docs

7c83042

Fix exception handling

b2101ad

Add docs on new parallelization

620dd77

Fix show_config and unprivatize dtype_for

5153a83

JelleAalbers mentioned this pull request Jun 13, 2019

Upgrade pulse processing, cleanup XENONnT/straxen#16

Merged

JelleAalbers merged commit 8e4afe5 into AxFoundation:master Jun 15, 2019

JelleAalbers deleted the multi_output_2 branch September 16, 2019 09:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple outputs and restructured ParallelSourcePlugin #190

Multiple outputs and restructured ParallelSourcePlugin #190

JelleAalbers commented Jun 10, 2019

Multiple outputs and restructured ParallelSourcePlugin #190

Multiple outputs and restructured ParallelSourcePlugin #190

Conversation

JelleAalbers commented Jun 10, 2019