Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JOSS Review: State of the field #217

Open
Midnighter opened this issue Aug 7, 2022 · 0 comments
Open

JOSS Review: State of the field #217

Midnighter opened this issue Aug 7, 2022 · 0 comments

Comments

@Midnighter
Copy link

Midnighter commented Aug 7, 2022

This is discussed on p. 1, l. 20-25 and seems too brief to me. It is also rather unclear what the statement

few handle the heterogeneity which is prevalent in many experimental environments

means specifically. The referenced frameworks are very general purpose and I'm quite sure that they can handle almost anything if programmed that way. Just as specific pipelines need to be developed for shed-streaming.

There are also more Python-specific frameworks that are in wide use:

not all of them are specialized for streaming data but the key differentiator of shed-streaming is could be more clear.

The big cloud providers also provide proprietary solutions for streaming data.

It seems to me that the main benefit of shed-streaming is rather that it tightly integrates with an existing ecosystem maintained by NSLS-II. Overall, it appears to provide rather high-level, shallow interfaces and adapters to, for example, rapidz, bluesky, and automatic use of databroker for data provenance. I think both the documentation and the manuscript would greatly benefit from a figure similar to https://nsls-ii.github.io/_images/collection-overview.svg that shows how exactly shed-streaming fits into this ecosystem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant