-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat!: switch over to dask-based processing idioms, improve dataset handling #882
Conversation
@valsdav Here's the PR with the fileset pre-processor. The query tool you're working on with rucio / servicex backend should produce json that's compatible with this spec (and we can go so far as to define a schema if you like). Essentially - nests the spec mentioned here in uproot. So that you've got an object that looks like
The preprocessing step, as it is, strips out I'm going to add some modifiers on top of this to recover capabilities like |
Hi @lgray thanks! Starting to work on the query side :) |
for more information, see https://pre-commit.ci
…nto local_executors_to_dask
for more information, see https://pre-commit.ci
feat: Dataset querying features using rucio
…nto local_executors_to_dask
feat: dataset discovery CLI
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
Pre-processing fails on a distributed Client with
which fails during
throughout the error message. Works fine with local threads or processes |
@iasonkrom this is a limitation in the latest awkward array, I think. @agoose77 would know better. Though I'm a bit confused why it's trying to serialize a typetracer in the first place, that shouldn't be happening. |
…nto local_executors_to_dask Conflicts: src/coffea/dataset_tools/dataset_query.py
for more information, see https://pre-commit.ci
…nto local_executors_to_dask
…nto local_executors_to_dask
for more information, see https://pre-commit.ci
…cal_executors_to_dask
…nto local_executors_to_dask
…a into local_executors_to_dask
fix: improvements to dataset_query tools
@nsmith- can you give this another run through? Intend to merge later today unless there is something big. |
This will: