Skip to content

fix(loader): fix dry_run execution in dask workers#724

Merged
luisremis merged 6 commits into
developfrom
fix/issue-250
May 21, 2026
Merged

fix(loader): fix dry_run execution in dask workers#724
luisremis merged 6 commits into
developfrom
fix/issue-250

Conversation

@ad-claw000
Copy link
Copy Markdown
Contributor

Summary

Propagates the dry_run flag to Dask workers so that when running with dry_run=True, the data is not accidentally ingested by Dask worker instances.

Verification

  • Verified by checking the DaskManager.run method properly propagates dry_run when instantiating QueryClass.

Fixes #250

Copilot AI review requested due to automatic review settings May 20, 2026 12:28
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Propagates the dry_run flag through the Dask execution path so that Dask worker processes don’t accidentally execute ingestion queries when a loader/query is run with dry_run=True.

Changes:

  • Pass dry_run from ParallelQuery.query() into DaskManager.run().
  • Thread dry_run through the Dask map_partitions worker function and into the per-worker QueryClass instance.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File Description
aperturedb/ParallelQuery.py Passes self.dry_run into the Dask execution path when use_dask is enabled.
aperturedb/DaskManager.py Accepts dry_run, forwards it to Dask workers, and constructs QueryClass with dry_run in each worker.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread aperturedb/DaskManager.py
Comment thread aperturedb/DaskManager.py
Comment thread aperturedb/ParallelQuery.py
Copilot AI review requested due to automatic review settings May 20, 2026 21:34
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

Comment thread test/test_Parallel.py Outdated
Comment thread test/test_Parallel.py Outdated
Comment thread aperturedb/DaskManager.py
Copilot AI review requested due to automatic review settings May 21, 2026 00:23
@ad-claw000
Copy link
Copy Markdown
Contributor Author

CI fixes have been implemented and pushed. Let me know if there's anything else!

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Comment thread test/test_Parallel.py Outdated
Comment thread test/test_Parallel.py Outdated
- Rename connnector_type to connector_type in DaskManager.py
- Use utils.remove_entities in test_dask_dry_run
- Fix CSV file path in test_dask_dry_run
@luisremis luisremis merged commit 3b9e3da into develop May 21, 2026
2 checks passed
@luisremis luisremis deleted the fix/issue-250 branch May 21, 2026 15:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Bug on ParallelLoader when running with dry_run=True

3 participants