Skip to content

Conversation

@andygrove
Copy link
Member

@andygrove andygrove commented May 3, 2023

Which issue does this PR close?

N/A

Rationale for this change

Move db-benchmark from arrow-datafusion repo to arrow-datafusion-python repo. This makes more sense because these are Python benchmarks.

There is a corresponding PR to remove this benchmark from the arrow-datafusion repo:

apache/datafusion#6204

What changes are included in this PR?

Move files from arrow-datafusion repo

Note that additional work is required to get these working again, such as updating some of the paths in the Dockerfile.

Are there any user-facing changes?

No

@andygrove
Copy link
Member Author

@MrPowers fyi

&& cp ../arrow-datafusion/benchmarks/db-benchmark/join-datafusion.py db-benchmark/datafusion \
&& cp ../arrow-datafusion/benchmarks/db-benchmark/run-bench.sh db-benchmark/ \
&& chmod +x db-benchmark/run-bench.sh

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Both the documentation and the arrow-datafusion related paths in the docker file need to be upgraded to arrow-datafusion-python

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, we also need to stop having it clone the repo when building the docker image. I've filed #368 for the follow on work once this is merged.

@andygrove andygrove merged commit 31a86ee into apache:main May 4, 2023
@andygrove andygrove deleted the db-benchmark branch May 4, 2023 01:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants