Fix running in Docker instructions #11141
```bash
git clone https://github.com/apache/datafusion
cd arrow-datafusion
git checkout 12.0.0
```
When I tried to check out the 12.0.0 tag, the docker build command failed with unsatisfied dependencies.
git checkout 12.0.0
cd datafusion
docker build -f datafusion-cli/Dockerfile . --tag datafusion-cli
docker run -it -v $(your_data_location):/data datafusion-cli
I'd propose adding --rm here and removing -v $(your_data_location):/data.
This would make the command directly copy-pastable into a terminal without modification and would also reduce side effects.
Maybe we could add a comment like:
# Run with data mounted at /data in the container
# docker run --rm -it -v $(your_data_location):/data datafusion-cli
# Run in a container without access to the host filesystem
docker run --rm -it datafusion-cli
🤔 though the second command doesn't work
andrewlamb@Andrews-MacBook-Pro-2 datafusion % docker run --rm -it datafusion-cli
error: Invalid value "/data" for '--data-path <DATA_PATH>': Invalid data directory '/data'
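The error suggests the image starts the CLI with `--data-path /data`, so `/data` has to exist inside the container even when no host data is mounted. A minimal sketch of one way around that, assuming this really is the cause: mount an anonymous volume at `/data` when no host directory is given. `datafusion_cli_cmd` is a hypothetical helper, not part of DataFusion, and it only prints the command (a dry run) rather than invoking Docker.

```shell
#!/bin/sh
# Hypothetical helper (not part of DataFusion): print a `docker run` command
# for the datafusion-cli image built in the steps above.
# Assumption: the image requires /data to exist, as the error message implies.
datafusion_cli_cmd() {
  if [ "$#" -eq 0 ]; then
    # No host directory given: `-v /data` creates an anonymous volume at /data,
    # which `--rm` removes together with the container.
    printf '%s\n' 'docker run --rm -it -v /data datafusion-cli'
    return 0
  fi
  if [ ! -d "$1" ]; then
    printf "error: '%s' is not a directory\n" "$1" >&2
    return 1
  fi
  # Bind-mount the host directory; pass an absolute path, since Docker treats
  # a bare name as a named volume rather than a host path.
  printf 'docker run --rm -it -v "%s":/data datafusion-cli\n' "$1"
}

# Dry run: show both variants without starting a container.
datafusion_cli_cmd
datafusion_cli_cmd "$PWD"
```

Whether the CLI accepts an empty anonymous volume as its data directory has not been verified here; the printed command should be tried manually before it goes into the docs.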
alamb left a comment
Thank you for the contribution @findepi 🙏 📖
I think this PR is an improvement over what is on main, so I will merge it. I liked your proposal about adding --rm; maybe we can do that as a follow-on PR.
No description provided.