New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve Custom/On-Premise Docs #2771
Comments
Hi Dan, We were attempting to deploy an instance of pachyderm with our CEPH S3 gateway protocol as a replacement for a MINIO deployment, but ran into the error you explained in the intro of this issues report. This forced us to deploy using the helm chart provided in the official kubernetes charts repo. This was a GREAT help! To make things even better for our use case, we were now able to deploy multiple instances of pachyderm for creating serval pipelines using different storage backends in the same K8 cluster. We encountered a few problems along the way which required us to modify the helm chart pr 4159 This enabled us to deploy multiple instances of pachyderm using CEPH S3 gateway, MINIO, Amazon, etc.. Many thanks!! |
Thanks for you comments and contribution @naztyroc! This is great context for us as we improve the on prem docs. I think Jon Ander is reviewing your Helm chart PR, and we will move forward on the docs side here. |
hi Dan, |
Hi Dan,
For our use case, we are using CEPH as the storage backend for Kubernetes which connects as RBD volumes. We configured Kubernetes to communicate with CEPH by creating storage class methods. With that setup, we deployed Minio, with CEPH/RBD PVC, as the backend for Pachyderm which talks to it over S3 protocol. I hope that helps?
…-Rashian
Sent from my iPhone
On Apr 26, 2018, at 9:11 PM, liuchenxjtu ***@***.***> wrote:
hi Dan,
thanks your for your efforts. I am quite new to Pachyderm. now we want to deploy pachyderm on our own cluster. but most of the docs are about deploying on cloud provider, e.g., persistent-disk must be google, azure or aws. so for our case, could you advise what we should be for the local clusters? thank you!
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Thanks for the update @naztyroc. Will hopefully have these updated soon. @liuchenxjtu Thanks for reaching out. I would recommend using the Helm chart for your deploy: https://github.com/kubernetes/charts/tree/master/stable/pachyderm. You can use Minio, Rook, Ceph, Swift, etc. for your object store, and then use can use can any k8s PV for the volume: https://kubernetes.io/docs/concepts/storage/persistent-volumes/ |
Hi Dan, |
@liuchenxjtu a local deployment (minikube) of Pachyderm just uses local storage and not a full object store. It's also not really meant for production workloads where you'd need a real object store. If you want to use minio as the storage backend for a tiny cluster, you'll need to do a single-node or on-prem deployment of kubernetes and then use the If you tell me a bit more about your setup and requirement, we can give you additional guidance. For example:
When you say "local clusters" do you mean on-prem or do you mean a minikube or "pachctl deploy local" cluster? |
Hi @dwhitena ,
|
Regarding gluster FS, yes you can use gluster FS, which a Minio server as an S3 compatible layer between the two. Regarding the persistent disk, We are changing out on-prem default to use the Helm chart. That should clear up the confusion around the PVs. More details can be found here: https://github.com/kubernetes/charts/tree/master/stable/pachyderm |
hi @dwhitena , @JoeyZwicker when I tried to put file in one repo, e.g., pachctl put-file test master -c -f lsof.log thanks for your help!! |
I have the same issue: I'd like to use |
Will the helm chart be updated to 1.8? |
@brokenjacobs the helm chart is still not (yet) an officially/fully supported deployment method for Pachyderm, as it hasn't been incorporated into the test suite or build/release process. That being said, it would be great if you could file each of those three questions as it's own issue. Hopefully someone on the pachyderm core team or a community member can get around to answering/adding each of those things soon. |
@marcadella most likely what you'll want to do is run |
I was able to get it deployed using the custom option. I was scared off by the persistent disk option being required but it didn’t seem to put anything into the output manifest. It’s not working properly though, dashboard won’t come up. I suspect k8s 13.2 may be an issue. Will investigate on Monday. |
Hi, I am trying to use CEPH by s3 gateway. How can I config charts? I set s3 configuration in values.yaml, and It seems to use Minio as backend storage. It would be of great help if you could show me how you config charts. |
@iswaverly Sorry for the confusion, Pachyderm uses the Minio client to communicate with all non-Amazon S3-compatible object stores (including Ceph). So it sounds like you've got it set up correctly |
@JoeyZwicker @svekars Do you think we could close this, with the work done on custom deployment last year? |
Sounds good to me! |
Closing as this has been addressed by JK. |
There's been a lot of interest in on-prem clusters lately, and our docs on the subject aren't very good. The main issues that people have been hitting are:
custom
deploycustom
deploy for cloud vs. on prem deploysNotExist
error with CEPH S3 interface deployI think the following will probably make the process better for on prem users:
The text was updated successfully, but these errors were encountered: