Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ksonnet packages for Pachyderm #611

Closed
jlewi opened this issue Apr 7, 2018 · 8 comments
Closed

ksonnet packages for Pachyderm #611

jlewi opened this issue Apr 7, 2018 · 8 comments

Comments

@jlewi
Copy link
Contributor

jlewi commented Apr 7, 2018

See #151 for a larger proposal about how Pachyderm might integrate with other Kubeflow components.

This issue is specifically for getting a Kubeflow package for pachyderm (#610)

I started with the helm chart

But it looks like that is missing a bunch of components e.g.

  • storage class
  • dashboard

It looks like some of these resources are created in code e.g. in assets.go

@dwhitena @jdoliner @JoeyZwicker Could you provide me with a set of YAML/json manifests for all the K8s resources that need to be created as part of a pachyderm deployment?

@jlewi
Copy link
Contributor Author

jlewi commented Apr 7, 2018

So I was able to connect by doing

pachctl --namespace=kubeflow port-forward

Didn't realize the dashboard was part of the enterprise edition.

@pdmack
Copy link
Member

pdmack commented Apr 8, 2018

Yeah, was wondering that. Seems like just the CLI for the community version.

@JoeyZwicker
Copy link

The dashboard is part of the enterprise edition, but if you put in your email it'll give you a free 2-week trial token.

@jlewi
Copy link
Contributor Author

jlewi commented Apr 9, 2018

@JoeyZwicker Thanks but I don't have an immediate need to use the dashboard; now that I know its not part of the community edition I removed it from the ksonnet package.

I deployed pachyderm using the CLI configured to use GCS and here is a list of the K8s resources I see created

serviceaccount "pachyderm" configured
clusterrole "pachyderm" configured
clusterrolebinding "pachyderm" configured
storageclass "etcd-storage-class" configured
service "etcd-headless" configured
statefulset "etcd" configured
service "etcd" configured
service "pachd" created
deployment "pachd" configured
service "dash" configured
deployment "dash" configured
secret "pachyderm-storage-secret" configured

@jlewi
Copy link
Contributor Author

jlewi commented Apr 9, 2018

Looks like its using cluster roles where as the helm chart was just using roles.

Here is a list of all the K8s resources created when I deployed via the CLI and configured to use GCS.
pachyd_resources.txt

@jlewi
Copy link
Contributor Author

jlewi commented Apr 11, 2018

@dwhitena @JoeyZwicker it looks like etcd is running as a statefulset in the GCP case but as a deployment in the non GCP case?

Any reason its not always running as a statefulset?

@jdoliner
Copy link

@jlewi when stateful sets first came out we setup etcd to work with stateful sets but we found that GCP was the only place that really had functioning support for them. I suspect that's improved at this point some but we still do get a number of issues where people deploy stateful sets on AWS and EBS volumes get stuck and things like that. A couple of times we've had to get people to redeploy w/o stateful sets to get things working on other platforms. So it's mostly just an artifact of what worked and what didn't and tweaking the defaults to route people around common support issues. I'm not totally sure how current this information is though, I don't quite remember when exactly the last time someone had an issue like this was.

@jlewi
Copy link
Contributor Author

jlewi commented Apr 30, 2018

Closing this issue since we have the initial package.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants