
Friction log for TFX Chicago taxi cab example on minikube #594

Closed
jlewi opened this issue Apr 5, 2018 · 6 comments

Comments

@jlewi
Contributor

jlewi commented Apr 5, 2018

We'd love for some folks to try running the TFX Chicago Taxi Example
https://github.com/tensorflow/model-analysis/tree/master/examples/chicago_taxi

and provide a log of any pain points or problems they encounter while trying to run it.

We are especially interested in the experience of deploying and running on minikube, as that's where folks will start.

The goal would be to start with deploying minikube and capture any information needed to properly set up minikube for Kubeflow (e.g., disk size). As noted in #502, we want to provide guidance on how to set up minikube for Kubeflow.
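As a starting point, minikube's resource flags let you size the VM up front. The flag values below are illustrative assumptions, not tested recommendations; the flags themselves (--cpus, --memory, --disk-size) are standard minikube options:

```shell
# Start minikube sized for Kubeflow. Values here are guesses for
# illustration, not verified minimums; the default 20 GB disk proved
# too small in the reports below, so give it extra headroom.
minikube start \
  --cpus=4 \
  --memory=8192 \
  --disk-size=40g
```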

Then go through all the steps in the example for running locally and note any problems or difficulties you encounter.

It would also be fantastic to run it on other Kubeflow distributions.

@Maerville
Contributor

Maerville commented Apr 6, 2018

  1. If, while installing Kubeflow, you also want to install JupyterHub, the default 20 GB minikube disk size is not enough because of the large tf image.
  2. kubectl get svc -n=${NAMESPACE} -- the output this command shows in the tutorial is outdated; it now also includes the k8s dashboard and the Ambassador services.
  3. ks param set kubeflow-core jupyterNotebookPVCMount /home/jovyan/work -- this path is also funny.

@jlewi
Contributor Author

jlewi commented Apr 7, 2018

@Maerville Thank you very much.

The Jupyter notebook always runs as user jovyan, which is why you have that path.

@Maerville
Contributor

  1. Used tf image version 1.7.0-cpu.

  2. Opened a terminal in JupyterHub and cloned the repository https://github.com/tensorflow/model-analysis

  3. Opened chicago_taxi_tfma_local_playground.ipynb
    Visualization: Plots -- the kernel died.
    Initial resources were 1 CPU and 1Gi of memory. The kernel also died with 2Gi and 3Gi; 4Gi finally worked.
    Shut down the notebook.

  4. Opened a terminal and ran the steps in the "Running the local example" section.
    bash ./preprocess_local.sh -- fails because it uses the bare "python" command: python points to /opt/conda/bin/python, while the Jupyter kernels point to /opt/conda/envs/ipykernel_py2/bin/python. The latter is the one with all the dependencies.
    After changing the python path in preprocess_local.sh, the script worked.
    The same change was needed in train_local.sh and process_tfma_local.sh.

  5. Ran all cells in chicago_taxi_tfma.ipynb successfully.

  6. bash ./start_model_server_local.sh
    ERROR: This script requires Docker
    [kinda obvious haha]
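The script fix in step 4 can be sketched as a one-time rewrite of the three scripts. The interpreter path is the one reported in this log (it may differ on other images), and the sed pattern assumes the scripts invoke a bare "python" at the start of a line:

```shell
# Point the three local scripts at the Python env that actually has
# the dependencies (path as reported in this log; adjust as needed).
PYBIN=/opt/conda/envs/ipykernel_py2/bin/python
for f in preprocess_local.sh train_local.sh process_tfma_local.sh; do
  # Only rewrite lines that invoke the bare "python" command.
  if [ -f "$f" ]; then
    sed -i "s|^python |$PYBIN |" "$f"
  fi
done
```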

Conclusion:

  1. Minimum 4Gi of memory for JupyterHub (even 2Gi was enough for training; the problems started when plotting the results).
  2. Need to alias python to a "normal" python (normal meaning one that has all the dependencies).

This test was performed on Minikube.
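Conclusion 2 could also be handled without editing each script: put the env that has the dependencies first on PATH. The env path below is the one reported in this log and is an assumption for other images; a minimal sketch:

```shell
# Make the fully-provisioned Python resolve first on PATH.
# The env path is the one reported in this log; it may differ
# on other Jupyter images.
export PATH=/opt/conda/envs/ipykernel_py2/bin:$PATH
```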

@Maerville
Contributor

Performed the same test on kubeadm (one machine, with 4 CPUs and 16Gi provided to JupyterHub).
Everything works successfully, but the python link also had to be changed.

@jlewi
Contributor Author

jlewi commented Apr 18, 2018

Fantastic, thank you.

@jlewi
Contributor Author

jlewi commented May 10, 2018

/close
