pytorch jupyter notebooks? #1057

jlewi · 2018-06-21T17:30:05Z

With the PyTorch operator coming, should we add better support for PyTorch to our notebooks?

I can think of a few options:

Add PyTorch to our existing Jupyter notebooks
Create a new set of PyTorch Jupyter images
Use someone else's images
Use the Kaggle image (assuming it supports PyTorch); see "[discussion] How can we play well with Kaggle?" #258

/cc @johnugeorge
/cc @pdmack

pdmack · 2018-06-21T17:44:12Z

Users can run `conda install pytorch torchvision -c pytorch` themselves in the current notebook image, right?
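
A minimal sketch of how a notebook user could check whether that install step is needed at all. The helper name `missing_packages` is hypothetical; it relies only on the standard-library `importlib.util.find_spec`, and `torch` / `torchvision` are the import names of the `pytorch` and `torchvision` conda packages.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Check for the modules provided by the pytorch and torchvision packages.
    missing = missing_packages(["torch", "torchvision"])
    if missing:
        print("Not installed:", ", ".join(missing))
        print("Try: conda install pytorch torchvision -c pytorch")
    else:
        print("PyTorch and torchvision are already available.")
```

Run from a notebook cell or the terminal, this only reports what is missing; it does not install anything itself.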

ankushagarwal · 2018-06-21T23:36:30Z

> With the PyTorch operator coming should we add better support for PyTorch to our notebooks.

Supporting <framework>'s operator should be orthogonal to supporting <framework> in our jupyter notebook.

+1 for @pdmack's suggestion. Generally speaking, our story for supporting a pip / Python package in a Jupyter notebook should be `pip install` or `conda install`.

johnugeorge · 2018-06-22T04:00:19Z

I agree in general. However, I feel that it would be a better user experience if the PyTorch package were supported out of the box.

jlewi · 2018-06-22T05:18:14Z

I think we should try to provide a holistic experience for other frameworks, just like we do for TF.

So now that we are adding PyTorch support, we should think about supporting it across the stack, e.g.:

Notebooks
Training
Serving
Monitoring

pdmack · 2018-06-22T12:00:03Z

Regarding option 1, the names of the notebook images advertise TF 1.x. I don't know if that would be an impediment to PyTorch users, since the installed PyTorch version would be more or less unknown from just browsing image names.

jlewi · 2018-06-22T13:59:03Z

@pdmack Agreed, but we could always rename the images.

For me the question is what's the right balance between the overhead of maintaining a set of container images, providing a container that has what you need to do things out of the box, and keeping image size tolerable.

I like the idea of building an uber, kitchen-sink container like Kaggle's, mostly as an experiment to see whether customers find that extreme solution valuable.

jlewi · 2018-06-22T14:00:51Z

Also I think it would be really valuable if users could run any Kaggle solution right out of the box.

pdmack · 2018-06-22T15:33:40Z

What about that? A big, honkin' Kaggle image that we de-couple from the release cadence. Latest of all the frameworks, lightly curated, and loosely maintained. At least to start. Keep it out of the spawner defaults but promote it through docs, website. Make it clear that it's a long pull, other caveats, etc.

johnugeorge · 2018-06-22T17:26:21Z

I feel there is always value in providing what users need rather than asking them to install something before using it.

pdmack · 2018-06-22T17:36:33Z

FYI, I just tried to build the Kaggle image but ran out of room in my base dm. It was at least 59 minutes in and approaching 12 GB in size.

pdmack · 2018-06-22T19:44:27Z

I'm skeptical that a multi-stage build with the Kaggle image would work, but I'll have a look. Alternatively, we could consider deriving a Kaggle image from our TF 1.8 image and adding in the missing parts. I'm guessing there's a good amount of overlap.

pdmack · 2018-06-22T21:13:54Z

Oh my...

kaggle/python latest cdc6ffe1b12c 2 months ago 16.8GB

https://gist.github.com/pdmack/6bea356917d6edbad0eccf46a27970eb

pdmack · 2018-06-22T23:39:35Z

Successfully built 178635b0097c
Successfully tagged kaggle/python-build:latest
real	119m34.774s

"gulp"
This was on a somewhat older Xeon CPU, but it's a 24-way and has 48 GB RAM.
Intel(R) Xeon(R) CPU X5670 @ 2.93GHz

jlewi · 2018-06-28T09:19:45Z

@pdmack This is pretty cool. Were you able to start Jupyter in the Kaggle image and use it with JupyterHub?

pdmack · 2018-06-28T11:13:44Z

No, it doesn't have start-singleuser.sh in place. Note that this wasn't the proposed multi-stage build or anything like that, just the upstream Dockerfile build. I suppose I could still look at that by doing a COPY of our /usr/local/bin/ scripts. But 2 hours for an almost 20 GB image? Do we want to go there?

pdmack · 2018-06-28T15:39:55Z

This is as far as I got reusing the kaggle docker image and adding our special sauce:

Traceback (most recent call last):
  File "/opt/conda/bin/jupyterhub-singleuser", line 3, in <module>
    from jupyterhub.singleuser import main
  File "/opt/conda/lib/python3.6/site-packages/jupyterhub/singleuser.py", line 34, in <module>
    from notebook.notebookapp import (
  File "/opt/conda/lib/python3.6/site-packages/notebook/notebookapp.py", line 40, in <module>
    ioloop.install()
  File "/opt/conda/lib/python3.6/site-packages/zmq/eventloop/ioloop.py", line 210, in install
    assert (not ioloop.IOLoop.initialized()) or \
AttributeError: type object 'IOLoop' has no attribute 'initialized'

https://github.com/pdmack/kubeflow/tree/kaggle-nb-image
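
For context on the traceback above: as far as I can tell, `IOLoop.initialized()` was removed in Tornado 5, while pyzmq releases older than 17 still call it, so this error usually points at a tornado / pyzmq version mismatch inside the image rather than at our scripts. That diagnosis is an assumption and was not verified against this image. A minimal sketch for checking which versions an image actually contains (the helper name `installed_version` is hypothetical; it avoids `importlib.metadata` because the image runs Python 3.6):

```python
import importlib


def installed_version(module_name):
    """Return a module's version string, or None if it is missing or unversioned."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    # Most packages expose __version__; tornado exposes a plain `version` string.
    for attr in ("__version__", "version"):
        value = getattr(module, attr, None)
        if isinstance(value, str):
            return value
    return None


if __name__ == "__main__":
    for name in ("tornado", "zmq", "notebook", "jupyterhub"):
        print(name, installed_version(name))
```

If the output shows Tornado 5 or later next to a pyzmq older than 17, upgrading pyzmq or pinning Tornado below 5 would be the first things to try.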

jzf2101 · 2018-06-29T22:13:43Z

My 2c from the Binder ecosystem: I've seen people install PyTorch and torchvision on Binder after creating a conda environment using @soumith's conda channel, though these are not the latest installation instructions.

See example:

https://github.com/jrzech/reproduce-chexnet/blob/master/postBuild

jlewi · 2018-06-29T23:48:34Z

@pdmack Point taken.

Perhaps we should create an examples container intended to be used for running the examples. By definition, the image will contain the union of dependencies needed to run the Kubeflow examples. So it will grow over time until i) it becomes too large or ii) we hit version conflicts.

By that definition it would make sense to start building an image with PyTorch and TF.

For 0.3 I'd like to be able to:

Offer users click-to-deploy Kubeflow
Drop the user into Jupyter
Have the user walk through one or more Kubeflow codelabs in JupyterLab

I'd like users to be able to do everything from JupyterLab; i.e. JupyterLab provides a notebook editor, basic text editor, and terminal which is sufficient for running the codelabs.

jlewi · 2018-07-06T19:51:47Z

Looks like @pdmack published a version of the image in
gcr.io/kubeflow-dev/kubeflow-kaggle-notebook:latest

jlewi · 2018-07-06T22:57:07Z

I retagged into gcr.io/kubeflow-images-public/kaggle-notebook:v20180629

I retagged it using Google Container Builder; trying to use `gcloud container images add-tag` choked.
Here's the GCB config:
https://github.com/jlewi/kubeflow-dev/tree/master/kaggle-image

jlewi · 2018-08-10T14:08:22Z

Let's add PyTorch to our codelab notebook image (see #1157) and then close this out.

chrisheecho · 2018-10-25T15:24:36Z

/remove-priority p2

jlewi · 2018-12-03T19:53:19Z

Anyone working on this?
Should we punt this to 0.5?

carmine · 2018-12-04T04:49:28Z

Move to 0.5.0, same priority.

jlewi · 2019-02-04T14:34:51Z

Downgrading to P2 since we are focusing on xgboost and TF.

I think a good next step would be to try to use some stock Jupyter images for PyTorch with the new notebook CR.

Ideally, we'd like existing jupyter images to just work so that we don't need to build custom images; see #2208. This should be more doable now that we no longer use JupyterHub.

siddsuresh97 · 2019-03-21T10:30:47Z

Hello @jlewi ,
I'm new to contributing to open source. I would like to work on this issue. Could you help me get started?

jlewi · 2019-04-22T14:56:15Z

@siddsuresh97 Docs for creating custom Jupyter images are here:
https://www.kubeflow.org/docs/notebooks/custom-notebook/

If you're interested, you could start by building a Jupyter image suitable for PyTorch and making it work with Kubeflow. You could then publish it on DockerHub and provide instructions here or in kubeflow/website on how people could use it for PyTorch.

stale · 2019-07-21T15:49:55Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

davidspek · 2020-09-23T14:27:01Z

@jlewi I was going through the Notebook issues and found this one. I created a PyTorch image based on jupyter/scipy-notebook that seems to work. It also contains TensorFlow; I'm not sure whether that combination is unnecessary (I don't work with either), but it can easily be removed. Hopefully this helps; otherwise I can do more testing if you specify what would need to be done. Here is the Dockerfile:

# Copyright (c) Jupyter Development Team.
# Distributed under the terms of the Modified BSD License.
ARG BASE_CONTAINER=jupyter/scipy-notebook
FROM $BASE_CONTAINER

LABEL maintainer="Jupyter Project <jupyter@googlegroups.com>"

# Install Tensorflow
RUN pip install --quiet --no-cache-dir \
    'tensorflow==2.3.0' && \
    fix-permissions "${CONDA_DIR}" && \
    fix-permissions "/home/${NB_USER}"

USER $NB_UID

RUN conda config --system --append channels pytorch

RUN conda install --quiet --yes -c pytorch \
    'pytorch' \
    'torchvision' \
    'cpuonly' \
    && \
    conda clean --all -f -y && \
    fix-permissions "${CONDA_DIR}" && \
    fix-permissions "/home/${NB_USER}"

# Configure container startup
EXPOSE 8888
USER jovyan
ENTRYPOINT ["tini", "--"]
CMD ["sh","-c", "jupyter lab --notebook-dir=/home/${NB_USER} --ip=0.0.0.0 --no-browser --allow-root --port=8888 --NotebookApp.token='' --NotebookApp.password='' --NotebookApp.allow_origin='*' --NotebookApp.base_url=${NB_PREFIX}"]

issue-label-bot · 2020-09-23T14:27:10Z

Issue-Label Bot is automatically applying the labels:

Label	Probability
kind/feature	0.57

Please mark this comment with 👍 or 👎 to give our bot feedback!
Links: app homepage, dashboard and code for this bot.

jlewi added the area/jupyter and area/0.3.0 labels Jun 21, 2018

pdmack mentioned this issue Jun 30, 2018

Add Kaggle notebook Dockerfile #1109

Merged

jlewi mentioned this issue Jul 9, 2018

Jupyter image suitable for running the examples/codelabs #1157

Closed

jlewi added the priority/p2 label Aug 10, 2018

richardsliu added area/0.4.0 and removed area/0.3.0 labels Oct 11, 2018

k8s-ci-robot removed the priority/p2 label Oct 25, 2018

carmine added area/0.5.0 and removed area/0.4.0 labels Dec 4, 2018

jlewi removed this from the 0.4.0 milestone Dec 17, 2018

jlewi added this to New in 0.5.0 via automation Dec 17, 2018

jlewi removed this from To do in 0.4.0 Dec 17, 2018

jlewi moved this from New to Build / Train / Deploy from notebook in 0.5.0 Jan 6, 2019

jlewi added cuj/build-train-deploy priority/p2 help wanted and removed priority/p1 labels Feb 4, 2019

jlewi mentioned this issue Feb 4, 2019

Make arbitrary Jupyter images work with Kubeflow #2208

Closed

jlewi added this to New in 0.6.0 via automation Mar 10, 2019

jlewi removed this from Build / Train / Deploy from notebook in 0.5.0 Mar 10, 2019

jlewi added the good first issue label Mar 10, 2019

stale bot added the lifecycle/stale label Jul 21, 2019

jlewi added this to To Do in Needs Triage Jul 26, 2019

stale bot closed this as completed Jul 28, 2019

Needs Triage automation moved this from To Do to Closed Jul 28, 2019

jlewi removed this from Closed in Needs Triage Aug 2, 2019

issue-label-bot bot added the kind/feature label Sep 23, 2020

snyk-bot mentioned this issue Jan 17, 2022

[Snyk] Security upgrade node-fetch from 2.6.0 to 3.1.1 aliceUnhinged613/kubeflow#88

Open
