
Custom notebooks #150
Merged: 24 commits, May 11, 2021
Conversation

@ChaamC (Collaborator) commented Apr 19, 2021

New solution for custom notebooks, related to both of these pull requests:
bird-house/pavics-jupyter-base#3
crim-ca/pavics-jupyter-images#3

The new solution uses a custom script that is executed directly on a specific image such as pavics/crim-jupyter-eo or pavics/crim-jupyter-nlp. The script can be run by configuring a job for the scheduler, via the AUTODEPLOY_EXTRA_SCHEDULER_JOBS variable in the env.local file. It reads a yaml config file found in the specific image's repo, which describes the source of the notebook files to download and the destination where to store them.
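For illustration, a hypothetical sketch of what such a yaml config could look like. The key names source_dir and dest_sub_dir are mentioned later in this thread; the surrounding structure and repo URL are assumptions, not the actual file:

```yaml
# Hypothetical sketch only; the real config lives in the specific image's repo.
deploy_data_sources:                                            # assumed key
  - repo_url: https://github.com/crim-ca/pavics-jupyter-images  # assumed key
    source_dir: eo/notebooks   # folder to download from the source repo
    dest_sub_dir: eo           # subfolder to write under the notebooks dir
```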

There was also a need to restructure the tutorial-notebooks folder, in order to better control which notebooks are available depending on the image selected in JupyterHub.

We can discuss the solution here, but I would propose moving all the current notebooks into a 'common' folder, which would contain the notebooks available to all images, alongside one folder per specific image.
So we would have these paths, for example:

  • tutorial-notebooks/common
  • tutorial-notebooks/crim-eo
  • tutorial-notebooks/crim-nlp
  • ...

We can then mount the common folder for all images, plus the folder related to the specific image.
For now, I have added the common folder to the volume mounts as an example, but it is currently empty, since we still need to move the desired notebooks there.

A restriction with my solution is that the image-specific notebook folder must have the same name as the image in the JupyterHub whitelist (see the code of the CustomDockerSpawner, which uses the name of the spawned image to mount the corresponding folder).

@dbyrns (Collaborator) left a comment

Really good job! We can see the thing taking shape and it's elegant. I still see some possible improvements, but we are really close.

# Add more jobs to ./components/scheduler/config.yml
#
# Potential usages: other deployment, backup jobs on the same machine
#
#export AUTODEPLOY_EXTRA_SCHEDULER_JOBS=""
#
# Example extra job that deploys custom notebooks for a specific image
Collaborator:

Could we just create a script that loops over the DOCKER_NOTEBOOK_IMAGES env var (https://github.com/bird-house/birdhouse-deploy/blob/master/birdhouse/env.local.example#L214) and performs this operation on each of them?
So, if I want to offer the eo image, I add it to the available-images env var and voilà! I also get all its notebooks.
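The suggested loop could be sketched roughly like this (a sketch only; the image values and the short-name derivation are illustrative, not the actual implementation):

```shell
# Hypothetical sketch: loop over DOCKER_NOTEBOOK_IMAGES and derive a short
# name per image; the real job would call the deploy script here instead.
DOCKER_NOTEBOOK_IMAGES="pavics/workflow-tests:210216 pavics/crim-jupyter-eo:0.1.0"
for image in $DOCKER_NOTEBOOK_IMAGES; do
    name="${image##*/}"   # drop the registry/namespace prefix
    name="${name%%:*}"    # drop the tag, e.g. -> "crim-jupyter-eo"
    echo "would deploy notebooks for: $name"
done
```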

Collaborator Author:

Do you mean automatically creating a job for each image found in DOCKER_NOTEBOOK_IMAGES?
I suppose we could; most of the parameters would stay the same, with only the name (eo, nlp, etc.) changing.
But do we always want to create a job for all of those images? For example, since I don't want notebooks specific to pavics/workflow-tests, I don't want a job for that image. But I guess we could organize those images differently, e.g. with another variable listing the images for which we actually want to run the script.

Or did you mean a single job that runs the script for each image?

Collaborator:

I mean a single job running a script that updates all images. Basically what you already have, but inside a loop; apart from the image name, everything else is the same.
And yes, for all images found in DOCKER_NOTEBOOK_IMAGES. Right now we have workflow-tests, but that name is misleading because it's the primary image used by almost everyone! In fact, all images in DOCKER_NOTEBOOK_IMAGES are available to users, so yes, I think we want to keep them all updated.

#
# This is meant to be run on the same host running PAVICS.
#
# The data details are specified in a yaml config file (TEMPLATE_CONFIG_YML), using this format:
Collaborator:

It's not clear here that the yaml config file must be in the image. That is the case, right?
Also, the file is expected to be at a particular location: https://github.com/bird-house/birdhouse-deploy/pull/150/files#diff-17e9b2e274b97a022f797d1f221f2b50144c0ce3b70537a9faa2b7412b2a2cafR159

Collaborator Author:

Well, in our case we run this script on an image, so the config must be on the same image, yes. But I suppose the script could also be used directly on the host, without any image. Since our use case here is to run it on a docker image, I should clarify that the config file should be on the image too. :P

Also, the script has to be copied, when building the docker image, to the same path that is written in the job command (as in the link you posted).

Collaborator:

Hum, ok, I agree that this script doesn't care where the config file is located; it is passed as an argument.
In fact, what is needed is a better place to document the location of the config file in the images. All images will require this config file at a specific location so that the job (my link) can find it.

#- name: notebookdeploy-eo
# comment: Auto-deploy tutorial notebooks for the eo image
# schedule: '${AUTODEPLOY_NOTEBOOK_FREQUENCY}'
# command: '/deploy-data-specific-image /notebook_config.yml.template'
Collaborator:

Following my previous comment regarding this config, given that this file /notebook_config.yml.template is in the image, why is it a .template?

Collaborator:

This "cronjob" is again pretty much similar to the existing https://github.com/bird-house/birdhouse-deploy/blob/57a320c3583f804838e8c18aeb064266527a0bc9/birdhouse/components/scheduler/deploy_data_job.env. Same question: if a feature is missing, can we add it there rather than forking it?

This one you might not be aware of, since it's under the "scheduler" component because it has to be used with that component.

Using that "generic" cronjob allows a much simplified cronjob definition like this: https://github.com/bird-house/birdhouse-deploy/blob/57a320c3583f804838e8c18aeb064266527a0bc9/birdhouse/components/scheduler/deploy_raven_testdata_to_thredds.env

Collaborator Author:

So, I used the word template because in this version of the config we find the variable ${JUPYTERHUB_USER_DATA_DIR}, which gets replaced by its value when running the script.
We make a copy of the config file using envsubst < $TEMPLATE_CONFIG_YML > $CONFIG_YML, where the copy contains the real value instead of the variable.
I agree 'template' is not the best word here though; I don't know if you have a better idea.

Regarding your comment about the dest_dir variable found in the yaml config, I could remove it, and we could always output the notebooks to the same directory, ${JUPYTERHUB_USER_DATA_DIR}/tutorial-notebooks/{$IMAGE_NAME}.
Removing that customization option would also remove my need for a copy of the config with variables to replace.
If we decide to remove the dest_dir option, I will remove all mentions of .template, and remove the envsubst command from the script.

Collaborator:

Ok, it solves two issues and makes the whole thing simpler, so I would remove the dest_dir option from the config.
But I would not use the directory ${JUPYTERHUB_USER_DATA_DIR}/tutorial-notebooks/{$IMAGE_NAME}. This is where the notebooks need to be on the host, but in the image they could be anywhere, e.g. /tmp, as long as the calling command mounts the host volume at that point. My suggestion still holds: in the docker command, include the volume mount and set an environment variable telling the script where to put the stuff (the dest_dir, but as an env var rather than a config entry).

Collaborator:

Again, now that I have more time to digest this PR...

I would suggest also moving this big sample blob of cronjob-generation code into the jupyter-pavics-base repo, so it sits together with the deploy-data-specific-image script it wraps.

  • It makes env.local.example shorter and less intimidating.
  • You then own the "code", so if you need to fix something (like adding/removing a volume-mount) you can do it transparently for all callers; all they need to do is always run the latest version. You basically provide users with a stable interface while keeping control of the implementation.
  • Later, in a different PR, once we "migrate" to the generic deploy-data script, the matching generic cronjob components/scheduler/deploy_data_job.env will also have to be adapted for whatever newer options deploy-data will support. We then keep the same consistency: the "script" and the "cronjob wrapper" stay together.

Try to take inspiration from the existing generic cronjob wrapper, especially the part about

if [ -z "`echo "$AUTODEPLOY_EXTRA_SCHEDULER_JOBS" | grep $DEPLOY_DATA_JOB_JOB_NAME`" ]; then
# Add job only if not already added (config is read more than once during
# autodeploy process).
to ensure it also works during autodeploy (there is a slight difference between ./pavics-compose.sh invoked manually on the console and invoked from inside the scheduler component).

Collaborator:

Forgot to say: once you move the "cronjob generation code" out to jupyter-pavics-base, you can reference it back in env.local.example to "advertise" it, like

# Load pre-configured cronjob to automatically deploy Raven testdata to Thredds
# for Raven tutorial notebooks.
#
# See the job for additional possible configurations. The "scheduler"
# component needs to be enabled for this pre-configured job to work.
#
#if [ -f "$COMPOSE_DIR/components/scheduler/deploy_raven_testdata_to_thredds.env" \
# -a -f "$COMPOSE_DIR/components/scheduler/deploy_data_job.env" ]; then
# . $COMPOSE_DIR/components/scheduler/deploy_raven_testdata_to_thredds.env
# . $COMPOSE_DIR/components/scheduler/deploy_data_job.env
#fi

(Resolved threads: birdhouse/default.env, birdhouse/deployment/deploy-data-specific-image)
@tlvu (Collaborator) left a comment

I think we are heading in the right direction.

I just want to ensure a seamless transition and also more code re-use.

Consider this my first-pass review, as I have not yet seen the other 2 PRs that leverage this feature.

(Resolved threads: birdhouse/config/jupyterhub/jupyterhub_config.py.template, birdhouse/default.env)
echo "Extracting ${FULL_URL} to ${DEST_DIR}"

# Download the data from github and copy it to the destination directory
svn export --force $FULL_URL $DEST_DIR
Collaborator:

Download from github using an svn client? I didn't know that could be done! But why not just use a git client?

The bigger question is that this script seems to do very similar work to the existing deploy-data script. It's not clear what feature is missing, so why can't we re-use the existing deploy-data script? If a feature is missing, can we add it instead of forking another script?

Collaborator Author:

They do have a lot of similarities. I could give it a try and use the deploy-data script directly, but I am afraid of hitting a blocking point at some point, if we want to add different features to one of the use cases.
They are almost identical though, if not the same, for now.

Also, I see that the deploy-data script requires Docker, which means we would have to install Docker on pavics-jupyter-base (this would actually replace the yq/jq installation). I don't know what is best here, if we want to keep those images to the bare minimum.

@dbyrns I would be curious about your input on this :)

Collaborator:

I don't have enough information on what @tlvu would like you to reuse to make that call. If he could show us how it could easily be done, I'm not against it, as long as we don't have to invest another couple of days doing it. The same applies to the other reuse request; maybe you could talk directly to each other so that it can be resolved quickly.

Collaborator Author:

I think the main difference between our scripts is that the deploy-data script is meant to run on a generic image that includes Docker and Git, while our new deploy-data-specific-image script is meant to run directly on one of our own specific images, which includes our config.yaml file.

A benefit of the new script is that it lets us keep the yaml file directly in the specific image's repo, close to its related context. For example, if a developer wants to include a new folder for the crim-eo environment, they just go to the crim-eo repo and change the config there.

I am not sure we could do this easily with the deploy-data script; I think its yaml files are stored directly in the birdhouse-deploy repo, such as deploy_raven_testdata_to_thredds.env and deploy-data-raven-testdata-to-thredds.yml.

They are then added as a job in the env.local file:

#if [ -f "$COMPOSE_DIR/components/scheduler/deploy_raven_testdata_to_thredds.env" \

Collaborator:

Now that I have more time to digest this change, I would suggest moving this deploy-data-specific-image script into the jupyter-pavics-base image, so that the script and the config yaml (which is in the child image) are part of the same final image.

Why? So that eventually Jenkins can also check out all the notebooks and run tests on them, as it currently does with the default Jupyter image. The e2e repo should not be responsible for checking out the notebooks as it currently is, and this will let that same e2e repo test against different notebooks depending on the Jupyter image. That requires implementing Ouranosinc/PAVICS-e2e-workflow-tests#57, but we can start to lay the groundwork in that direction.

For the scheduler cronjob, this means you don't even have to volume-mount the script from outside, since it's already inside the image!

As for re-using the existing generic deploy-data script, I think it simply boils down to:

  1. being able to use yq as docker run (currently) or as installed (your case); we will have to add a new switch for that.
  2. downloading the repo as a tar/zip archive (your way) instead of git pull (current); we will have to add a new switch for that as well.
     Note that the tar/zip archive route nullifies the caching provided by git pull. However, the git pull way wastes more space because there is basically a double checkout: one used for caching, one in /data/jupyter_user_data/tutorial-notebooks/.... Each side has pros and cons, so for genericity and re-usability's sake, we can implement both.

In a different PR, once deploy-data has these additional modes of operation, jupyter-pavics-base will wget the script as part of the build and make it available inside the image just as deploy-data-specific-image is now, without it being directly committed as deploy-data-specific-image.


(Resolved threads: birdhouse/env.local.example)
@tlvu (Collaborator) left a comment

So here is my "final-pass" review.

Good job; you seem to grasp the scheduler component and how to add more "jobs" to it.

Things are aligned in the right direction, and that's the most important point. There are possible improvements, but we do not have to make them right now.

Here is what I suggest so we can merge this PR without too much more work:

  • ensure a 100% seamless transition (i.e. when MOUNT_IMAGE_SPECIFIC_NOTEBOOKS == false, everything should behave exactly as before, as if this change never existed).
  • move the custom, non-generic deploy script and the matching cronjob generation to jupyter-pavics-base.

All the issues about code re-use we can handle in a "phase 2" sort of PR, iterative/incremental development style.

One final thing: remember to test this change in an autodeploy context (meaning have the autodeploy job in the scheduler calling ./pavics-compose.sh, which generates the actual ./components/scheduler/config.yml with all the jobs fully instantiated; make sure there are no duplicated jobs and that all the variables are expanded correctly).

# 'nlp-crim': os.environ['DOCKER_NOTEBOOK_IMAGES'].split()[2],
#c.DockerSpawner.image_whitelist = {os.environ['JUPYTERHUB_IMAGE_SELECTION_NAMES'].split()[0]: os.environ['DOCKER_NOTEBOOK_IMAGES'].split()[0],
# os.environ['JUPYTERHUB_IMAGE_SELECTION_NAMES'].split()[1]: os.environ['DOCKER_NOTEBOOK_IMAGES'].split()[1],
# os.environ['JUPYTERHUB_IMAGE_SELECTION_NAMES'].split()[2]: os.environ['DOCKER_NOTEBOOK_IMAGES'].split()[2],
Collaborator Author:

For these lines, I am aware it is not super clean. I haven't found an easier way to implement it yet, since the variables concerned have to work both in shell script and in Python.

About DOCKER_NOTEBOOK_IMAGES and JUPYTERHUB_IMAGE_SELECTION_NAMES, we could have used those names in just one variable instead of two, using a string "formatted similarly to a dictionary" such as "pavics;pavics/workflow-tests:210216 eo-crim;pavics/crim-jupyter-eo:0.1.0 ....", but it would require additional string splitting, so I don't think it would be cleaner.

Another option would be to simplify this by using the same name everywhere, instead of having 2 different name formats. We would then use the full image name such as pavics/crim-jupyter-eo:0.1.0 everywhere, meaning it would be what we see in the JupyterHub list and also the name of the tutorial-notebooks directories, instead of the shorter version eo-crim. I don't think that would necessarily be cleaner either. (There is also a potential problem with the '/' found in image names clashing with its use as a directory name.)
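For what it's worth, the pairing of the two parallel lists can be sketched on the shell side like this (values illustrative, not the production config):

```shell
# Pair the nth selection name with the nth image using positional parameters.
JUPYTERHUB_IMAGE_SELECTION_NAMES="pavics eo-crim nlp-crim"
DOCKER_NOTEBOOK_IMAGES="pavics/workflow-tests:210216 pavics/crim-jupyter-eo:0.1.0 pavics/crim-jupyter-nlp:0.1.0"
set -- $DOCKER_NOTEBOOK_IMAGES
PAIRS=""
for name in $JUPYTERHUB_IMAGE_SELECTION_NAMES; do
    PAIRS="$PAIRS $name=$1"   # nth name paired with nth image
    shift
done
echo "$PAIRS"
```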

Collaborator:

Agreed it's ugly, but I also agree the other alternatives either complicate the implementation a lot or look less great. Open to other alternatives as well, if someone sees something we both missed...

@tlvu (Collaborator) left a comment

Just a first pass; I have not seen the other matching PRs yet. Nothing major so far, looking good.


# Log file location. Default location under /var/log/PAVICS/ has built-in logrotate.
if [ -z "$DEPLOY_DATA_JOB_LOGFILE" ]; then
DEPLOY_DATA_JOB_LOGFILE="/var/log/PAVICS/${DEPLOY_DATA_JOB_JOB_NAME}.log"
Collaborator:

Not using the new PAVICS_LOGDIR you added?

Collaborator Author:

I will replace it here, but if I search for the "/var/log/PAVICS" path in the repo, I see that it is referenced many times, in various other jobs and configs (logrotate, autodeploy, trigger-deploy-notebook, ...). Should I also take the liberty of replacing those with the new variable? I am not sure I want to change anything in parts of the code that I don't know about.

Collaborator Author:

@tlvu Let me know what I should do for this :)

Collaborator Author:

@tlvu ?

Collaborator:

Sorry, I missed this question. Yes indeed, it would be nice to clean up all the existing hardcoded paths, but I'd suggest just finishing this PR, which has been open for a while already. You can open another PR to fix the remaining references.

(Resolved threads: birdhouse/config/jupyterhub/jupyterhub_config.py.template, birdhouse/env.local.example)
@tlvu (Collaborator) left a comment

Great work. I think you have all the pieces. Just simplify them a bit, since you are not aiming for something generic (it's all specific to notebook deployment).

(Resolved threads: birdhouse/env.local.example)
@dbyrns (Collaborator) left a comment

I feel bad for the long journey, but we are almost there.

@tlvu (Collaborator) commented Apr 27, 2021

I feel bad for the long journey, but we are almost there.

Yes, sorry @ChaamC, I was the one who led you into having the loop outside env.local in comment #150 (comment).

At the time, not knowing how things would turn out, I was simply thinking in terms of what I had at hand.

@ChaamC (Collaborator Author) commented May 4, 2021

@dbyrns @tlvu @matprov
Updated PR: moved the env file to the jupyter-pavics-base repo, and fixed the other concerns raised in your feedback.

Let me know if it works!

@ChaamC ChaamC requested review from dbyrns and tlvu May 4, 2021 17:11
@tlvu (Collaborator) left a comment

Looks good on this side.

@dbyrns (Collaborator) left a comment

Perfect! I really like how it turned out :)

@ChaamC (Collaborator Author) commented May 10, 2021

@dbyrns @tlvu @matprov

After testing on the daccs-iac, I found one bug with the current solution.
It seems the existing 'notebookdeploy' scheduler job conflicts with the new image-specific jobs we are trying to add.
I did not see it at first on a local setup, since the 'notebookdeploy' scheduler job was only running once per hour, but I started seeing weird behaviors with the volume mounts on the daccs-iac, where the 'notebookdeploy' job runs every 5 minutes.

Every 5 minutes, I first see the image-specific folders getting updated as expected, but the 'notebookdeploy' job finishes afterwards and erases the eo-crim/nlp-crim folders. I was testing the image-specific job every minute, so in the following minutes the eo-crim/nlp-crim folders come back and get updated as expected. The problem then reoccurs every 5 minutes.

I think the problem is that the 'notebookdeploy' job erases everything in the 'tutorial-notebooks' folder when running the trigger-deploy-notebook script:

rm -rf $TUTORIAL_NOTEBOOKS_DIR/*

I am not sure what the best solution would be here. Maybe we need to restructure those directories.
I would be curious about your feedback on this.
Should we close this PR first, and attack this problem in a new one?
We can still keep the feature disabled in the configs while we fix this part.

@tlvu (Collaborator) commented May 10, 2021

the 'notebookdeploy' job seems to erase everything in the 'tutorial-notebooks' folder

@ChaamC
Yes indeed, I forgot about this little detail again. It deletes everything so that when an existing notebook is renamed or deleted, the old name is not left behind.

A quick solution I can propose is to deploy your notebooks to a different location on disk, e.g. /data/jupyterhub_user_data/tutorial-notebooks-multi-image/. On disk it's a different location, but you can still volume-mount it to the same /notebook_dir/tutorial-notebooks/ inside the Jupyter container.

The MOUNT_IMAGE_SPECIFIC_NOTEBOOKS feature toggle will then also have to look for the new location on disk.

I had to do the exact same trick (deploying to another location on disk, /data/jupyterhub_user_data/pavics-homepage) to deploy our extra tutorial notebook for our new landing page in PR bird-house/birdhouse-deploy-ouranos#8.

Good thing you tested in the real IAC env that replicates the prod env!
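The relocation trick boils down to: new host directory, unchanged container path. A sketch of the volume flag the spawner would need (the flag construction is an assumption for illustration; only the two paths come from the comment above):

```shell
# New location on disk, same mount point inside the Jupyter container.
HOST_DIR=/data/jupyterhub_user_data/tutorial-notebooks-multi-image
CONTAINER_DIR=/notebook_dir/tutorial-notebooks
MOUNT_FLAG="-v ${HOST_DIR}/eo-crim:${CONTAINER_DIR}/eo-crim"
echo "$MOUNT_FLAG"
```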

@tlvu (Collaborator) commented May 10, 2021

Speaking of not leaving old names behind, does svn export --force $FULL_URL $FULL_DEST_DIR in https://github.com/bird-house/pavics-jupyter-base/blob/6a17b1965346f0c8d0893b5accda0f4f52cf230a/scheduler-jobs/deploy_data_specific_image#L86 override the entire folder?

The current deploy-data (

--recursive --links --checksum --delete \
) has the rsync --delete option to handle that case.

This is still a corner case. If renamed or deleted notebooks are not handled yet, just merge this PR now and handle it in a different PR.

@ChaamC (Collaborator Author) commented May 11, 2021

I checked the svn command and it doesn't seem to override the folder. If I put some random file in the folder tutorial-notebooks/eo-crim/eo and run the deploy job with these settings:

source_dir: eo/notebooks
dest_sub_dir: eo

it just takes every file from the source_dir and sends them to the dest_sub_dir, updating any file that already exists under the same name.
Any other file found in the dest_sub_dir will just stay there for now.

@ChaamC (Collaborator Author) commented May 11, 2021

Updated with the new fix.
The tutorial-notebooks files related to specific images are now found in another folder on the host.
I will do a quick check in the iac too, and I think I can merge afterwards, if it's good for everyone!

@tlvu (Collaborator) left a comment

Looks good to me with the new location on disk for the notebooks. But I'd rather we play it safe and revert the c.DockerSpawner.pull_policy = "always" change, unless there is a compelling reason for it.

Make sure to re-test in a real IAC setup.

(Resolved thread: birdhouse/config/jupyterhub/jupyterhub_config.py.template)
@ChaamC (Collaborator Author) commented May 11, 2021

Did more tests in the IAC setup, and the new fix seems to be working fine now!

I see that the E2E tests are failing, but they seem to have failed in other recent PRs too, with similar types of errors, so I suppose it is correct for me to ignore this and go on with the merge.

Edit: Received confirmation from Mathieu that the E2E errors are not related to this PR, so proceeding with the merge :)

@ChaamC ChaamC merged commit d90765a into master May 11, 2021
@ChaamC ChaamC deleted the custom-notebooks branch May 11, 2021 16:57
@matprov (Collaborator) commented May 11, 2021

Good to know @ChaamC! That was not an easy one, but it's finally getting merged. Great job :)
