Consider continuous deployment for this Repo #5

yuvipanda · 2018-04-02T17:01:42Z

This is awesome work, @jacobtomlinson! I'm also excited to see chartpress being used :)

What you think of continuous deployment for this repo? It took the pressure off me and other ops-y people in both mybinder.org and the berkeley jupyterhub deployment. See http://mybinder-sre.readthedocs.io/en/latest/deployment/how.html for how deployments to mybinder.org work. https://github.com/jupyterhub/mybinder.org-deploy/graphs/contributors shows that people of various skillsets can deploy with confidence.

This will also allow us (The JupyterHub community) to document good practices around continuous deployment, security management in public repos, and growing a community around your deployment.

Possible sequence of steps to doing CD:

Move the docker image contents here
Use git-crypt to store deployment secrets. MyBinder.org uses it to good effect!
Set up automatic build and push on merge for docker image
Create a 'staging' environment that mimics the production environment
Create a Google Cloud Service Account that has just enough rights to do a helm upgrade
Create two branches - 'staging' and 'prod'. Merging to 'staging' will deploy to the staging environment. Deployer then verifies staging environment works as intended, and merges PR to 'prod' environment after.

Open questions

Who has merge (and hence deploy) rights?
Whose responsibility it is to fix stuff when things break?
How do we incorporate other deployments (on AWS? etc) to this CD pipeline?

The text was updated successfully, but these errors were encountered:

yuvipanda · 2018-04-02T17:01:58Z

/cc @mrocklin who might be interested too.

jacobtomlinson · 2018-04-03T08:10:16Z

I agree this would be a good thing to do. Thanks for the advice!

This repo is the generic helm chart for making 'a pangeo'. I wonder if you're thinking about pangeo.pydata.org which I would describe as 'an instance of a pangeo' and is managed from pangeo-data/pangeo. So perhaps the CI/CD should go there?

I would be keen to add better automated testing and publishing of charts to this repo.

yuvipanda · 2018-04-03T08:19:08Z

Apologies! I didn't know 'A Pangeo' is different from 'The Pangeo'! This issue does belong on the pangeo-data/pangeo repo. Publishing this chart automatically to a github pages repo might be a good idea (and is something chartpress can do).

…

On Tue, Apr 3, 2018 at 1:10 AM, Jacob Tomlinson ***@***.***> wrote: I agree this would be a good thing to do. Thanks for the advice! This repo is the generic helm chart for making 'a pangeo'. I wonder if you're thinking about pangeo.pydata.org which I would describe as 'an instance of a pangeo' and is managed from pangeo-data/pangeo. So perhaps the CI/CD should go there? I would be keen to add better automated testing and publishing of charts to this repo. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#5 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAB23rd7a-ORCH-k-j6XuqMDEySZpFRkks5tky5tgaJpZM4TDyCo> .

-- Yuvi Panda T http://yuvi.in/blog

jacobtomlinson · 2018-04-03T08:26:26Z

I'm keen for Pangeo to be a generic thing, like JupyterHub, which many people can deploy. pangeo.pydata.org is the reference deployment, but we also have our own at the Met Office.

Yeah I've got Travis set up to auto build and deploy using chartpress, but I set it up to only deploy on tags which means the commit range is not provided. I raised #2 to discuss solutions.

yuvipanda · 2018-04-03T08:30:26Z

Why do you prefer deploying only on tags rather than on every commit?

…

On Tue, Apr 3, 2018 at 1:26 AM, Jacob Tomlinson ***@***.***> wrote: I'm keen for Pangeo to be a generic thing, like JupyterHub, which many people can deploy. pangeo.pydata.org is the reference deployment, but we also have our own at the Met Office. Yeah I've got Travis set up to auto build and deploy using chartpress, but I set it up to only deploy on tags which means the commit range is not provided. I raised #2 <#2> to discuss solutions. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#5 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAB23q9PtkRffmCr_JDEw1s_lyvdEoX1ks5tkzI0gaJpZM4TDyCo> .

-- Yuvi Panda T http://yuvi.in/blog

jacobtomlinson · 2018-04-03T08:39:59Z

Tags are created with intent to mark a milestone, have been tested more thoroughly and come with some (possibly small) guarantees that it will definitely work. They also use semver which is well understood and means that package managers can intelligently upgrade.

I feel that using commits causes headaches because it is not always obvious which commit is the latest, particularly because they often get listed alphabetically. It is also hard to know where you revert back to if you encounter a bug, if there have been 30 commits since the last version you were on do you go back to where you were before to try somewhere in the middle.

As a first step I think using git describe --tags --abbrev=1 which includes the number of commits since the last tag helps make things clearer.

rabernat · 2018-04-03T10:54:59Z

Thank you both for your work on this!

For me, continuous deployment of pangeo.pydata.org is a very high priority (see e.g. pangeo-data/pangeo#131). The reason is that we really don't have the resources for a full time sys-admin team to babysit the cluster. Right now, the burden of pushing bugfix changes and redeploying the cluster falls on @mrocklin, @jacobtomlinson, or anyone else with the tech savvy who happens to be available when a meltdown occurs. This is not really sustainable.

I do also agree with Jacob that we want to provide a "generic" pangeo deployment template, with pangeo.pydata.org as a particular instance.

rabernat · 2018-04-30T13:18:35Z

Just following up on this older discussion, now that I have gotten my feet wet a bit with docker, helm, etc.

In my rush to get a new chart built, I just enabled deployment of every commit, contrary to @jacobtomlinson's recommendations above. I am happy to roll this back, as now I understand the arguments better. However, it seems like jupyterhub deploys every commit, so clearly there are different approaches possible.

My main question is: given that this is the "generic" pangeo helm chart, what additional configuration is required for a specific deployment (e.g. pangeo.pydata.org, met office, etc.) And where should that config live? Should we have an additional repo dedicated just to pangeo.pydata.org deployment?

rabernat · 2018-05-17T19:50:59Z

We continue to update the docker files in the main pangeo repo (pangeo-data/pangeo#261). I would really like to get the docker files moved here and automatically built.

@yuvipanda would you care to suggest how we might move towards implementing the continuous deployment as described in your original post? Just start checking off the boxes one by one?

jgerardsimcock · 2018-07-06T20:47:37Z

What is the procedure for having travis deploy charts to gh-pages via chartpress? My .travis.yml and deploy.sh files are similar to those in this repo. Its not clear to me how to configure the gh-pages branch for travis to do what it needs to do. Travis is choking on this line in chartpress.py

jacobtomlinson · 2018-07-09T07:53:44Z

@jgerardsimcock you also need to have an SSH key which can push to the gh-pages branch on your repo. We are doing this with an encrypted secret file. You also need to start off with an orphaned gh-pages branch.

Shout if you need any more help.

rabernat · 2018-08-14T13:57:38Z

We have accomplished the first three steps of @yuvipanda's checklist.

But issue #54 highlights the need for a test cluster before we deploy to the production cluster. Maybe we can work on this this week at the Pangeo meeting.

yuvipanda · 2018-08-15T22:05:29Z

https://github.com/yuvipanda/hubploy is the code I'm working on that lets you do CD easily for things like this. It complements chartpress. Some docs here: https://hubploy.readthedocs.io/ but more coming.

guillaumeeb · 2018-12-18T09:45:39Z

I'm reviving some old issue here, but I'd like to revive this repo. So sorry for the noise in advance.

I think this one can be closed as this as been done in https://github.com/pangeo-data/dev.pangeo.io-deploy.

I support the distinction of @jacobtomlinson between 'A pangeo' and 'The Pangeo'. So keeping this repo without CD on a given site seems fine.

jhamman · 2019-01-26T19:26:57Z

I'm in agreement with @guillaumeeb here. I will close this in favor of the recent hubploy deployments.

rabernat mentioned this issue May 19, 2018

move pangeo docker images here #27

Merged

rabernat mentioned this issue Jun 14, 2018

help wanted! #38

Closed

jhamman closed this as completed Jan 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider continuous deployment for this Repo #5

Consider continuous deployment for this Repo #5

yuvipanda commented Apr 2, 2018 •

edited by rabernat

yuvipanda commented Apr 2, 2018

jacobtomlinson commented Apr 3, 2018

yuvipanda commented Apr 3, 2018 via email

jacobtomlinson commented Apr 3, 2018

yuvipanda commented Apr 3, 2018 via email

jacobtomlinson commented Apr 3, 2018 •

edited

rabernat commented Apr 3, 2018

rabernat commented Apr 30, 2018

rabernat commented May 17, 2018

jgerardsimcock commented Jul 6, 2018

jacobtomlinson commented Jul 9, 2018

rabernat commented Aug 14, 2018

yuvipanda commented Aug 15, 2018

guillaumeeb commented Dec 18, 2018

jhamman commented Jan 26, 2019

Consider continuous deployment for this Repo #5

Consider continuous deployment for this Repo #5

Comments

yuvipanda commented Apr 2, 2018 • edited by rabernat

yuvipanda commented Apr 2, 2018

jacobtomlinson commented Apr 3, 2018

yuvipanda commented Apr 3, 2018 via email

jacobtomlinson commented Apr 3, 2018

yuvipanda commented Apr 3, 2018 via email

jacobtomlinson commented Apr 3, 2018 • edited

rabernat commented Apr 3, 2018

rabernat commented Apr 30, 2018

rabernat commented May 17, 2018

jgerardsimcock commented Jul 6, 2018

jacobtomlinson commented Jul 9, 2018

rabernat commented Aug 14, 2018

yuvipanda commented Aug 15, 2018

guillaumeeb commented Dec 18, 2018

jhamman commented Jan 26, 2019

yuvipanda commented Apr 2, 2018 •

edited by rabernat

jacobtomlinson commented Apr 3, 2018 •

edited