
Road to a supported installation #27

Closed
17 tasks done
cirocosta opened this issue Apr 10, 2019 · 7 comments


cirocosta commented Apr 10, 2019

Hey,

Below is a list of items that we will need to complete/confirm before we can start moving/adding workloads onto hush-house:

Thanks!


cc @scottietremendous

cirocosta pinned this issue Apr 10, 2019

YoussB commented Apr 11, 2019

After talking to @cirocosta this morning, the steps we should take are as follows:

@scottietremendous wdyt?


YoussB commented Apr 11, 2019

Extra things to think about (not needed for hush-house GA, but might be useful):

  • with stable/concourse: separate worker, web deployments helm/charts#12920 merged, our process should revolve around creating only team-specific workers so we can get rid of noisy neighbors.
  • if possible, we can leverage k8s secrets by creating a namespace per team and giving each team access to its own namespace, so they can use it for their secrets (rough sketch after this list).
  • adding an automatic terraform apply whenever the terraform scripts are updated.
  • adding more documentation about how to use the hush-house terraform scripts.
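
A rough sketch of what the namespace-per-team idea could look like with plain k8s RBAC; every name below is a placeholder, not our real config:

```yaml
# Illustrative only: one namespace per team, plus a RoleBinding that gives
# that team's group access to Secrets in its own namespace.
apiVersion: v1
kind: Namespace
metadata:
  name: concourse-team-a              # placeholder team namespace
---
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: secrets-admin
  namespace: concourse-team-a
rules:
  - apiGroups: [""]
    resources: ["secrets"]
    verbs: ["get", "list", "create", "update", "delete"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: team-a-secrets-admin
  namespace: concourse-team-a
subjects:
  - kind: Group
    name: team-a                      # placeholder group from whatever auth we end up using
    apiGroup: rbac.authorization.k8s.io
roleRef:
  kind: Role
  name: secrets-admin
  apiGroup: rbac.authorization.k8s.io
```

The idea being that a team's group only ever gets bound inside its own namespace.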

scottietremendous commented

@YoussB

I tend to think we should just go with the repo for incident reporting, since that's basically what we do now with Wings.

On the second note, I think we should try out as many of the new features helm can provide us as possible. I think it's important we still use this as a place to experiment.


aegershman commented Apr 12, 2019

Decide between having the workloads on top of GKE or PKS
at the moment, deploying hush-house in PKS wouldn't give the team much more data than we'd get from GKE, where we already have it running, allowing us to not have to learn any details of PKS and just move directly to what we already have.

Apologies for acting as some random dude interjecting my opinion here, but I'd gently suggest reconsidering.

Thoughts:

  1. There aren't too many details to learn that are unique to PKS's actual kubernetes clusters, but if you're going to be dogfooding this with the intent for Pivotal customers to parrot it (like me! :D), it wouldn't hurt to get first-hand experience with the toolset they'll be using: what config options are available when setting up cluster plans, getting credentials to the cluster, etc.
  2. If nothing else, it would be beneficial to remove as much IaaS-specific implementation as possible. The current deployment appears to use IaaS/GKE-specific implementation details for the web/worker.nodeSelector and loadBalancerIP. It seems like there's an opportunity to experiment not just with Concourse's stability on kubernetes, but also with the operationalization of its deployment.
  3. Speaking of operationalization, you may also consider dogfooding running multiple Concourse environments, like a "sandbox" to validate changes in before promoting them to production. It's beneficial because it not only provides the practical value of multiple environments for testing changes, but also helps suss out how to optimize workflows for "promoting" a series of changes from sandbox to prod.

3.1. Dogfooding the "environment promotion" workflow is beneficial because it helps drive out optimizations for declarative configuration that can be statically defined in a helm values.yml and not require an operator to go through "upgrade steps". The more things are operationalized such that configuration keys & values can simply be set, without an operator performing "in-between" steps, the better.
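
To make 3.1 concrete, a hypothetical layout (keys loosely follow the concourse chart's web/worker structure; every value is a placeholder): keep the IaaS/environment-specific bits in a small per-environment values file, so promoting a change is just re-running the same upgrade with a different `-f`.

```yaml
# values-common.yml -- shared, IaaS-agnostic configuration (illustrative)
concourse:
  web:
    externalUrl: https://ci.example.com        # placeholder URL
worker:
  replicas: 4
---
# values-gke.yml -- the only place GKE-specific details would live (illustrative)
web:
  nodeSelector:
    cloud.google.com/gke-nodepool: web-pool    # placeholder GKE node pool
  service:
    loadBalancerIP: 203.0.113.10               # placeholder static IP

# promotion to any environment is then the same command shape, e.g.:
#   helm upgrade hush-house stable/concourse \
#     -f values-common.yml -f values-<env>.yml
```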

thanks for listening to my dissertation 👍


YoussB commented Apr 15, 2019

Hey @aegershman,

thanks for the thoughtful comment. It makes a lot of sense 👍.

  • The point of keeping the environment running against GKE for now is that we have already run some tests against it and it has been hardened enough. We consider PKS a very important use case for our helm chart, but we want to run some tests against it first, and also document the steps needed to use the chart with PKS so that, as you said, other pivots can parrot it.

  • If nothing else, it would be beneficial to remove as much IaaS-specific implementation as possible. The current deployment appears to use IaaS/GKE-specific implementation details for the web/worker.nodeSelector and loadBalancerIP. It seems like there's an opportunity to experiment not just with Concourse's stability on kubernetes, but also with the operationalization of its deployment.

    ^^ these are hush-house-deployment-specific params; we have already created, and are still creating, tests that run Concourse against different deployment params,
    for instance:
    https://github.com/concourse/concourse/blob/master/topgun/k8s/baggageclaim_drivers_test.go#L66-L94
    Please feel free to give us feedback on other stability tests that might be helpful in our case.

  • for the third point, there are currently 2 ways to run an experimental concourse environment on k8s:

    • either using the concourse helm chart directly,
    • or, based on the helm chart, following the deployments with creds or without creds mentioned in this repo, which should let you create a quick sandbox environment (rough sketch after this list).
  • I am not sure I got this point correctly; please tell me if it makes sense.
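
For example, a throwaway sandbox could be roughly along these lines (chart/release names and values are placeholders, not our real hush-house config):

```yaml
# sandbox-values.yml -- illustrative values for a quick, creds-less sandbox;
# everything here is a placeholder.
concourse:
  web:
    externalUrl: http://sandbox-ci.example.com   # placeholder
worker:
  replicas: 1
persistence:
  enabled: false    # throwaway workers, no persistent disks

# then, roughly (helm 2 syntax of the time):
#   helm install --name sandbox -f sandbox-values.yml stable/concourse
```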

cirocosta commented

Hey @YoussB ,

In case we end up going with namespaced secrets as a way of leveraging k8s cred mgmt, we'd need this one tackled first: concourse/docs#96, so that we can give a reference to the teams who end up consuming it.
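
For reference, a minimal sketch of what a consuming team's secret could look like; the namespace prefix and the `<pipeline>.<var>` / `value` key conventions below are assumptions on my side, pending that doc:

```yaml
# Illustrative only: a var for a pipeline of team "main", stored as a plain
# k8s Secret in that team's namespace. The `concourse-main` namespace and
# the `<pipeline>.<var>` / `value` key conventions are assumptions here,
# pending concourse/docs#96.
apiVersion: v1
kind: Secret
metadata:
  name: my-pipeline.git-private-key   # assumed <pipeline>.<var> naming
  namespace: concourse-main           # assumed <prefix><team> namespace
type: Opaque
stringData:
  value: |
    -----BEGIN RSA PRIVATE KEY-----
    ...
```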

Thanks!


YoussB commented Apr 17, 2019

makes sense 👍
