sftd: add support for multiple SFT servers #1325

arianvp · 2021-01-13T16:23:37Z

* The ingress assigns an SFT allocation request to a random SFT server.
* Each sftd pod is made aware of a URL on which it is directly reachable, and returns that URL in its response to the client. For example, pod `sftd-0` is assigned `https://sft.example.com/sfts/sftd-0`.
* The client shares this URL with other clients that want to join the call.
* Other clients make a request to this URL.
* The ingress routes requests to `/sfts` to the `join-call` deployment, which redirects to the specific pod so that the client can join the other client's conference call.
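
The flow described above can be sketched roughly as follows. This is only an illustrative model, not the chart's implementation: the pod list, `allocate_call`, and `resolve_join` are hypothetical names, and the base URL is the example host from the description.

```python
import random

# Example host from the PR description; the replica names are hypothetical.
BASE_URL = "https://sft.example.com"
PODS = ["sftd-0", "sftd-1", "sftd-2"]


def pod_url(pod: str) -> str:
    """URL on which a single sftd pod is directly reachable."""
    return f"{BASE_URL}/sfts/{pod}"


def allocate_call(rng: random.Random) -> str:
    """Stand-in for the ingress: pick a random pod, which answers with its own URL."""
    return pod_url(rng.choice(PODS))


def resolve_join(url: str) -> str:
    """Stand-in for the join-call deployment: map /sfts/<pod> back to that pod."""
    prefix = f"{BASE_URL}/sfts/"
    if not url.startswith(prefix):
        raise ValueError(f"not an SFT join URL: {url}")
    pod = url[len(prefix):]
    if pod not in PODS:
        raise LookupError(f"unknown pod: {pod}")
    return pod
```

The first client calls `allocate_call` and passes the returned URL to the other participants; every later `resolve_join` on that URL lands on the same pod, which is what allows joining an existing call.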

charts/sftd/templates/ingress.yaml

lucendio

It's a very interesting approach. Amazing work! Some nits and concerns on my end. Not necessarily related to the approach though.
I still have some questions, but that's something for a coffee.

charts/sftd/README.md

jschaul · 2021-02-15T21:17:38Z

@@ -1,5 +1,8 @@
 # SFTD Chart

+In theory the `sftd` chart can be installed on its own, but it's usually
+installed as part of the `wire-server` umbrella chart.


Just a thought: I understand it might be nice to just say `helm install wire-server`. However, bundling "more complex" services (stateful ones, or ones with slower update cycles such as SFT, due to e.g. not wanting to cut off ongoing calls for 6 hours or so) into an otherwise fast-to-upgrade chart may slow down wire-server upgrades going forward. I presume some scripts and/or helmfiles are in use anyway, as more than a single chart needs to be installed.
Are there other benefits that I might be missing? Or am I overestimating the potential downsides?

(feel free to go ahead with the wrapper chart approach, just wondering about the motivations)

arianvp · 2021-02-15 (edited)

I'm really on the fence between the two options. It was originally the way you just described and I only changed it today...

Motivation was:

I wanted to do a bit of refactoring in the wire-server umbrella chart in a follow-up PR so that there is a `globals.baseDomain` option that all charts can use, removing the domain duplication in our `values.yml`. As an experiment, I wanted the SFT chart (which needs to know both the base domain and the domain name of the webapp chart) to use this method of configuration to set its `host` and `allowOrigin` options.
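
As a rough sketch of the idea, a single base domain could be expanded into the two SFT options mentioned here. The subdomain names `sftd` and `webapp` and the function name are assumptions made for illustration; the PR does not specify how the values would actually be derived.

```python
def sftd_values(base_domain: str,
                sft_subdomain: str = "sftd",       # hypothetical subdomain
                webapp_subdomain: str = "webapp",  # hypothetical subdomain
                ) -> dict:
    """Derive the sftd `host` and `allowOrigin` options from one base domain."""
    return {
        "host": f"{sft_subdomain}.{base_domain}",
        "allowOrigin": f"https://{webapp_subdomain}.{base_domain}",
    }
```

With a scheme like this, the domain would only need to be stated once in the values file instead of once per chart.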

Also, `helm upgrade --wait` only waits for new pods to be spawned; it doesn't wait for old pods to be terminated. So even with a slow update cycle (due to setting `terminationGracePeriodSeconds` to a few hours), upgrades would still be "snappy".

However, I just realised that this is only true for a Deployment, not for a StatefulSet: a StatefulSet replaces pods one by one (see the Future Work section at the bottom of the README), so it might take longer for `--wait` to return.
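
To make the difference concrete, here is a deliberately simplified worst-case model of how long `--wait` could block, following the reasoning in this comment. It assumes a Deployment starts all new pods at once without waiting for old ones, and that a StatefulSet replaces pods strictly one at a time, with each old pod using its full grace period before its replacement starts. Real rollouts depend on the update strategy and on how quickly calls drain; the function name and parameters are hypothetical.

```python
def worst_case_wait_seconds(kind: str, replicas: int,
                            grace_period_s: int, startup_s: int) -> int:
    """Rough upper bound on how long a rollout blocks before new pods are ready."""
    if kind == "Deployment":
        # New pods start immediately; old pods drain in the background.
        return startup_s
    if kind == "StatefulSet":
        # Each pod must terminate before its replacement can start.
        return replicas * (grace_period_s + startup_s)
    raise ValueError(f"unsupported kind: {kind}")
```

For example, with 3 replicas, a 6-hour grace period, and 30 seconds of startup time, the model gives 30 seconds for a Deployment and roughly 18 hours for a StatefulSet.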

I'm going to sleep on it and revisit tomorrow. I might end up reverting this again.

arianvp · 2021-02-15

I was inspired to structure things a bit like:

https://gitlab.com/gitlab-org/charts/gitlab
https://docs.gitlab.com/charts/charts/globals.html#configure-host-settings

arianvp · 2021-02-16

@jschaul I have made it clear in the docs that both ways are possible. docs.wire.com currently still instructs users not to use the umbrella chart.

jschaul · 2021-02-16

Great! Thanks!

arianvp force-pushed the sftd-ha-2 branch from 21ac715 to d78b289 on January 13, 2021 17:35

arianvp mentioned this pull request Jan 13, 2021

Horizontally scale conference calling wireapp/wire-server-deploy#392

Closed

arianvp requested a review from julialongtin January 13, 2021 18:52

lucendio suggested changes Jan 18, 2021

lucendio approved these changes Jan 26, 2021

jschaul reviewed Feb 15, 2021

arianvp added a commit to wireapp/wire-server-deploy that referenced this pull request Feb 15, 2021

Add sftd configuration to wire-server values

243ee9f

Depends on wireapp/wire-server#1325

arianvp mentioned this pull request Feb 15, 2021

Add sftd configuration to wire-server values wireapp/wire-server-deploy#419

Closed

arianvp added a commit to wireapp/wire-server-deploy that referenced this pull request Feb 15, 2021

Add sftd configuration to wire-server values

1e83206

Depends on wireapp/wire-server#1325

arianvp and others added 11 commits February 15, 2021 22:23

charts/sftd: Add docs about gradual rollout

be45163

charts/sftd: Fix doc typo

d8871aa

charts/sftd: Use /etc/resolv.conf to discover DNS server

7034913

Some Kubernetes clusters (like Scaleway) name the DNS server `core-dns` rather than `kube-dns`. `/etc/resolv.conf` is guaranteed to point to the correct one, though.
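
The idea behind this commit, reading the resolver address from the pod's own `/etc/resolv.conf` rather than assuming a service name, can be illustrated with a small parser. This is a sketch of the technique only; the chart itself may do this differently, and the function name is hypothetical.

```python
from typing import List


def nameservers(resolv_conf_text: str) -> List[str]:
    """Return the addresses listed on `nameserver` lines of a resolv.conf file."""
    servers = []
    for raw in resolv_conf_text.splitlines():
        line = raw.strip()
        # Skip blank lines and comment lines (which start with '#' or ';').
        if not line or line[0] in "#;":
            continue
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "nameserver":
            servers.append(fields[1])
    return servers
```

Inside a pod this could be called as `nameservers(open("/etc/resolv.conf").read())`, which returns whatever resolver the cluster has configured, regardless of what the DNS service is called.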

charts/sftd: Remove TODO

15b57ee

charts/sftd: podManagementPolicy: Parallel

aa8104d

This makes it quicker to scale up and down, as sft is not actually stateful and does not require ordered restarts. We're just using a StatefulSet to get a persistent DNS name so we can join existing calls.
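
The persistent DNS names mentioned in this commit follow the standard Kubernetes pattern for StatefulSet pods behind a headless service: `<pod>.<service>.<namespace>.svc.<cluster domain>`. A minimal sketch, assuming a governing service named `sftd` and the `default` namespace (both hypothetical here):

```python
from typing import List


def pod_dns_names(statefulset: str, service: str, namespace: str,
                  replicas: int, cluster_domain: str = "cluster.local") -> List[str]:
    """Stable per-pod DNS names of a StatefulSet behind a headless service."""
    return [
        f"{statefulset}-{i}.{service}.{namespace}.svc.{cluster_domain}"
        for i in range(replicas)
    ]
```

Because the names depend only on the ordinal, they stay the same across restarts, and `podManagementPolicy: Parallel` changes only how quickly the pods are created or removed, not what they are called.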

charts/sftd: Add docs about scaling

d4fccca

charts/sftd: Add docs about routability

4531ea8

Apply suggestions from code review

7075d8a

Co-authored-by: Lucendio <gregor.jahn@wire.com>

Add a table with parameters and add future work

73b153b

We can make deployment of sftd a bit easier in the future. Added a section on this so I do not forget :)

Add sftd to wire-server umbrella chart

d504759

Also make the docs a lot nicer.

arianvp force-pushed the sftd-ha-2 branch from 5ebf0ad to d504759 on February 15, 2021 21:24

arianvp added 2 commits February 16, 2021 13:19

change docs to show that both standalone and umbrella is possible

ef440e2

Add note about tags

a798120

arianvp merged commit 3f819a6 into develop Feb 17, 2021

arianvp deleted the sftd-ha-2 branch February 17, 2021 18:04
