BGP with enterprise VPNs use case #60

iawells · 2021-02-08T14:04:31Z

No use case template yet, so this will want updating when it's committed.

electrocucaracha

minimal changes

use-case/bgp-customer.md

iawells · 2021-02-08T21:07:33Z

Thinking about this, assets should be in a subdirectory.

Two possibilities:

An .md in the main folder and a subdirectory without the .md extension.
An index.md in the subfolder along with its assets

use-case/bgp-enterprise/README.md

iawells

I beat you to it

iawells · 2021-02-08T21:46:59Z

I think we'll undo the markdown linting, there. We can add it separately. Also, I cocked it up and it doesn't run.

iawells · 2021-02-08T21:48:42Z

(removed, despite the record in the review comments)

taylor

lgtm

iawells · 2021-02-08T21:51:13Z

Yeah, I think we're now down to the abstract 'is this how we want a user story to look' - for which we could use the template for the next step.

jeffsaelens

Interesting starting point. In addition to BGPs allergy to NAT, there is also the convergence issue, and the time it takes for BGP to get to a "happy" status upon standup, or recovery.

This LGTM, but as a side question, is there a possible future where we say BGP isn't a good candidate for this style of packaging and hosting? I'm picking here solely because its the first use-case, but I'm curious... do we have a threshold for how much we try and get a use case to work before we say "maybe this is a bad idea?", or do we keep engineering until we beat BGP into submission?

iawells · 2021-02-09T16:32:52Z

Interesting starting point. In addition to BGPs allergy to NAT, there is also the convergence issue, and the time it takes for BGP to get to a "happy" status upon standup, or recovery.

It's an interesting point. We discuss recovery from failure, but the consequences of failure have to be considered in this. What we're really saying with more conventional apps (e.g. web services) is that no meaningful consequences if a component fails as long as we're ready to accept another request. Here, we have different consequences for failure and we have to see if it matters.

is there a possible future where we say BGP isn't a good candidate for this style of packaging and hosting ?[...] do we have a threshold for how much we try and get a use case to work before we say "maybe this is a bad idea?"

I think we can keep judgement out of this and say with an even hand 'this is the best that this allows us to do, take it or leave it'. It may not suit the use cases it used to; it might be better for other ones (e.g. much bigger RIBs).

Bear in mind, with BGP, that it does fail when whatever it runs on dies, and it always has - by 'fail', I mean 'withdraw all routes'. There's nothing new about that.

The point about clouds that we perhaps forget, is that failures are more likely - because we (theoretically) buy cheaper servers and use more fragile equipment, there are more points of failure, and we also expect ops activities like upgrades to be more frequent and more disruptive (killing containers during an upgrade is fine, e.g.). We are supposed to use the new tools to minimise the consequences of them. But failures are still monthly, not hourly, so they might be acceptable without doing more.

The overall resilience would depend several factors. We can get a replacement BGP server running very quickly, even if the original dies - that's new. We can make the RIB live in a distributed database, so a new process can use GR as it subs in for an old one - that's new too. And no-one builds their network to rely on one BGP server always running, so BGP never had to be 100% even in the before times.

We can use this sort of judgement method with whatever we're doing. This is how it was before, this is how it is now - better in some ways, worse in others - and now you-the-user decide if that's worth having. Using cloud native does have benefits; they're just not as straightforward, so we have to consider this a bit more closely.

[I'm wondering whether this discussion wants putting somewhere where people will find it.]

electrocucaracha · 2021-02-09T19:24:19Z

This use case covers different alternatives to address how to implement a BGP server in a kube-native way, but it's not referring to CNI Multiplexers like Multus, DANM or NSM, I just wondering if this was on purpose.

iawells · 2021-02-09T19:32:19Z

I'm trying to separate requirements from design - and, I admit, not being 100% consistent at it while we work out what the rules should be, so feedback on that would be useful. What I've tried to say here is: if we assume only what we'd get from a standard k8s install - so that's default CNI behaviour without extensions (as in, literally, 'what all CNIs are documented to do', without having to choose one that implements CNI-and-then-some) and we want to solve this use case, then we have shortcomings and they're worth writing up. And then I've stopped.

Applying Multus or DANM to this would then be the next step - a design question, separate from the use case and its implied requirements, and a means to test if that system design actually solves the requirements of this use case - and that belongs somewhere else

It needs to be somewhere else because there could be more than just those two solutions to consider. I could apply other technologies to it (NSM, appropriately trained cockroaches moving packets in little envelopes, other solutions we haven't considered or written yet) and, just the same, measure if they do well or badly for this use case.

Ultimately our best practice, if we choose one, should be the one that ticks the most boxes this and the other use cases; and we get to document the shortcomings too because they should become clear.

Thus: use case -> unsatisfied requirements -> bunch of design proposals -> best practice.

electrocucaracha · 2021-02-09T19:52:07Z

Agree, not only as a way to highlight the unsatisfied requirements, it also promotes the portability of CNFs. In the other hand, it's tricky to define a standard K8s deployment, given that Multus implements CNI methods (ADD, DEL, CHECK and VERSION) but relays on the pre-creation of overlay networks to operate properly. So maybe the criteria for a standard K8s deployment is what we can get from SPs, isn't it?

xmulligan · 2021-02-11T10:09:11Z

Just an idea, there are a lot of acronyms in there (and networking in general). For someone coming with a k8s but not networking background, it may look like alphabet soup. Should we require that all acronyms are defined once at the first instance of it in each doc or have a separate glossary? I would actually be in favor of the former as it makes for a more continuous reading experience and a glossary is extra work where most people would just google it.

Low-rent .gitignore file. Ignores typical in-edit and backup files for text editors.

A multi-VRF BGP speaker use case describing a reasonably common use case where two isolated networks are in use and the BGP protocol is the aim of the network function.

Co-authored-by: Victor Morales <chipahuac@hotmail.com>

Restructure into a subdirectory to keep document with its assets

Provide a reference section including any acronyms that we use that might throw people, and a discussion of what we can reasonably expect from any Kubernetes platform that has not been specifically tailored for NFV.

iawells · 2021-02-11T16:43:24Z

Rebased, added a 'context' section (none of the rest is changed but this should address both the acronyms and the 'what are we comparing with' questions; they might be common to all use cases, but we can deal with the consequences of that later).

use-case/bgp-enterprise/README.md

Markdown usage issue. Without blank lines, it runs into one big paragraph.

rannyh

Good problem statement

iawells · 2021-02-12T17:58:36Z

Multus implements CNI methods (ADD, DEL, CHECK and VERSION)

So the thing about CNIs is that what you say is true; this is there interface to the platform that they have to provide to be \a CNI, and they must all implement this. But there's documented expected behaviour they offer to the platform consumer, which is the more important thing to us.

Things like Calico, Cilium, Flannel and Weave offer - mostly - just that base behaviour. If you want to write a portable app, you would stick to the core functionality that they all have in common.

Multus also offers that behaviour, but most of what it does that is interesting for our use case is an extension of that that is specific to Multus. And Multus (or equivalent functionality) is not going to be found in a k8s deployment you pick at random, so we're not really judging the suitability of what we can reasonably guarantee to find. It's not common best practice to deploy Multus.

So Multus is not a great place to start from with a comparison perspective, but it's a great thing to bring in at the design step. "If we said it was a CNF best practice to expect a Multus-enabled platform, then..." And then we can test this theory against other designs with other conditions.

electrocucaracha reviewed Feb 8, 2021

View reviewed changes

use-case/bgp-customer.md Outdated Show resolved Hide resolved

use-case/bgp-customer.md Outdated Show resolved Hide resolved

taylor self-requested a review February 8, 2021 16:20

xmulligan requested review from vukg and removed request for taylor February 8, 2021 16:20

taylor requested changes Feb 8, 2021

View reviewed changes

use-case/bgp-enterprise/README.md Outdated Show resolved Hide resolved

iawells commented Feb 8, 2021

View reviewed changes

taylor requested review from fkautz, jeffsaelens and nickolaev February 8, 2021 21:29

iawells force-pushed the usecase-bgp-enterprise branch from e00575d to ce32158 Compare February 8, 2021 21:47

taylor requested review from taylor and electrocucaracha February 8, 2021 21:50

taylor approved these changes Feb 8, 2021

View reviewed changes

jeffsaelens approved these changes Feb 9, 2021

View reviewed changes

iawells and others added 7 commits February 11, 2021 08:31

Add .gitignore

4e1c708

Low-rent .gitignore file. Ignores typical in-edit and backup files for text editors.

Customer-BGP use case

44c9510

A multi-VRF BGP speaker use case describing a reasonably common use case where two isolated networks are in use and the BGP protocol is the aim of the network function.

Update use-case/bgp-customer.md

59bee49

Co-authored-by: Victor Morales <chipahuac@hotmail.com>

Update use-case/bgp-customer.md

86f45d0

Co-authored-by: Victor Morales <chipahuac@hotmail.com>

Move files around

50316d7

Restructure into a subdirectory to keep document with its assets

Filename capitalisation mismatch

3f85d1a

Add a context section

ff252aa

Provide a reference section including any acronyms that we use that might throw people, and a discussion of what we can reasonably expect from any Kubernetes platform that has not been specifically tailored for NFV.

iawells force-pushed the usecase-bgp-enterprise branch from 82426bf to ff252aa Compare February 11, 2021 16:34

rannyh reviewed Feb 11, 2021

View reviewed changes

use-case/bgp-enterprise/README.md Show resolved Hide resolved

Update glossary formatting

9e984ac

Markdown usage issue. Without blank lines, it runs into one big paragraph.

rannyh approved these changes Feb 11, 2021

View reviewed changes

taylor added the use case label Feb 11, 2021

xmulligan self-requested a review February 22, 2021 16:19

xmulligan approved these changes Feb 22, 2021

View reviewed changes

electrocucaracha approved these changes Feb 22, 2021

View reviewed changes

xmulligan merged commit 221405c into lfn-cnti:master Feb 22, 2021

iawells deleted the usecase-bgp-enterprise branch February 22, 2021 16:23

claudiobartolini approved these changes Feb 22, 2021

View reviewed changes

xmulligan mentioned this pull request Mar 16, 2021

BGP use case numbering #85

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BGP with enterprise VPNs use case #60

BGP with enterprise VPNs use case #60

iawells commented Feb 8, 2021

electrocucaracha left a comment

iawells commented Feb 8, 2021

iawells left a comment

iawells commented Feb 8, 2021

iawells commented Feb 8, 2021

taylor left a comment

iawells commented Feb 8, 2021

jeffsaelens left a comment

iawells commented Feb 9, 2021

electrocucaracha commented Feb 9, 2021

iawells commented Feb 9, 2021

electrocucaracha commented Feb 9, 2021

xmulligan commented Feb 11, 2021

iawells commented Feb 11, 2021 •

edited

Loading

rannyh left a comment

iawells commented Feb 12, 2021

BGP with enterprise VPNs use case #60

BGP with enterprise VPNs use case #60

Conversation

iawells commented Feb 8, 2021

electrocucaracha left a comment

Choose a reason for hiding this comment

iawells commented Feb 8, 2021

iawells left a comment

Choose a reason for hiding this comment

iawells commented Feb 8, 2021

iawells commented Feb 8, 2021

taylor left a comment

Choose a reason for hiding this comment

iawells commented Feb 8, 2021

jeffsaelens left a comment

Choose a reason for hiding this comment

iawells commented Feb 9, 2021

electrocucaracha commented Feb 9, 2021

iawells commented Feb 9, 2021

electrocucaracha commented Feb 9, 2021

xmulligan commented Feb 11, 2021

iawells commented Feb 11, 2021 • edited Loading

rannyh left a comment

Choose a reason for hiding this comment

iawells commented Feb 12, 2021

iawells commented Feb 11, 2021 •

edited

Loading