good practice for gateway peer loadbalancing #257

davidkel · 2021-10-19T14:19:27Z

We should provide guidance on this as part of the samples/documentation. Some considerations are:

load balancers in a K8s environment
client handling the balancing itself

Please feel free to add more thoughts and info worth capturing. Where appropriate, artifacts demonstrating this (e.g. samples, docs) should be created.

bestbeforetoday · 2021-10-19T14:45:24Z

These pieces of gRPC documentation may be relevant:

We should definitely have some documentation on deployment patterns and best practices with Fabric Gateway. Where appropriate we should reference existing documentation (like the gRPC ones above) instead of duplicating or recreating that information.

davidkel · 2021-10-20T09:07:06Z

Load balancing will not guarantee that the initial endorsement is balanced, because the gateway decides which peer to use based on block height, although it does favour the gateway peer. Under load, the gateway's information on block height could be stale.
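To make the point above concrete, here is a minimal sketch of one possible reading of that selection rule. It is not the Fabric Gateway implementation: the `PeerInfo` type, the `chooseEndorser` function, and the exact tie-breaking behaviour are all assumptions made for illustration.

```typescript
// Hypothetical model of a peer as seen by the gateway; not a Fabric API type.
interface PeerInfo {
  name: string;
  isGatewayPeer: boolean;   // true for the peer the client connected to
  knownBlockHeight: number; // last height the gateway learned; may be stale
}

// Assumed rule: consider only peers at the highest known block height,
// and among those favour the gateway peer itself.
function chooseEndorser(peers: PeerInfo[]): PeerInfo {
  if (peers.length === 0) {
    throw new Error("no peers available");
  }
  const maxHeight = Math.max(...peers.map((p) => p.knownBlockHeight));
  const candidates = peers.filter((p) => p.knownBlockHeight === maxHeight);
  return candidates.find((p) => p.isGatewayPeer) ?? candidates[0];
}
```

Under this reading, a client that the load balancer connects to peer1 still has its proposal endorsed by peer0 whenever the gateway's (possibly stale) view says peer1 is behind, so evenly balanced connections do not imply evenly balanced endorsement.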

davidkel · 2021-11-08T13:40:12Z

@mbwhite @jkneubuh This might be a good place to capture any relevant info

jkneubuh · 2021-11-08T18:46:35Z

On the Kubernetes front there are a few different approaches that we can employ to shape traffic between the gateway client and peers. All of these will provide some level of HA, failover, and traffic distribution across a set of peers.

Gateway load balancing within Kubernetes can be accomplished by establishing a Service whose label selector matches the Pods of multiple peer Deployments. Provided that the TLS certificates for the peers have been issued with a common SAN in the signing request, a single k8s Service can act as a front-end to multiple peers using a common DNS name. When establishing a TCP connection from the client to the gateway peer, Kubernetes will use the Service to route the connection to one of the Pods bound to the Service.

One downside of using the Kubernetes Service routing is that any finer-grained message routing, e.g. at the gRPC message layer, is not possible. Kubernetes can help with the initial assignment of a TCP connection to one of the backing peers, but once a client connection is established it stays bound to that peer for the lifetime of the socket.

By default a Kubernetes Service will use iptables to bind a client connection to a peer pod using random assignment. The pods backing a Service instance can be monitored via Readiness Probes, ensuring that only "ready" pods receive gRPC handshakes from the gateway client SDK.
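The consequence of connection-level assignment can be illustrated with a small simulation. This is only a sketch under stated assumptions: `PinnedConnection`, `countCalls`, and `randomPick` are hypothetical names, and the model ignores reconnects, readiness changes, and everything else kube-proxy actually does.

```typescript
// Strategy for choosing a backend at connect time (hypothetical type).
type Picker = (backends: string[]) => string;

// Models a client connection made through a Service: the backend is chosen
// once, when the connection is opened, and never changes afterwards.
class PinnedConnection {
  readonly backend: string;

  constructor(backends: string[], pick: Picker) {
    if (backends.length === 0) {
      throw new Error("no ready backends");
    }
    this.backend = pick(backends);
  }

  // Every call made on this connection reaches the same backend.
  call(): string {
    return this.backend;
  }
}

// Count how many calls each backend receives across a set of connections.
function countCalls(
  connections: PinnedConnection[],
  callsPerConnection: number
): Map<string, number> {
  const counts = new Map<string, number>();
  for (const conn of connections) {
    for (let i = 0; i < callsPerConnection; i++) {
      const backend = conn.call();
      counts.set(backend, (counts.get(backend) ?? 0) + 1);
    }
  }
  return counts;
}

// Stand-in for the random assignment described above.
const randomPick: Picker = (backends) =>
  backends[Math.floor(Math.random() * backends.length)];
```

With a single `PinnedConnection` created using `randomPick`, every call lands on one peer regardless of how many calls are made. Load only spreads out when many clients (or many connections) are opened, which is why a long-lived gateway client connection does not rebalance by itself.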

Building on the iptables routing, it looks like kube-proxy can also be run in IPVS mode to further shape how connections to a Service are distributed:

IPVS provides more options for balancing traffic to backend Pods; these are:

rr: round-robin
lc: least connection (smallest number of open connections)
dh: destination hashing
sh: source hashing
sed: shortest expected delay
nq: never queue
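For readers unfamiliar with these scheduler names, the following sketch models three of them (rr, lc, and sh) in simplified form. It is not how IPVS is implemented in the kernel: the `Backend` type, the function names, and in particular the toy string hash used for source hashing are assumptions for illustration only.

```typescript
// Hypothetical view of a backend pod and its current load.
interface Backend {
  name: string;
  openConnections: number;
}

// rr: hand out backends in order, wrapping around at the end of the list.
class RoundRobin {
  private next = 0;

  pick(backends: Backend[]): Backend {
    if (backends.length === 0) {
      throw new Error("no backends");
    }
    const chosen = backends[this.next % backends.length];
    this.next = (this.next + 1) % backends.length;
    return chosen;
  }
}

// lc: choose the backend with the fewest open connections
// (the first such backend wins a tie).
function leastConnection(backends: Backend[]): Backend {
  if (backends.length === 0) {
    throw new Error("no backends");
  }
  return backends.reduce((best, b) =>
    b.openConnections < best.openConnections ? b : best
  );
}

// sh: map a client address to a backend with a deterministic hash, so the
// same source always reaches the same backend while the list is unchanged.
function sourceHash(clientIp: string, backends: Backend[]): Backend {
  if (backends.length === 0) {
    throw new Error("no backends");
  }
  let hash = 0;
  for (let i = 0; i < clientIp.length; i++) {
    hash = (hash * 31 + clientIp.charCodeAt(i)) >>> 0; // keep as uint32
  }
  return backends[hash % backends.length];
}
```

All of these still operate when a connection is opened, so the choice of scheduler changes which peer a new connection lands on, not where individual gRPC calls on an existing connection go.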

In addition, the Service spec includes a sessionAffinity attribute (not specific to IPVS mode) which can be set to "ClientIP", ensuring that connections from a particular client IP address are routed to the same peer pod.

Building on the Kubernetes defaults, we could consider an additional layer of traffic shaping by co-deploying a Fabric network within a service mesh, as provided by Ambassador, Istio, or Linkerd. This approach may be a little more involved and not generally applicable to all environments running a Fabric network on K8s.

In addition to finding a home on the Fabric docs site, I like the idea of including a reference deployment within the Kubernetes test network. Do we have a reference example app showing the best practices for authoring a Fabric application using the Gateway SDK? (e.g. fabric-rest-sample, but using the Gateway SDK)

mbwhite · 2021-11-10T10:12:35Z

@jkneubuh I can create a PR on the test-network-k8s that includes the changes along these lines. Ironically they are relatively minor changes, but I guess that shows the power of K8S.

(see hyperledger/fabric-samples#532)

I believe there is an example planned for the gateway; meanwhile I'll refer you to the LedgerMessaging IBM example.

bestbeforetoday · 2022-03-18T15:45:05Z

@mbwhite @jkneubuh Have we done all we need (or realistically plan to do for now) on this issue? If so, I'll close it. If not, what needs doing and what's the outlook on that?

mbwhite · 2022-03-18T15:47:24Z

@bestbeforetoday I believe we've done all we can at the moment.

jkneubuh · 2022-03-18T15:55:56Z

I still have a draft PR (lingering) open for the peer load balancing in k8s. It's got some good info but is still too "kube specific" in the context where it's currently anchored in the docs.

Mark please leave this one open. I will connect with Josh (H) on finding the correct page / site for the doc content.

bestbeforetoday · 2022-05-20T17:46:21Z

Another potentially useful snippet of information on how to configure client-side load balancing over a set of IP addresses using the Node gRPC client:

grpc/grpc-node#1307 (comment)

A key piece of information is that grpc-js now supports ipv4: and ipv6: address schemes, which allow multiple target IP addresses to be specified for a client connection.

https://github.com/grpc/grpc/blob/master/doc/naming.md
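As a rough illustration of the address scheme mentioned above, the sketch below builds an `ipv4:` target string in the comma-separated form described in the gRPC naming document, together with a channel option requesting round-robin balancing. The helper names (`ipv4Target`, `roundRobinChannelOptions`) and the example addresses are hypothetical, and the use of the `grpc.service_config` option reflects my understanding of grpc-js rather than anything stated in this thread, so it is worth checking against the grpc-js documentation.

```typescript
// Hypothetical description of a peer endpoint; host must be an IPv4 literal.
interface PeerAddress {
  host: string;
  port: number;
}

// Build a target of the form "ipv4:addr1:port,addr2:port,...".
function ipv4Target(peers: PeerAddress[]): string {
  if (peers.length === 0) {
    throw new Error("at least one peer address is required");
  }
  return "ipv4:" + peers.map((p) => `${p.host}:${p.port}`).join(",");
}

// Channel options asking the client to spread calls across all resolved
// addresses instead of using only the first one that connects.
function roundRobinChannelOptions(): Record<string, string> {
  return {
    "grpc.service_config": JSON.stringify({
      loadBalancingConfig: [{ round_robin: {} }],
    }),
  };
}

// Example values only; these addresses are placeholders.
const exampleTarget = ipv4Target([
  { host: "10.0.0.1", port: 7051 },
  { host: "10.0.0.2", port: 7051 },
]);
const exampleOptions = roundRobinChannelOptions();
```

With `@grpc/grpc-js`, values like `exampleTarget` and `exampleOptions` would be passed as the address and options arguments when constructing the gRPC client that is then handed to the Fabric Gateway client API. TLS hostname verification needs separate consideration when connecting by IP address.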

This should probably be included in a sample somewhere, or at least just in the API docs.

bestbeforetoday · 2023-01-23T11:09:05Z

Best-practice recommendations for production deployment, including a discussion of load balancing and fail-over, are published in the full-stack-asset-transfer-guide sample.

bestbeforetoday added the "documentation" label (Improvements or additions to documentation) Oct 19, 2021

denyeart assigned jkneubuh and mbwhite Nov 16, 2021

jkneubuh mentioned this issue Nov 19, 2021: Fix Issue #257 by including additional details on Kube Service routing hyperledger/fabric#3065 (Draft)

bestbeforetoday unassigned mbwhite Mar 19, 2022

bestbeforetoday mentioned this issue Nov 15, 2022: too many requests for /gateway.Gateway, exceeding concurrency limit (500) #506 (Closed)

bestbeforetoday closed this as completed Jan 23, 2023
