
Inconsistent cluster/mapping configurations are used when multiple mappings point to the same upstream service #3112

Closed
guoyiang opened this issue Dec 11, 2020 · 6 comments

Comments


guoyiang commented Dec 11, 2020

Describe the bug
When multiple mappings are configured to use the same upstream service, they all share the same Envoy cluster, even though each mapping can be created with different parameters. As a result, a mapping may behave differently from its configuration.

For example, Ambassador is proxying to an Envoy instance that is capable of performing gRPC transcoding. That Envoy reverse proxy serves both a RESTful API and a gRPC API. Two mappings are created, one for the REST API and the other for gRPC, each with a different prefix. The gRPC mapping has grpc: true configured, but that setting may be ignored when Ambassador configures the Envoy cluster behind the gRPC route, and gRPC requests fail.

To Reproduce

  1. Create mappings as follows:
apiVersion: getambassador.io/v1
kind: Mapping
metadata:
  name: test
spec:
  prefix: /helloworld
  rewrite: ""
  service: test-service
  connect_timeout_ms: 1000
---
apiVersion: getambassador.io/v1
kind: Mapping
metadata:
  name: testgrpc
spec:
  prefix: /helloworld.Greeter/
  rewrite: ""
  service: test-service
  grpc: true
  connect_timeout_ms: 5000
  2. Go to the Ambassador diagnostics page and check the testgrpc mapping. Mappings "test" and "testgrpc" are merged together, using the same cluster with the following configuration:
{
    "connect_timeout": "1.000s",
    "dns_lookup_family": "V4_ONLY",
    "lb_policy": "ROUND_ROBIN",
    "load_assignment": {
        "cluster_name": "cluster_test_service_dev",
        "endpoints": [
            {
                "lb_endpoints": [
                    {
                        "endpoint": {
                            "address": {
                                "socket_address": {
                                    "address": "test-service",
                                    "port_value": 80,
                                    "protocol": "TCP"
                                }
                            }
                        }
                    }
                ]
            }
        ]
    },
    "name": "cluster_test_service_dev",
    "type": "STRICT_DNS"
}

Actual behavior
gRPC requests fail. Testing the gRPC API using evans fails with the message "server closed the stream without sending trailers".

connect_timeout is set to 1s, which is not what is set in the mapping.

This is because the cluster behind the testgrpc mapping does NOT have the http2_protocol_options option. Envoy won't initiate an HTTP/2 connection to the upstream, which causes proxied gRPC requests to fail as described here. This can be observed in the Envoy debug log, where the http1 handler is used.

Expected behavior
Each mapping's Envoy cluster should be configured according to the configuration of the mapping itself.

The testgrpc mapping should work as expected with a 5s connect timeout, which means http2_protocol_options should be added to its Envoy cluster and connect_timeout set to 5s.

The test mapping should have a connect timeout of 1s.
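
For illustration, this is roughly what a cluster honoring the testgrpc mapping would look like. This is only a sketch based on the cluster dump above (the exact field layout depends on the Ambassador/Envoy version, and in practice it implies generating distinct clusters for the two mappings):

{
    "connect_timeout": "5.000s",
    "dns_lookup_family": "V4_ONLY",
    "http2_protocol_options": {},
    "lb_policy": "ROUND_ROBIN",
    "load_assignment": { ...same endpoints as above... },
    "name": "cluster_test_service_dev",
    "type": "STRICT_DNS"
}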

Versions (please complete the following information):

  • Ambassador: 1.8.1
  • Kubernetes environment: Azure Kubernetes Service
  • Version: [e.g. 1.18.10]

Additional context
A quick look inside this code suggests that the cluster config is cached by cluster name, and the cached cluster config is used as-is without verifying that all mappings pointing to it are configured the same way. According to this code, the configuration from the first mapping, sorted alphabetically, ends up being used.


guoyiang commented Dec 14, 2020

A few ways to work around this issue:

  1. Use different service values in the two mappings. This can be achieved by specifying the port differently: in our case, port 80 is added explicitly for the gRPC mapping (service: test-service:80), while the HTTP mapping has only the hostname with the port implicit (service: test-service). See the sketch after this list.

  2. Use cluster_tag to enforce a different cluster name for the gRPC mapping.
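
As a sketch of workaround 1, reusing the test-service mappings from the reproduction above, the gRPC mapping names the port explicitly so the two mappings no longer resolve to the same cluster:

apiVersion: getambassador.io/v1
kind: Mapping
metadata:
  name: test
spec:
  prefix: /helloworld
  rewrite: ""
  service: test-service        # implicit port
  connect_timeout_ms: 1000
---
apiVersion: getambassador.io/v1
kind: Mapping
metadata:
  name: testgrpc
spec:
  prefix: /helloworld.Greeter/
  rewrite: ""
  service: test-service:80     # explicit port, so a different cluster name is generated
  grpc: true
  connect_timeout_ms: 5000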


stale bot commented Feb 14, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the stale label Feb 14, 2021
@dvaldivia

@guoyiang can you share what you mean by cluster_tag?

stale bot removed the stale label Feb 26, 2021
@guoyiang

@dvaldivia I don't recall exactly what I did back then, but looking at the docs, cluster_tag is an attribute that can be added to a Mapping to customize the generated cluster name. When present, it enforces a different cluster name, which works around this issue.

Using cluster_tag
If the cluster_tag attribute is present, its value will be prepended to cluster names generated from the Mapping. This provides a simple mechanism for customizing the cluster name when working with metrics.
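
As a sketch of that workaround applied to the gRPC mapping from the reproduction above (the tag value grpc is just an illustrative choice):

apiVersion: getambassador.io/v1
kind: Mapping
metadata:
  name: testgrpc
spec:
  prefix: /helloworld.Greeter/
  rewrite: ""
  service: test-service
  grpc: true
  connect_timeout_ms: 5000
  cluster_tag: grpc   # prepended to the generated cluster name, so this mapping gets its own cluster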

@cindymullins-dw

Closing as there is an apparent workaround. If the issue persists on 2.x please reopen.


juanjoku commented Mar 2, 2023

But then... is it still mandatory to use cluster_tag with Emissary 3.x as a workaround for this problem?
It is not clear to me whether it has been fixed in any version.

Thx!
