
Bump gRPC max tx/rx to 100MB for ingester and distributor #149

Open · wants to merge 2 commits into base: main
Conversation

amckinley
Contributor

This just changes the send and receive limits to match what was configured in the query-frontend (and changes the definition of 100MB from `100 << 20` to `1024 * 1024 * 100`). My logs are full of `ResourceExhausted desc = trying to send message larger than max`, and I'm guessing it was just an oversight that the ingester and distributor didn't get their default limits bumped to match.
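For what it's worth, the two spellings denote exactly the same byte count, so the second part of the change is purely cosmetic. A quick standalone check (plain Go, independent of the Cortex codebase):

```go
package main

import "fmt"

func main() {
	const shifted = 100 << 20             // 100 MiB expressed as a bit shift
	const multiplied = 1024 * 1024 * 100  // the same value, spelled out

	// Both evaluate to 104857600 bytes.
	fmt.Println(shifted, multiplied, shifted == multiplied)
}
```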

@amckinley amckinley requested a review from a team as a code owner July 29, 2020 00:40
@pracucci
Collaborator

Thanks @amckinley for opening this PR and raising the discussion around the gRPC message size limit. The current config is not an oversight; we're aware of cases where the limit can be hit. Let me take a step back.

Cortex internally uses gRPC to communicate between its services. Different services use gRPC to transfer different types of data; for some communication we use gRPC streaming (which suffers less from this issue) and for others we don't.

In your setup you can increase the limits as a quick workaround, but I don't think it's wise to raise all the limits to 100MB by default. Instead, we should understand which channel hits the limit, and why. If this happens between the ingester and querier when running the blocks storage, then it's a known issue we want to work on (cortexproject/cortex#2945), so my suggestion would be to override it in your setup but not change the default here. If it happens anywhere else, please let us know where so we can investigate further.
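For reference, a per-deployment override along these lines is what's being suggested (a YAML sketch; the field names assume the `server` config block Cortex embeds from weaveworks/common, so check the configuration reference for your Cortex version):

```yaml
server:
  # Raise the server-side gRPC message limits for this deployment only,
  # instead of changing the project-wide defaults.
  grpc_server_max_recv_msg_size: 104857600  # 100 MiB
  grpc_server_max_send_msg_size: 104857600  # 100 MiB
```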

@amckinley
Contributor Author

Hi @pracucci, what is the purpose of these limits? It doesn't look like Cortex is capable of "chunking" any of the data it returns, so hitting these limits just causes hard failures. In my deployment, I've been forced to keep increasing these limits every time I find a new Grafana dashboard that refuses to render because of the max gRPC message size. Most recently we hit `grpc: trying to send message larger than max (219597294 vs. 104857600)` when trying to render a dashboard that parameterizes on k8s namespace, in a cluster where we have ~1000 unique namespaces. Wouldn't it be better to leave these limits uncapped everywhere?
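To put the numbers from that error message in perspective: the rejected response was more than double the 100 MiB limit, so no single bump of the default would have been a durable fix. A standalone check (plain Go):

```go
package main

import "fmt"

func main() {
	const limit = 104857600     // 100 MiB, the configured max message size
	const attempted = 219597294 // rejected response size, from the error above

	// The response is roughly 2.09x the configured limit.
	fmt.Printf("attempted/limit = %.2fx\n", float64(attempted)/float64(limit))
}
```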

Base automatically changed from master to main March 3, 2021 14:44

Labels: none yet
Projects: none yet
3 participants