gRPC RESOURCE_EXHAUSTED w. "Received message larger than max" with bumped max_{response, request}_size config.
#1219
Comments
@hampusrosvall Looks fine with the configurations and from the torchserve logs, it seems the request size and response size are set to |
Hi, I am experiencing the same issue -- my grpc client-side channel is set up like so:
and my config.properties contains the following lines:
and yet when I try to run inference I get the response:
It would appear that the size gets set on the netty server but not the actual grpc server. |
If you are using TS_CONFIG_FILE ENV var then there is an open issue on reading this env var. #1257 |
@jagadeeshi2i I don't have that env var set -- I set ts config from the command line and nowhere else:
|
check torchserve startup logs if |
@jagadeeshi2i Yes; the max_request_size and max_response_size are indeed set --
The settings are being recognized and honored for HTTP payload changes. I tested this. The issue is that these changes do not seem to be propagated down to the gRPC server. |
The request size is set from config.properties; try changing the client code. |
The root cause of the error "Received message larger than max" is on the client side. Please check Stack Overflow. |
Making the following changes in torchserve_grpc_client.py solved the issue:
|
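For anyone landing here, a minimal sketch of the kind of client-side change discussed above, assuming the Python `grpcio` package. It is not the exact patch to torchserve_grpc_client.py. The target `localhost:7070`, the 64 MiB limit, and the helper names `channel_options` and `open_channel` are illustrative assumptions, not taken from this thread. The idea is that a gRPC channel applies its own message-size limits (the 4194304 bytes in the error is 4 MiB) independently of the server's max_request_size / max_response_size, so the client has to raise them through channel options:

```python
# Illustrative limit only; pick a value that matches the server's
# max_request_size / max_response_size settings.
MAX_MESSAGE_BYTES = 64 * 1024 * 1024


def channel_options(max_bytes=MAX_MESSAGE_BYTES):
    """Build gRPC channel args that raise the client-side size limits."""
    return [
        ("grpc.max_send_message_length", max_bytes),
        ("grpc.max_receive_message_length", max_bytes),
    ]


def open_channel(target="localhost:7070", max_bytes=MAX_MESSAGE_BYTES):
    """Open an insecure channel with the raised limits (hypothetical helper)."""
    import grpc  # provided by the grpcio package

    return grpc.insecure_channel(target, options=channel_options(max_bytes))
```

The channel returned by `open_channel()` would then be passed to the generated inference stub in place of a channel created without options.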
Context
Your Environment
VisionHandler
(more details below)[If public url then provide link.]: local
enable_envvars_config=true in config.properties
Expected Behavior
By setting the max_response_size config parameter, I should be able to send messages over gRPC that are smaller than or equal to the parameter value.
Current Behavior
I am querying torchserve using gRPC and on the client side I am getting the following error:
I am using the torchserve:0.4.1-gpu docker image.
I should be able to bump the max_response_size config parameter and hence send messages smaller than or equal to that size. Any idea how I can solve this?
Some additional information on my settings:
I am using the following environment variables for the server
and the config.properties has the enable_envvars_config flag set to true.
In the official documentation the default size should be 6553500, however the error message provided above hints that the default size is 4194304.
Here are the logs upon startup of the torchserve service