Slowdown encountered when reading request message from async server C++ API #23231

Closed
tanmayv25 opened this issue Jun 16, 2020 · 3 comments

tanmayv25 commented Jun 16, 2020

What version of gRPC and what language are you using?

v1.24.0, C++

What operating system (Linux, Windows,...) and version?

Linux, Ubuntu 18.04.1

What runtime / compiler are you using (e.g. python version or version of gcc)

gcc version 7.4.0

What did you do?

Let's assume the following two request message formats for specifying requests:
Version 1:

message ModelInferRequest {
  // ...
  repeated bytes raw_contents = 1;
}

Version 2:

message ModelInferRequest {
  // ...
  repeated InferInputTensor inputs = 1;
  // ...
}
message InferInputTensor {
  // ...
  InferTensorContents contents = 1;
}
message InferTensorContents {
  bytes raw_contents = 1;
  // ...
}

Version 1 contains the repeated bytes field directly in the top-level message, whereas Version 2 nests the bytes field inside a sub-message.

I have a client that keeps sending the same request message repeatedly. The server is designed using the C++ async API. As described in one of the examples, we use the corresponding Request function to register the RPC and then call Next on the completion queue to wait for activity.
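For reference, a minimal sketch of that pattern, assuming a service named GRPCInferenceService with an RPC ModelInfer(ModelInferRequest) returns (ModelInferResponse); the service name, RPC name, response type, and generated header are illustrative assumptions, not taken from this issue:

#include <grpcpp/grpcpp.h>
#include "inference.grpc.pb.h"  // hypothetical generated header (assumption)

// Register for one ModelInfer call and wait until its request has been read.
void WaitForOneRequest(inference::GRPCInferenceService::AsyncService* service,
                       grpc::ServerCompletionQueue* cq) {
  grpc::ServerContext ctx;
  inference::ModelInferRequest request;
  grpc::ServerAsyncResponseWriter<inference::ModelInferResponse> responder(&ctx);

  // Register interest in the next incoming RPC; the tag identifies this call
  // when it later pops out of the completion queue.
  service->RequestModelInfer(&ctx, &request, &responder, cq, cq,
                             reinterpret_cast<void*>(1));

  // The "Wait + Read" interval measured below is roughly the time between the
  // Request* call above and this Next() returning, at which point `request`
  // has been deserialized from the wire.
  void* got_tag;
  bool ok;
  cq->Next(&got_tag, &ok);
  // ... process `request` and eventually call responder.Finish(...) ...
}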

With a large raw_contents size (77MB), I am observing a significant average slowdown of 28ms when moving from message format Version 1 to message format Version 2, measured from registering the request to detecting the activity on the completion queue. The client ensures that requests are generated at the same rate in both cases. My guess is that gRPC is suboptimal in reusing a protobuf request message with large nested sub-messages.

What did you expect to see?

Similar performance on the two message formats. The gRPC C++ async API should reuse the message effectively.

What did you see instead?

An average slowdown of ~28ms when reading the message from the wire.
Version 1:
Wait + Read (avg): 118036.41362179487us
Version 2:
Wait + Read (avg): 146169.65345528454us

Anything else we should know about your project / environment?

@markdroth
Member

I'm not sure if this is a gRPC issue or a protobuf issue. Esun and/or Vijay, can one of you please advise?

@tanmayv25
Author

tanmayv25 commented Jun 17, 2020

The numbers reported above were observed when the protobuf message was cleared (by calling Clear()) between calls. I assumed that perhaps not calling Clear() on the request message would help. However, not calling Clear() on the message before passing it to RequestAsyncCall degraded performance further.
Version 2:
Wait + Read (avg): 193443us
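To make the two variants concrete, here is a hedged sketch of where Clear() sits in the serving loop, reusing the same hypothetical names as the sketch above:

// Reuse one request object across iterations (names are illustrative).
void ServeLoop(inference::GRPCInferenceService::AsyncService* service,
               grpc::ServerCompletionQueue* cq) {
  inference::ModelInferRequest request;  // reused across calls
  while (true) {
    request.Clear();  // variant behind the originally reported numbers;
                      // omitting this call is the ~193ms variant above
    grpc::ServerContext ctx;
    grpc::ServerAsyncResponseWriter<inference::ModelInferResponse> responder(&ctx);
    service->RequestModelInfer(&ctx, &request, &responder, cq, cq,
                               reinterpret_cast<void*>(1));
    void* got_tag;
    bool ok;
    if (!cq->Next(&got_tag, &ok)) break;  // "Wait + Read" ends here
    // ... handle `request`, then send the reply via responder.Finish(...) ...
  }
}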

@veblush
Contributor

veblush commented May 23, 2022

I don't think gRPC behaves differently based on the proto messages. Serialization and deserialization are done by protobuf, so I suspect this is more likely protobuf behaving differently depending on the message definition.
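One way to check that hypothesis is to time the protobuf parse alone, outside gRPC, for Version 1 and Version 2 payloads of equal size. A minimal sketch, assuming the same hypothetical generated header as above:

#include <chrono>
#include <string>
#include "inference.pb.h"  // hypothetical generated header (assumption)

// Time a single ParseFromString() call, i.e. pure protobuf deserialization
// with no gRPC involved.
double ParseMicros(const std::string& wire_bytes) {
  inference::ModelInferRequest request;
  auto start = std::chrono::steady_clock::now();
  request.ParseFromString(wire_bytes);
  auto end = std::chrono::steady_clock::now();
  return std::chrono::duration<double, std::micro>(end - start).count();
}

// If Version 2 payloads of the same total size parse ~28ms slower than
// Version 1 payloads here, the gap is in protobuf rather than in gRPC.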

@veblush veblush closed this as completed May 23, 2022