You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is the relevant traceback when the server disconnects while streaming:
File "...\aesara_federated\common.py", line 131, in evaluate
logp, *gradients = self._client.evaluate(*inputs, use_stream=use_stream)
File "...\aesara_federated\service.py", line 203, in evaluate
output = loop.run_until_complete(eval_task)
File "...\aefenv\lib\asyncio\base_events.py", line 646, in run_until_complete
return future.result()
File "...\aesara_federated\service.py", line 219, in _streamed_evaluate
response = await self._lazy_stream.recv_message()
File ...\aefenv\lib\site-packages\grpclib\client.py", line 427, in recv_message
with self._wrapper:
File "...\aefenv\lib\site-packages\grpclib\utils.py", line 70, in __exit__
raise self._error
grpclib.exceptions.StreamTerminatedError: Connection lost
And this is the error when each request is sent as an independent message:
File "...\aesara_federated\common.py", line 131, in evaluate
logp, *gradients = self._client.evaluate(*inputs, use_stream=use_stream)
File "...\aesara_federated\service.py", line 203, in evaluate
output = loop.run_until_complete(eval_task)
File "...\aefenv\lib\asyncio\base_events.py", line 646, in run_until_complete
return future.result()
File "...\aesara_federated\rpc.py", line 54, in evaluate
return await self._unary_unary(
File "...\aefenv\lib\site-packages\betterproto\grpc\grpclib_client.py", line 85, in _unary_unary
response = await stream.recv_message()
File "...\aefenv\lib\site-packages\grpclib\client.py", line 425, in recv_message
await self.recv_initial_metadata()
File "...\aefenv\lib\site-packages\grpclib\client.py", line 367, in recv_initial_metadata
with self._wrapper:
File "...\aefenv\lib\site-packages\grpclib\utils.py", line 70, in __exit__
raise self._error
grpclib.exceptions.StreamTerminatedError: Connection lost
⚠ Note that with the demo example, use_stream=True takes 40 seconds for the parallelized MCMC sampling while use_stream=False takes 51 seconds.
The text was updated successfully, but these errors were encountered:
michaelosthege
changed the title
Diagnose what happens to the gRPC client when the server disconnects
Come up with a reconnect/failover mechanism
Aug 5, 2022
This is the relevant traceback when the server disconnects while streaming:
And this is the error when each request is sent as an independent message:
⚠ Note that with the demo example,
use_stream=True
takes 40 seconds for the parallelized MCMC sampling whileuse_stream=False
takes 51 seconds.The text was updated successfully, but these errors were encountered: