Replies: 2 comments 4 replies
-
Sorry for the delay in responding. Here are my disorganized thoughts:
|
Beta Was this translation helpful? Give feedback.
-
Typically, aggressive sampling is necessary for reducing performance overhead when using distributed tracing. I notice you're setting a sample rate of 0.1%, which should be sufficient. However, the sampling is implemented purely in We'd like to add better support for sampling in I suspect that baked-in sampling support in |
Beta Was this translation helpful? Give feedback.
-
Hey folks,
I am having a problem after enabling tracing in a rust service. This service serves as an in-memory cache for data loaded from persistence. Each host of the service handles 7K-8K QPS at peak of grpc requests from clients.
Before enabling tracing, the average and P95 response latency are below 1 ms. CPU usage of each host is below 40%.
However, after turning on the tracing, the average response latency went up to around 5 ms and P95 can be above 100ms. CPU usage increased to 50-60%.
The dependencies that are used in this service:
Basic information about the service:
Tracing pipeline configuration:
We are running out of ideas how to resolve the performance degradation after tracing is enabled.
I am happy to provide more details.
Could anyone shed some lights?
Thank you very much in advance.
Beta Was this translation helpful? Give feedback.
All reactions