ratelimit: add support for emitting filter state stats for access logging #39086
Conversation
CC @envoyproxy/api-shepherds: Your approval is needed for changes made to the API.

Force-pushed from 7f1adf4 to 4ca3fbd (compare)
…ging Signed-off-by: Rohit Agrawal <rohit.agrawal@databricks.com>
Thanks for the contribution. From my perspective, tracing is the best way to capture the overhead of the rate limit call. Cluster-level request-time buckets (histogram stats) are another option that could be used to observe the performance of the rate limit server.
Failing that, it's fine to add an additional simple timepoint in the downstream timing system. See DownstreamTiming::setValue.
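The timepoint suggestion above can be illustrated with a self-contained sketch. The types below are simplified stand-ins, not Envoy's actual API: the real DownstreamTiming lives in Envoy's StreamInfo and has a different interface; this only shows the idea of recording named monotonic timestamps per stream for later access logging.

```cpp
#include <cassert>
#include <chrono>
#include <map>
#include <optional>
#include <string>

// Hypothetical stand-in for a per-stream timing store: named monotonic
// timepoints, set once by a filter and queried later by access logging.
class DownstreamTiming {
public:
  using Clock = std::chrono::steady_clock;

  // Record a named timepoint (e.g. when the rate limit response arrived).
  void setValue(const std::string& key, Clock::time_point value) {
    timings_[key] = value;
  }

  // Look up a timepoint; absent keys yield nullopt so loggers can skip them.
  std::optional<Clock::time_point> getValue(const std::string& key) const {
    auto it = timings_.find(key);
    if (it == timings_.end()) {
      return std::nullopt;
    }
    return it->second;
  }

private:
  std::map<std::string, Clock::time_point> timings_;
};
```

A logger could then subtract two such timepoints to derive the rate limit call's latency without any new filter state object.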
/**
 * Returns the StreamInfo of the current request if possible. By default this returns nullptr.
 */
virtual StreamInfo::StreamInfo const* streamInfo() const { return nullptr; }
The client will be refactored in the future to be shared by multiple requests, so I don't think this is a correct way to expose the inner stream info.
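The pattern under review can be sketched in isolation. The stand-in types below are hypothetical (not Envoy's real classes); the sketch shows why the default-nullptr accessor forces callers to null-check, and why a client shared by multiple requests has no single StreamInfo to return, which is the reviewer's concern.

```cpp
#include <cassert>

// Hypothetical stand-in for Envoy's StreamInfo type, for illustration only.
namespace StreamInfo {
struct StreamInfo {
  int response_code = 0;
};
} // namespace StreamInfo

// Base client interface: returns nullptr by default, so callers must check.
class Client {
public:
  virtual ~Client() = default;
  virtual StreamInfo::StreamInfo const* streamInfo() const { return nullptr; }
};

// Only a client bound to exactly one request can meaningfully override this;
// a client shared across requests would have no single StreamInfo to expose.
class PerRequestClient : public Client {
public:
  StreamInfo::StreamInfo const* streamInfo() const override { return &info_; }

private:
  StreamInfo::StreamInfo info_;
};
```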
// When set to true, the filter will emit per-stream stats for access logging. These stats will be
// stored in the filter state under the filter name. The filter will emit latency, bytes
// sent/received, upstream host, and upstream cluster info.
//
// .. note::
//   Stats are only emitted to the filter state if a check request is actually made to the rate
//   limit service.
bool emit_filter_state_stats = 16;
If this doesn't change any behavior and won't add too much overhead, then I think we can enable it by default and skip the additional flag.
class RateLimitLoggingInfo : public Envoy::StreamInfo::FilterState::Object {
public:
  RateLimitLoggingInfo() = default;

  absl::optional<std::chrono::microseconds> latency() const { return latency_; }
  absl::optional<uint64_t> bytesSent() const { return bytes_sent_; }
  absl::optional<uint64_t> bytesReceived() const { return bytes_received_; }
  Upstream::ClusterInfoConstSharedPtr clusterInfo() const { return cluster_info_; }
  Upstream::HostDescriptionConstSharedPtr upstreamHost() const { return upstream_host_; }

  void setLatency(std::chrono::microseconds latency) { latency_ = latency; }
  void setBytesSent(uint64_t bytes_sent) { bytes_sent_ = bytes_sent; }
  void setBytesReceived(uint64_t bytes_received) { bytes_received_ = bytes_received; }
  void setClusterInfo(Upstream::ClusterInfoConstSharedPtr cluster_info) {
    cluster_info_ = std::move(cluster_info);
  }
  void setUpstreamHost(Upstream::HostDescriptionConstSharedPtr upstream_host) {
    upstream_host_ = std::move(upstream_host);
  }

  bool hasFieldSupport() const override { return true; }

private:
  absl::optional<std::chrono::microseconds> latency_;
  absl::optional<uint64_t> bytes_sent_;
  absl::optional<uint64_t> bytes_received_;
  Upstream::ClusterInfoConstSharedPtr cluster_info_;
  Upstream::HostDescriptionConstSharedPtr upstream_host_;
};
I don't see how this works for logging. And I think this actually exposes too much info, which exceeds the original requirements of #39018. In a complex system like this, not doing something is often more important than doing something.
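The intended flow for logging can be sketched in a self-contained form. The types below are simplified stand-ins (Envoy's real FilterState uses StreamInfo::FilterState with typed data slots and lifetimes, not a plain map), and the filter-state key is assumed to be the filter name as the proto comment states; the point is only to show the write side (filter) and the read side (access logger) of the object.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>
#include <map>
#include <memory>
#include <optional>
#include <string>

// Hypothetical stand-ins for Envoy's filter state machinery.
struct FilterStateObject {
  virtual ~FilterStateObject() = default;
};
using FilterState = std::map<std::string, std::shared_ptr<FilterStateObject>>;

// Simplified version of the RateLimitLoggingInfo object from this PR.
class RateLimitLoggingInfo : public FilterStateObject {
public:
  std::optional<std::chrono::microseconds> latency() const { return latency_; }
  std::optional<uint64_t> bytesSent() const { return bytes_sent_; }

  void setLatency(std::chrono::microseconds latency) { latency_ = latency; }
  void setBytesSent(uint64_t bytes_sent) { bytes_sent_ = bytes_sent; }

private:
  std::optional<std::chrono::microseconds> latency_;
  std::optional<uint64_t> bytes_sent_;
};

// Filter side: after the check request completes, store stats under the
// filter name so access loggers can find them later.
inline void recordRateLimitStats(FilterState& state,
                                 std::chrono::microseconds latency,
                                 uint64_t bytes_sent) {
  auto info = std::make_shared<RateLimitLoggingInfo>();
  info->setLatency(latency);
  info->setBytesSent(bytes_sent);
  state["envoy.filters.http.ratelimit"] = std::move(info);
}

// Access-log side: look up the object by key and read a field if present.
inline std::optional<std::chrono::microseconds>
loggedRateLimitLatency(const FilterState& state) {
  auto it = state.find("envoy.filters.http.ratelimit");
  if (it == state.end()) {
    return std::nullopt;
  }
  auto* info = dynamic_cast<const RateLimitLoggingInfo*>(it->second.get());
  return info ? info->latency() : std::nullopt;
}
```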
/wait

This pull request has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in 7 days if no further activity occurs. Please feel free to give a status update now, ping for review, or re-open when it's ready. Thank you for your contributions!

This pull request has been automatically closed because it has not had activity in the last 37 days. Please feel free to give a status update now, ping for review, or re-open when it's ready. Thank you for your contributions!
## Description

This PR adds support for emitting rate limiting stats to the filter state, accessible via access logs. It allows users to collect details about rate limit service interactions, including latency measurements, bytes sent/received, and upstream host information.

It introduces a new configuration option, emit_filter_state_stats, on the rate limit filter which, when enabled, stores detailed statistics about each rate limit request in the filter state. This data can then be accessed via access logs for monitoring and performance analysis. The feature is useful for tracking the performance overhead introduced by rate limiting in production environments and helps users better understand and optimize their rate limiting configuration.
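A configuration sketch of how the option might be enabled on the filter (the surrounding fields follow the existing envoy.filters.http.ratelimit v3 schema; cluster and domain names are placeholders, and emit_filter_state_stats is the field proposed in this PR):

```yaml
http_filters:
- name: envoy.filters.http.ratelimit
  typed_config:
    "@type": type.googleapis.com/envoy.extensions.filters.http.ratelimit.v3.RateLimit
    domain: my_domain                      # placeholder domain
    rate_limit_service:
      transport_api_version: V3
      grpc_service:
        envoy_grpc:
          cluster_name: ratelimit_cluster  # placeholder cluster
    # Proposed in this PR: store per-stream rate limit stats in filter state
    # so access loggers can read latency, bytes, and upstream info.
    emit_filter_state_stats: true
```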
Fixes: #39018
Commit Message: Add filter state statistics collection for rate limit filter
Additional Description: This PR adds a new configuration option, emit_filter_state_stats, which, when enabled, stores rate limit statistics in filter state. The statistics include latency measurements, bytes sent/received, upstream host info, and cluster info. These statistics can be accessed via access logs for monitoring purposes.
Risk Level: Low
Testing: Added integration tests
Docs Changes: Added
Release Notes: Added