From d34aa9df92f9e894b7742cd133d0ed3eccf8dac2 Mon Sep 17 00:00:00 2001
From: Aleksandra Spilkowska <aleksandra.spilkowska@elastic.co>
Date: Mon, 27 Oct 2025 12:53:21 +0100
Subject: [PATCH 1/5] Add troubleshooting guide for 429

---
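Note on the rate and burst limits described in the Causes section: the
second cause (bursts exceeding the short-term limit while the sustained rate
stays within limits) can be illustrated with a toy token-bucket model. This is
only a sketch under stated assumptions. The guide does not say how mOTLP
enforces its limits, so the token-bucket model, the `TokenBucket` class, the
`simulate` helper, and the mapping of the Serverless "Burst limit" to a 30 MB
bucket are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class TokenBucket:
    """Toy token-bucket limiter. Assumption: NOT the actual mOTLP mechanism."""

    rate_mb_per_s: float  # sustained refill rate (assumed analogue of "Rate limit")
    capacity_mb: float    # bucket size (assumed analogue of "Burst limit")
    tokens_mb: float = field(init=False)

    def __post_init__(self) -> None:
        # Start with a full bucket.
        self.tokens_mb = self.capacity_mb

    def advance(self, seconds: float) -> None:
        # Refill at the sustained rate, never above the bucket capacity.
        refill = self.rate_mb_per_s * seconds
        self.tokens_mb = min(self.capacity_mb, self.tokens_mb + refill)

    def try_send(self, payload_mb: float) -> int:
        # Accept the payload if enough tokens remain, otherwise throttle.
        if payload_mb <= self.tokens_mb:
            self.tokens_mb -= payload_mb
            return 200
        return 429


def simulate(payloads_mb, interval_s=1.0):
    """Send one payload per interval and collect the simulated status codes."""
    bucket = TokenBucket(rate_mb_per_s=15.0, capacity_mb=30.0)
    statuses = []
    for payload in payloads_mb:
        statuses.append(bucket.try_send(payload))
        bucket.advance(interval_s)
    return statuses


if __name__ == "__main__":
    print(simulate([10, 10, 10, 10, 10]))  # steady 10 MB/s
    print(simulate([25, 25, 0, 0, 0]))     # same 5 s window, but bursty
```

In this model the steady sender is never throttled, while the bursty sender
gets a 429 on its second payload even though its average over the window
(10 MB/s) is below the assumed 15 MB/s rate.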
 .../ingest/opentelemetry/429-errors-motlp.md  | 121 ++++++++++++++++++
 troubleshoot/ingest/opentelemetry/toc.yml     |   1 +
 2 files changed, 122 insertions(+)
 create mode 100644 troubleshoot/ingest/opentelemetry/429-errors-motlp.md
diff --git a/troubleshoot/ingest/opentelemetry/429-errors-motlp.md b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
new file mode 100644
index 0000000000..cf692962fa
--- /dev/null
+++ b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
@@ -0,0 +1,121 @@
+---
+navigation_title: 429 errors when using the mOTLP endpoint
+description: Resolve HTTP 429 `Too Many Requests` errors when sending data through the Elastic Cloud Managed OTLP (mOTLP) endpoint in Elastic Cloud Serverless or Elastic Cloud Hosted (ECH).
+applies_to:
+  stack:
+  serverless:
+    observability:
+  product:
+    edot_collector:
+products:
+  - id: cloud-serverless
+  - id: cloud-hosted
+  - id: observability
+  - id: edot-collector
+---
+
+# 429 errors when using the Elastic Cloud Managed OTLP Endpoint
+
+When sending telemetry data through the {{motlp}} (mOTLP), you might encounter HTTP `429 Too Many Requests` errors. These indicate that your ingest rate has temporarily exceeded the rate or burst limits configured for your Elastic Cloud project.
+
+This issue can occur in both Elastic Cloud Serverless and {{ech}} (ECH) environments.
+
+## Symptoms
+
+You might see log messages similar to the following in your EDOT Collector output or SDK logs:
+
+```json
+{
+  "code": 8,
+  "message": "error exporting items, request to <ingest endpoint> responded with HTTP Status Code 429"
+}
+```
+
+You might also see warnings, or rising backpressure metrics such as queue length or failed send count, in your Collector’s internal telemetry.
+
+## Causes
+
+A 429 status means that the rate of requests sent to the Managed OTLP endpoint has exceeded allowed thresholds. This can happen for several reasons:
+
+* Your telemetry pipeline is sending data faster than the allowed ingest rate.
+* Bursts of telemetry data exceed the short-term burst limit, even if your sustained rate is within limits.
+
+    The specific limits depend on your environment:
+
+    | Deployment type | Rate limit | Burst limit |
+    |-----------------|------------|-------------|
+    | Serverless      | 15 MB/s    | 30 MB/s     |
+    | ECH             | Depends on deployment size and available {{es}} capacity | Depends on deployment size and available {{es}} capacity |
+
+    Refer to the [Rate limiting section](opentelemetry://reference/motlp.md#rate-limiting) in the mOTLP reference documentation for details.
+
+* The {{es}} capacity for your Cloud deployment cannot handle the incoming data rate.
+* Multiple Collectors or SDKs are sending data concurrently without load balancing or backoff mechanisms.
+
+## Resolution
+
+To resolve 429 errors, identify whether the bottleneck is caused by ingest limits or {{es}} capacity.
+
+### Reduce ingest rate or enable backpressure
+
+Lower the telemetry export rate by enabling batching and retry mechanisms in your EDOT Collector or SDK configuration. For example:
+
+```yaml
+processors:
+  batch:
+    send_batch_size: 1000
+    timeout: 5s
+
+exporters:
+  otlp:
+    retry_on_failure:
+      enabled: true
+      initial_interval: 1s
+      max_interval: 30s
+      max_elapsed_time: 300s
+```
+
+These settings help smooth out spikes and automatically retry failed exports after rate-limit responses.
+
+### Scale your deployment or request higher limits
+
+If you’ve confirmed that your ingest configuration is stable but still encounter 429 errors:
+
+* Elastic Cloud Serverless: [Contact Elastic Support](contact-support.md) to request an increase in ingest limits.
+* {{ech}} (ECH): Increase your {{es}} capacity by scaling or resizing your deployment:
+  * [Scaling considerations](../../../deploy-manage/production-guidance/scaling-considerations.md)
+  * [Resize deployment](../../../deploy-manage/deploy/cloud-enterprise/resize-deployment.md)
+  * [Autoscaling in ECE and ECH](../../../deploy-manage/autoscaling/autoscaling-in-ece-and-ech.md)
+
+After scaling, monitor your ingest metrics to verify that the rate of accepted requests increases and 429 responses stop appearing.
+
+### Enable retry logic and queueing
+
+To minimize data loss during temporary throttling, configure your exporter to use a sending queue and retry logic. For example:
+
+```yaml
+exporters:
+  otlp:
+    sending_queue:
+      enabled: true
+      num_consumers: 10
+      queue_size: 1000
+    retry_on_failure:
+      enabled: true
+```
+
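+This lets the Collector buffer data locally while waiting for the ingest endpoint to recover from throttling. By default the queue is held in memory, so data can still be dropped if the queue fills up or the Collector restarts.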
+
+## Best practices
+
+To prevent 429 errors and maintain reliable telemetry data flow, implement these best practices:
+
+* Monitor internal Collector metrics (such as `otelcol_exporter_send_failed_spans`, `otelcol_exporter_queue_size`, and `otelcol_exporter_queue_capacity`) to detect backpressure early.
+* Distribute telemetry load evenly across multiple Collectors instead of sending all data through a single instance.
+* When possible, enable batching and compression to reduce payload size.
+* Use conservative retry settings, such as longer backoff intervals, to avoid overwhelming the endpoint after a temporary throttle.
+
+## Resources
+
+* [{{motlp}} reference](opentelemetry://reference/motlp.md)
+* [Quickstart: Send OTLP data to Elastic Serverless or {{ech}}](../../../solutions/observability/get-started/quickstart-elastic-cloud-otel-endpoint.md)
\ No newline at end of file
diff --git a/troubleshoot/ingest/opentelemetry/toc.yml b/troubleshoot/ingest/opentelemetry/toc.yml
index 60d2987880..d68e19de03 100644
--- a/troubleshoot/ingest/opentelemetry/toc.yml
+++ b/troubleshoot/ingest/opentelemetry/toc.yml
@@ -28,4 +28,5 @@ toc:
       - file: edot-sdks/misconfigured-sampling-sdk.md
   - file: no-data-in-kibana.md
   - file: connectivity.md
+  - file: 429-errors-motlp.md
   - file: contact-support.md

From ae34e5223aabb71f0f502fc737c38b8a52a23ef5 Mon Sep 17 00:00:00 2001
From: Aleksandra Spilkowska <aleksandra.spilkowska@elastic.co>
Date: Mon, 27 Oct 2025 13:30:22 +0100
Subject: [PATCH 2/5] Update the quickstart guide

---
 .../get-started/quickstart-elastic-cloud-otel-endpoint.md    | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/solutions/observability/get-started/quickstart-elastic-cloud-otel-endpoint.md b/solutions/observability/get-started/quickstart-elastic-cloud-otel-endpoint.md
index 2faa9ec8cf..c32d6d2f69 100644
--- a/solutions/observability/get-started/quickstart-elastic-cloud-otel-endpoint.md
+++ b/solutions/observability/get-started/quickstart-elastic-cloud-otel-endpoint.md
@@ -162,7 +162,10 @@ You must format your API key as `"Authorization": "ApiKey <api-key-value-here>"`
 
 ### Error: too many requests
 
-The Managed OTLP endpoint has per-project rate limits in place. If you reach this limit, reach out to our [support team](https://support.elastic.co). Refer to [Rate limiting](opentelemetry://reference/motlp.md#rate-limiting) for more information.
+If you see HTTP `429 Too Many Requests` errors when sending data through the Elastic Cloud Managed OTLP Endpoint (mOTLP), your project might be hitting ingest rate limits.
+
+Refer to the dedicated [429 errors when using the Elastic Cloud Managed OTLP Endpoint](/troubleshoot/ingest/opentelemetry/429-errors-motlp.md) troubleshooting guide for details on causes, rate limits, and solutions.
+
 
 ## Provide feedback
 

From 7c97bf5e5331ce8efcc4c4ec2c4de3bd1956cecb Mon Sep 17 00:00:00 2001
From: Aleksandra Spilkowska <aleksandra.spilkowska@elastic.co>
Date: Wed, 29 Oct 2025 16:13:26 +0100
Subject: [PATCH 3/5] Apply comments

---
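Note on the `retry_on_failure` example in the "Reduce ingest rate or enable
backpressure" section: the values shown there (`initial_interval: 1s`,
`max_interval: 30s`, `max_elapsed_time: 300s`) bound how long the exporter
keeps retrying. The sketch below estimates the resulting wait times. It is a
rough approximation, not the Collector's implementation: the
`backoff_schedule` helper is hypothetical, the `multiplier` default of 1.5 is
an assumption, and jitter is ignored.

```python
def backoff_schedule(
    initial_interval=1.0,
    max_interval=30.0,
    max_elapsed_time=300.0,
    multiplier=1.5,  # assumption: growth factor between consecutive retries
):
    """Return the waits (in seconds) before each retry, without jitter."""
    waits = []
    elapsed = 0.0
    interval = initial_interval
    # Stop once the next wait would exceed the total retry budget.
    while elapsed + interval <= max_elapsed_time:
        waits.append(interval)
        elapsed += interval
        # Grow the wait exponentially, capped at max_interval.
        interval = min(interval * multiplier, max_interval)
    return waits


if __name__ == "__main__":
    waits = backoff_schedule()
    print(f"{len(waits)} retries over {sum(waits):.1f}s")
    print("first waits:", waits[:5])
```

Lowering `max_elapsed_time` makes the exporter give up sooner, while raising
`max_interval` spaces retries further apart once the cap is reached.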
 .../ingest/opentelemetry/429-errors-motlp.md  | 28 ++++++++++---------
 1 file changed, 15 insertions(+), 13 deletions(-)

diff --git a/troubleshoot/ingest/opentelemetry/429-errors-motlp.md b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
index cf692962fa..342959f725 100644
--- a/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
+++ b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
@@ -47,15 +47,29 @@ A 429 status means that the rate of requests sent to the Managed OTLP endpoint h
     | Serverless      | 15 MB/s    | 30 MB/s     |
     | ECH             | Depends on deployment size and available {{es}} capacity | Depends on deployment size and available {{es}} capacity |
 
+    Exact limits depend on your subscription tier.
     Refer to the [Rate limiting section](opentelemetry://reference/motlp.md#rate-limiting) in the mOTLP reference documentation for details.
 
-* The {{es}} capacity for your Cloud deployment cannot handle the incoming data rate.
+* In {{ech}}, the {{es}} capacity for your deployment might be underscaled for the current ingest rate.
+* In Elastic Cloud Serverless, rate limiting should not result from {{es}} capacity, since the platform automatically scales ingest capacity. If you suspect a scaling issue, [contact Elastic Support](contact-support.md).
 * Multiple Collectors or SDKs are sending data concurrently without load balancing or backoff mechanisms.
 
 ## Resolution
 
 To resolve 429 errors, identify whether the bottleneck is caused by ingest limits or {{es}} capacity.
 
+### Scale your deployment or request higher limits
+
+If you’ve confirmed that your ingest configuration is stable but still encounter 429 errors:
+
+* Elastic Cloud Serverless: [Contact Elastic Support](contact-support.md) to request an increase in ingest limits.
+* {{ech}} (ECH): Increase your {{es}} capacity by scaling or resizing your deployment:
+  * [Scaling considerations](../../../deploy-manage/production-guidance/scaling-considerations.md)
+  * [Resize deployment](../../../deploy-manage/deploy/cloud-enterprise/resize-deployment.md)
+  * [Autoscaling in ECE and ECH](../../../deploy-manage/autoscaling/autoscaling-in-ece-and-ech.md)
+
+After scaling, monitor your ingest metrics to verify that the rate of accepted requests increases and 429 responses stop appearing.
+
 ### Reduce ingest rate or enable backpressure
 
 Lower the telemetry export rate by enabling batching and retry mechanisms in your EDOT Collector or SDK configuration. For example:
@@ -77,18 +91,6 @@ exporters:
 
 These settings help smooth out spikes and automatically retry failed exports after rate-limit responses.
 
-### Scale your deployment or request higher limits
-
-If you’ve confirmed that your ingest configuration is stable but still encounter 429 errors:
-
-* Elastic Cloud Serverless: [Contact Elastic Support](contact-support.md) to request an increase in ingest limits.
-* {{ech}} (ECH): Increase your {{es}} capacity by scaling or resizing your deployment:
-  * [Scaling considerations](../../../deploy-manage/production-guidance/scaling-considerations.md)
-  * [Resize deployment](../../../deploy-manage/deploy/cloud-enterprise/resize-deployment.md)
-  * [Autoscaling in ECE and ECH](../../../deploy-manage/autoscaling/autoscaling-in-ece-and-ech.md)
-
-After scaling, monitor your ingest metrics to verify that the rate of accepted requests increases and 429 responses stop appearing.
-
 ### Enable retry logic and queueing
 
 To minimize data loss during temporary throttling, configure your exporter to use a sending queue and retry logic. For example:

From 80ba73ed721fd16a1a4ef43c7b94d276cf9ca5a5 Mon Sep 17 00:00:00 2001
From: Aleksandra Spilkowska <aleksandra.spilkowska@elastic.co>
Date: Fri, 31 Oct 2025 11:57:04 +0100
Subject: [PATCH 4/5] Update toc

---
 troubleshoot/toc.yml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/troubleshoot/toc.yml b/troubleshoot/toc.yml
index cb0ec459c2..62fa09897b 100644
--- a/troubleshoot/toc.yml
+++ b/troubleshoot/toc.yml
@@ -171,6 +171,7 @@ toc:
               - file: ingest/opentelemetry/edot-sdks/misconfigured-sampling-sdk.md
           - file: ingest/opentelemetry/no-data-in-kibana.md
           - file: ingest/opentelemetry/connectivity.md
+          - file: ingest/opentelemetry/429-errors-motlp.md
           - file: ingest/opentelemetry/contact-support.md
       - file: ingest/logstash.md
         children:

From c1d66110aa611a65e726dc6a7e25d2053ff992c5 Mon Sep 17 00:00:00 2001
From: Aleksandra Spilkowska <aleksandra.spilkowska@elastic.co>
Date: Fri, 31 Oct 2025 11:59:45 +0100
Subject: [PATCH 5/5] Use placeholders

---
 troubleshoot/ingest/opentelemetry/429-errors-motlp.md | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/troubleshoot/ingest/opentelemetry/429-errors-motlp.md b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
index 342959f725..25d05751ba 100644
--- a/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
+++ b/troubleshoot/ingest/opentelemetry/429-errors-motlp.md
@@ -16,9 +16,9 @@ products:
 
 # 429 errors when using the Elastic Cloud Managed OTLP Endpoint
 
-When sending telemetry data through the {{motlp}} (mOTLP), you might encounter HTTP `429 Too Many Requests` errors. These indicate that your ingest rate has temporarily exceeded the rate or burst limits configured for your Elastic Cloud project.
+When sending telemetry data through the {{motlp}} (mOTLP), you might encounter HTTP `429 Too Many Requests` errors. These indicate that your ingest rate has temporarily exceeded the rate or burst limits configured for your {{ecloud}} project.
 
-This issue can occur in both Elastic Cloud Serverless and {{ech}} (ECH) environments.
+This issue can occur in both {{serverless-full}} and {{ech}} (ECH) environments.
 
 ## Symptoms
 
@@ -51,7 +51,7 @@ A 429 status means that the rate of requests sent to the Managed OTLP endpoint h
     Refer to the [Rate limiting section](opentelemetry://reference/motlp.md#rate-limiting) in the mOTLP reference documentation for details.
 
 * In {{ech}}, the {{es}} capacity for your deployment might be underscaled for the current ingest rate.
-* In Elastic Cloud Serverless, rate limiting should not result from {{es}} capacity, since the platform automatically scales ingest capacity. If you suspect a scaling issue, [contact Elastic Support](contact-support.md).
+* In {{serverless-full}}, rate limiting should not result from {{es}} capacity, since the platform automatically scales ingest capacity. If you suspect a scaling issue, [contact Elastic Support](contact-support.md).
 * Multiple Collectors or SDKs are sending data concurrently without load balancing or backoff mechanisms.
 
 ## Resolution
@@ -62,7 +62,7 @@ To resolve 429 errors, identify whether the bottleneck is caused by ingest limit
 
 If you’ve confirmed that your ingest configuration is stable but still encounter 429 errors:
 
-* Elastic Cloud Serverless: [Contact Elastic Support](contact-support.md) to request an increase in ingest limits.
+* {{serverless-full}}: [Contact Elastic Support](contact-support.md) to request an increase in ingest limits.
 * {{ech}} (ECH): Increase your {{es}} capacity by scaling or resizing your deployment:
   * [Scaling considerations](../../../deploy-manage/production-guidance/scaling-considerations.md)
   * [Resize deployment](../../../deploy-manage/deploy/cloud-enterprise/resize-deployment.md)