From 98c10c67a985403831817fad31da268e2fd693f7 Mon Sep 17 00:00:00 2001
From: "promptless[bot]" <179508745+promptless[bot]@users.noreply.github.com>
Date: Thu, 4 Dec 2025 20:50:59 +0000
Subject: [PATCH 1/7] Add RunPod changelog in Mintlify format
---
changelog.mdx | 159 ++++++++++++++++++++++++++++++++++++++++++++++++++
docs.json | 1 +
2 files changed, 160 insertions(+)
create mode 100644 changelog.mdx
diff --git a/changelog.mdx b/changelog.mdx
new file mode 100644
index 00000000..eb0fda96
--- /dev/null
+++ b/changelog.mdx
@@ -0,0 +1,159 @@
+---
+title: "Changelog"
+description: "Product updates and announcements"
+---
+
+
+- **Slurm Instant Clusters now available.** Slurm Instant Clusters are now fully available on Runpod. Deploy production-ready HPC clusters in seconds instead of hours or days. The clusters support multi-node performance for distributed training and large-scale simulations. Manage clusters from the web interface with pay-as-you-go billing and no idle costs.
+- **Model Store in beta.** A caching system that eliminates model download times when starting workers. Instead of embedding large models in Docker images or downloading them at startup, Model Store places models on host machines before workers start. When you specify a model from Hugging Face, the scheduler prioritizes hosts with your model already cached. If the model is cached, workers start instantly. If not, the system downloads the model before your worker starts billing.
+- **New public endpoints.** Wan 2.5 combines an image and audio to create lifelike videos. Nano Banana combines multiple images to create composite images for product placement, character development, and more.
+
+
+
+- **Runpod Hub revenue share model.** Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue via monthly tiers (1/3/5/7%). Credits are auto-deposited into your account.
+- **Pods UX update.** Updated modern interface to interact with Runpod Pods.
+
+
+
+- **Public endpoints.** Instant access to state-of-the-art AI models through simple API calls. An API playground is available through the Runpod Hub. Available public endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, Minimax Speech, and others.
+- **Instant Clusters v2 (Slurm beta).** Create on-demand multi-node clusters instantly with full Slurm scheduling support in beta.
+
+
+
+- **S3 API.** Upload and retrieve files without compute. Use AWS S3 CLI or Boto3 with zero-config ease. Integrate Runpod storage into any AI pipeline with no rewrites. Manage data with object-level control for teams, models, and apps.
+- **Referrals v2.** Updated rewards and tiers with clearer dashboards to track performance.
+
+
+
+- **Port labeling.** Name exposed ports in the UI and API for clearer collaboration. For example, label ports as "Jupyter" or "TensorBoard" to help team members identify what each port is for.
+- **Price drops.** Additional price reductions on popular GPU SKUs to lower training and inference costs.
+- **Runpod Serverless Hub.** A curated marketplace of one-click endpoints and templates. Fork and deploy community projects without starting from scratch.
+- **Tetra beta test.** A Python library for running parts of your code on a GPU using Runpod. Add a `@remote()` decorator to functions that need GPU power, and Tetra handles offloading the work to Runpod while the rest of your code runs locally.
+
+
+
+- **Login with GitHub.** OAuth sign-in and linking for faster onboarding and repo-driven workflows.
+- **RTX 5090s on Runpod.** RTX 5090 availability for high performance and cost-efficiency on training and inference.
+- **Global networking expansion.** Rollout to many additional data centers approaching full global coverage.
+
+
+
+- **CPU Pods get network storage access.** GA support for network volumes on CPU Pods for persistent, shareable storage.
+- **SOC 2 Type I certification.** Independent attestation of security controls for enterprise readiness.
+- **REST API release.** REST API GA with broad resource coverage for full IaC workflows.
+- **Instant Clusters.** Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
+- **Bare metal.** Reserve dedicated GPU servers for maximum control, performance, and long-term savings.
+- **AP-JP-1.** New Fukushima region for low-latency APAC access and in-country data residency.
+
+
+
+- **REST API beta test.** RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
+- **Full time community manager hire.** Dedicated programs, content, and faster community response.
+- **Serverless GitHub integration release.** GA for GitHub-based Serverless deploys with production-ready stability.
+
+
+
+- **CPU Pods v2.** Docker runtime parity with GPU Pods for faster starts. Adds network volume support.
+- **H200s on Runpod.** NVIDIA H200 GPUs available for larger models and higher memory bandwidth.
+- **Serverless upgrades.** Higher GPU counts per worker, new quick-deploy runtimes, and simpler model selection.
+
+
+
+- **Global networking added to CA-MTL-3, US-GA-1, US-GA-2, US-KS-2.** Expanded data center coverage for the private mesh.
+- **Serverless GitHub integration beta test.** Deploy endpoints directly from GitHub repos with automatic builds.
+- **Scoped API keys.** Least-privilege tokens with fine-grained scopes and expirations for safer automation.
+- **Passkey auth.** Passwordless WebAuthn sign-in for phishing-resistant account access.
+
+
+
+- **US-GA-2 added to network storage.** Enable network volumes in US-GA-2.
+- **Global networking.** Private cross-data-center networking with internal DNS for secure service-to-service traffic.
+
+
+
+- **US-TX-3 and EUR-IS-1 added to network storage.** Network volumes available in more regions for local persistence.
+- **Runpod slashes GPU prices.** Broad GPU price reductions to lower training and inference TCO.
+- **Referral program revamp.** Updated commissions and bonuses with an affiliate tier and improved tracking.
+
+
+
+- **$20M seed by Intel Capital and Dell Technologies Capital.** Funds infrastructure expansion and product acceleration.
+- **First in person hackathon.** Community projects, workshops, and real-world feedback.
+- **Serverless CPU Pods.** Scale-to-zero CPU endpoints for services that don't need a GPU.
+- **AMD GPUs.** AMD ROCm-compatible GPU SKUs as cost and performance alternatives to NVIDIA.
+
+
+
+- **CPU Pods.** CPU-only instances with the same networking and storage primitives for cheaper non-GPU stages.
+- **runpodctl.** Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows.
+
+
+
+- **New navigational changes to Runpod UI.** Consolidated menus, consistent action placement, and fewer clicks for common tasks.
+- **Docs revamp.** New information architecture, improved search, and more runnable examples and quickstarts.
+- **Zhen AMA.** Roadmap Q&A and community feedback session.
+
+
+
+- **US-OR-1.** Additional US region for lower latency and more capacity in the Pacific Northwest.
+- **CA-MTL-1.** New Canadian region to improve latency and in-country data needs.
+- **Our first community manager hire.** Dedicated community programs and faster feedback loops.
+- **Building out the support team.** Expanded coverage and expertise for complex issues.
+
+
+
+- **Serverless quick deploy.** One-click deploy of curated model templates with sensible defaults.
+- **EU domain for Serverless.** EU-specific domain briefly offered for data residency, superseded by other region controls.
+- **Data-center filter for Serverless.** Filter and manage endpoints by region for multi-region fleets.
+
+
+
+- **Self service worker upgrade.** Rebuild and roll workers from the dashboard without support tickets.
+- **Edit template from endpoint page.** Inline edit and redeploy the underlying template directly from the endpoint view.
+- **Improved Serverless metrics page.** Refinements to charts and filters for quicker root-cause analysis.
+- **Flex and active workers.** Discounted always-on "Active" capacity for baseline load. Burst with on-demand "Flex" workers.
+- **Billing explorer.** Inspect costs by resource, region, and time to identify optimization opportunities.
+
+
+
+- **Teams.** Organization workspaces with role-based access control for Pods, endpoints, and billing.
+- **Savings plans.** Plans surfaced prominently in console with easier purchase and management for steady usage.
+- **Network storage to US-KS-1.** Enable network volumes in US-KS-1 for local, persistent data workflows.
+- **Serverless log view.** Stream worker stdout and stderr in the UI and API for real-time debugging.
+- **Serverless health endpoint.** Lightweight /health probe returning endpoint and worker status without creating a billable job.
+- **SOC 2 Type II compliant.** Security and compliance certification for enterprise customers.
+
+
+
+- **Serverless metrics page.** Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning.
+- **H100 on Runpod.** NVIDIA H100 instances for higher throughput and larger model footprints.
+- **Savings plans.** Commitment-based discounts for predictable workloads to lower effective hourly rates.
+
+
+
+- **The new and improved Runpod login experience.** Streamlined sign-in and team access for faster, more consistent auth flows.
+- **Network volumes added to Serverless.** Attach persistent storage to Serverless workers to retain models and artifacts across restarts and speed cold starts via caching.
+- **Serverless region support.** Pin or allow specific regions for endpoints to reduce latency and meet data-residency needs.
+
+
+
+- **Serverless scaling strategies.** Scale by queue delay and/or concurrency with min/max worker bounds to balance latency and cost.
+- **Queue delay.** Expose time-in-queue as a first-class metric to drive autoscaling and SLO monitoring.
+- **Request count.** Track success and failure totals over windows for quick health checks and alerting.
+- **runsync.** Synchronous invocation path that returns results in the same HTTP call for short-running jobs.
+- **Network storage beta.** Region-scoped, attachable volumes shareable across Pods and endpoints for model caches and datasets.
+- **Job cancel API.** Programmatically terminate queued or running jobs to free capacity and enforce client timeouts.
+
+
+
+- **Serverless API v2.** Revised request and response schema with improved error semantics. New endpoints provide better control over job lifecycle and observability.
+
+
+
+- **Notification preferences.** Configure which platform events trigger alerts to reduce noise for teams and CI systems.
+- **GPU priorities.** Influence scheduling by marking workloads as higher priority to reduce queue time for critical jobs.
+
+
+
+- **Runpod now offers encrypted volumes.** Enable at-rest encryption for persistent volumes with no application changes required. Keys are platform-managed, and encrypted volumes mount like standard volumes.
+
diff --git a/docs.json b/docs.json
index c81eb94b..724a1766 100644
--- a/docs.json
+++ b/docs.json
@@ -191,6 +191,7 @@
{
"group": "Reference",
"pages": [
+ "changelog",
"references/billing-information",
"references/referrals",
"references/security-and-compliance",
From d2b0df5b22a8930ba7b77278a5327476d1fde0b7 Mon Sep 17 00:00:00 2001
From: Mo King
Date: Thu, 4 Dec 2025 16:12:07 -0500
Subject: [PATCH 2/7] Improve changelog formatting and content
---
changelog.mdx | 125 +++++++++++++++++++++++++++++++-------------------
docs.json | 12 ++++-
2 files changed, 90 insertions(+), 47 deletions(-)
diff --git a/changelog.mdx b/changelog.mdx
index eb0fda96..c7252b13 100644
--- a/changelog.mdx
+++ b/changelog.mdx
@@ -1,113 +1,139 @@
---
-title: "Changelog"
-description: "Product updates and announcements"
+title: "Product updates"
+sidebarTitle: "Product updates"
+description: "Product updates and announcements for Runpod"
---
-
-- **Slurm Instant Clusters now available.** Slurm Instant Clusters are now fully available on Runpod. Deploy production-ready HPC clusters in seconds instead of hours or days. The clusters support multi-node performance for distributed training and large-scale simulations. Manage clusters from the web interface with pay-as-you-go billing and no idle costs.
-- **Model Store in beta.** A caching system that eliminates model download times when starting workers. Instead of embedding large models in Docker images or downloading them at startup, Model Store places models on host machines before workers start. When you specify a model from Hugging Face, the scheduler prioritizes hosts with your model already cached. If the model is cached, workers start instantly. If not, the system downloads the model before your worker starts billing.
-- **New public endpoints.** Wan 2.5 combines an image and audio to create lifelike videos. Nano Banana combines multiple images to create composite images for product placement, character development, and more.
+
+## Slurm Clusters GA, cached models in beta, and new Public Endpoints
+
+- [Slurm Clusters are now generally available](/instant-clusters/slurm-clusters) on Runpod. Deploy production-ready HPC clusters in seconds instead of hours or days. The clusters support multi-node performance for distributed training and large-scale simulations. Manage clusters from the web interface with pay-as-you-go billing and no idle costs.
+- [Cached models are now in beta](/serverless/endpoints/model-caching). When enabled, this feature eliminates model download times when starting workers. Instead of embedding large models in Docker images or downloading them at startup, cached models are placed on host machines before workers start. When you specify a model from Hugging Face, the scheduler prioritizes hosts with your model already cached. If the model is cached, workers start instantly. If not, the system downloads the model before your worker starts billing.
+- [New Public Endpoints](/hub/public-endpoints) are now available: Wan 2.5 combines an image and audio to create lifelike videos, and Nano Banana combines multiple images to create composite images for product placement, character development, and more.
-
-- **Runpod Hub revenue share model.** Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue via monthly tiers (1/3/5/7%). Credits are auto-deposited into your account.
-- **Pods UX update.** Updated modern interface to interact with Runpod Pods.
+
+## Hub revenue share for maintainers and new Pods UX
+
+- [Runpod Hub revenue share model](/hub/revenue-sharing). Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue via monthly tiers (1/3/5/7%). Credits are auto-deposited into your account.
+- [Pods UX update](/pods/overview). Updated modern interface to interact with Runpod Pods.
-
-- **Public endpoints.** Instant access to state-of-the-art AI models through simple API calls. An API playground is available through the Runpod Hub. Available public endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, Minimax Speech, and others.
-- **Instant Clusters v2 (Slurm beta).** Create on-demand multi-node clusters instantly with full Slurm scheduling support in beta.
+
+## Public Endpoints and one-click Slurm clusters
+
+- [Public Endpoints](/hub/public-endpoints). Instant access to state-of-the-art AI models through simple API calls. An API playground is available through the Runpod Hub. Available public endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, Minimax Speech, and others.
+- [Slurm Clusters (beta)](/instant-clusters/slurm-clusters). Create on-demand multi-node clusters instantly with full Slurm scheduling support.
-
-- **S3 API.** Upload and retrieve files without compute. Use AWS S3 CLI or Boto3 with zero-config ease. Integrate Runpod storage into any AI pipeline with no rewrites. Manage data with object-level control for teams, models, and apps.
-- **Referrals v2.** Updated rewards and tiers with clearer dashboards to track performance.
+
+## Upload and retrieve files without compute and updated referral program
+
+- [S3 API](/storage/s3-api). Upload and retrieve files without compute. Use AWS S3 CLI or Boto3 with zero-config ease. Integrate Runpod storage into any AI pipeline with no rewrites. Manage data with object-level control for teams, models, and apps.
+- [Referrals v2](/referrals). Updated rewards and tiers with clearer dashboards to track performance.
-
-- **Port labeling.** Name exposed ports in the UI and API for clearer collaboration. For example, label ports as "Jupyter" or "TensorBoard" to help team members identify what each port is for.
-- **Price drops.** Additional price reductions on popular GPU SKUs to lower training and inference costs.
-- **Runpod Serverless Hub.** A curated marketplace of one-click endpoints and templates. Fork and deploy community projects without starting from scratch.
-- **Tetra beta test.** A Python library for running parts of your code on a GPU using Runpod. Add a `@remote()` decorator to functions that need GPU power, and Tetra handles offloading the work to Runpod while the rest of your code runs locally.
+
+## UX polish, further price relief, and a marketplace with new Python library
+
+- [Port labeling](/pods/overview). Name exposed ports in the UI and API for clearer collaboration. For example, label ports as "Jupyter" or "TensorBoard" to help team members identify what each port is for.
+- [Price drops](/pods/pricing). Additional price reductions on popular GPU SKUs to lower training and inference costs.
+- [Runpod Serverless Hub](/hub/overview). A curated marketplace of one-click endpoints and templates. Fork and deploy community projects without starting from scratch.
+- **Tetra beta test**. A Python library for running parts of your code on a GPU using Runpod. Add a `@remote()` decorator to functions that need GPU power, and Tetra handles offloading the work to Runpod while the rest of your code runs locally.
-
+
+## SSO convenience and wider global mesh
+
- **Login with GitHub.** OAuth sign-in and linking for faster onboarding and repo-driven workflows.
- **RTX 5090s on Runpod.** RTX 5090 availability for high performance and cost-efficiency on training and inference.
- **Global networking expansion.** Rollout to many additional data centers approaching full global coverage.
-
+
+## Enterprise features: compliance, APIs, clusters, bare metal, and APAC expansion
+
- **CPU Pods get network storage access.** GA support for network volumes on CPU Pods for persistent, shareable storage.
- **SOC 2 Type I certification.** Independent attestation of security controls for enterprise readiness.
-- **REST API release.** REST API GA with broad resource coverage for full IaC workflows.
-- **Instant Clusters.** Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
+- [REST API release](/api-reference/overview). REST API GA with broad resource coverage for full IaC workflows.
+- [Instant Clusters](/instant-clusters/overview). Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
- **Bare metal.** Reserve dedicated GPU servers for maximum control, performance, and long-term savings.
- **AP-JP-1.** New Fukushima region for low-latency APAC access and in-country data residency.
-
-- **REST API beta test.** RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
-- **Full time community manager hire.** Dedicated programs, content, and faster community response.
-- **Serverless GitHub integration release.** GA for GitHub-based Serverless deploys with production-ready stability.
+
+## Modern API surface in beta and stronger community investment
+- [REST API beta test](/api-reference/overview). RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
+- [Full time community manager hire](/community/community-manager). Dedicated programs, content, and faster community response.
+- [Serverless GitHub integration release](/serverless/workers/github-integration). GA for GitHub-based Serverless deploys with production-ready stability.
-
+
+## New silicon options and LLM-centric Serverless upgrades
- **CPU Pods v2.** Docker runtime parity with GPU Pods for faster starts. Adds network volume support.
- **H200s on Runpod.** NVIDIA H200 GPUs available for larger models and higher memory bandwidth.
- **Serverless upgrades.** Higher GPU counts per worker, new quick-deploy runtimes, and simpler model selection.
-
+
+## Global networking rollout continues and GitHub deploys arrive in beta
- **Global networking added to CA-MTL-3, US-GA-1, US-GA-2, US-KS-2.** Expanded data center coverage for the private mesh.
- **Serverless GitHub integration beta test.** Deploy endpoints directly from GitHub repos with automatic builds.
- **Scoped API keys.** Least-privilege tokens with fine-grained scopes and expirations for safer automation.
- **Passkey auth.** Passwordless WebAuthn sign-in for phishing-resistant account access.
-
+
+## More storage coverage and private cross-DC connectivity
- **US-GA-2 added to network storage.** Enable network volumes in US-GA-2.
- **Global networking.** Private cross-data-center networking with internal DNS for secure service-to-service traffic.
-
+
+## Storage coverage grows, major price cuts, and revamped referrals
- **US-TX-3 and EUR-IS-1 added to network storage.** Network volumes available in more regions for local persistence.
- **Runpod slashes GPU prices.** Broad GPU price reductions to lower training and inference TCO.
- **Referral program revamp.** Updated commissions and bonuses with an affiliate tier and improved tracking.
-
+
+## $20M seed round, community event, and broader Serverless and accelerator options
- **$20M seed by Intel Capital and Dell Technologies Capital.** Funds infrastructure expansion and product acceleration.
- **First in person hackathon.** Community projects, workshops, and real-world feedback.
- **Serverless CPU Pods.** Scale-to-zero CPU endpoints for services that don't need a GPU.
- **AMD GPUs.** AMD ROCm-compatible GPU SKUs as cost and performance alternatives to NVIDIA.
-
+
+## Compute beyond GPUs and first-class automation tooling
- **CPU Pods.** CPU-only instances with the same networking and storage primitives for cheaper non-GPU stages.
-- **runpodctl.** Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows.
+- [runpodctl](/runpodctl/overview). Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows.
-
+
+## Console navigation overhaul and documentation refresh
- **New navigational changes to Runpod UI.** Consolidated menus, consistent action placement, and fewer clicks for common tasks.
- **Docs revamp.** New information architecture, improved search, and more runnable examples and quickstarts.
- **Zhen AMA.** Roadmap Q&A and community feedback session.
-
+
+## New regions and investment in community and support
- **US-OR-1.** Additional US region for lower latency and more capacity in the Pacific Northwest.
- **CA-MTL-1.** New Canadian region to improve latency and in-country data needs.
- **Our first community manager hire.** Dedicated community programs and faster feedback loops.
- **Building out the support team.** Expanded coverage and expertise for complex issues.
-
+
+## Faster starts from templates and better multi-region hygiene
- **Serverless quick deploy.** One-click deploy of curated model templates with sensible defaults.
- **EU domain for Serverless.** EU-specific domain briefly offered for data residency, superseded by other region controls.
- **Data-center filter for Serverless.** Filter and manage endpoints by region for multi-region fleets.
-
+
+## Self-service upgrades, clearer metrics, new pricing model, and cost visibility
- **Self service worker upgrade.** Rebuild and roll workers from the dashboard without support tickets.
- **Edit template from endpoint page.** Inline edit and redeploy the underlying template directly from the endpoint view.
- **Improved Serverless metrics page.** Refinements to charts and filters for quicker root-cause analysis.
@@ -115,7 +141,8 @@ description: "Product updates and announcements"
- **Billing explorer.** Inspect costs by resource, region, and time to identify optimization opportunities.
-
+
+## Team governance, storage expansion, and better debugging and health
- **Teams.** Organization workspaces with role-based access control for Pods, endpoints, and billing.
- **Savings plans.** Plans surfaced prominently in console with easier purchase and management for steady usage.
- **Network storage to US-KS-1.** Enable network volumes in US-KS-1 for local, persistent data workflows.
@@ -124,19 +151,22 @@ description: "Product updates and announcements"
- **SOC 2 Type II compliant.** Security and compliance certification for enterprise customers.
-
+
+## Observability, top-tier GPUs, and commitment-based savings
- **Serverless metrics page.** Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning.
- **H100 on Runpod.** NVIDIA H100 instances for higher throughput and larger model footprints.
- **Savings plans.** Commitment-based discounts for predictable workloads to lower effective hourly rates.
-
+
+## Smoother auth and multi-region Serverless with persistent storage
- **The new and improved Runpod login experience.** Streamlined sign-in and team access for faster, more consistent auth flows.
- **Network volumes added to Serverless.** Attach persistent storage to Serverless workers to retain models and artifacts across restarts and speed cold starts via caching.
- **Serverless region support.** Pin or allow specific regions for endpoints to reduce latency and meet data-residency needs.
-
+
+## Deeper autoscaling controls, richer metrics, persistent storage, and job cancellation
- **Serverless scaling strategies.** Scale by queue delay and/or concurrency with min/max worker bounds to balance latency and cost.
- **Queue delay.** Expose time-in-queue as a first-class metric to drive autoscaling and SLO monitoring.
- **Request count.** Track success and failure totals over windows for quick health checks and alerting.
@@ -145,15 +175,18 @@ description: "Product updates and announcements"
- **Job cancel API.** Programmatically terminate queued or running jobs to free capacity and enforce client timeouts.
-
+
+## Serverless platform hardens with a cleaner, more capable API
- **Serverless API v2.** Revised request and response schema with improved error semantics. New endpoints provide better control over job lifecycle and observability.
-
+
+## Better control over notifications and GPU allocation during contention
- **Notification preferences.** Configure which platform events trigger alerts to reduce noise for teams and CI systems.
- **GPU priorities.** Influence scheduling by marking workloads as higher priority to reduce queue time for critical jobs.
-
+
+## Security-first release enabling encryption for persistent data
- **Runpod now offers encrypted volumes.** Enable at-rest encryption for persistent volumes with no application changes required. Keys are platform-managed, and encrypted volumes mount like standard volumes.
diff --git a/docs.json b/docs.json
index 724a1766..3fdd5e47 100644
--- a/docs.json
+++ b/docs.json
@@ -191,7 +191,6 @@
{
"group": "Reference",
"pages": [
- "changelog",
"references/billing-information",
"references/referrals",
"references/security-and-compliance",
@@ -465,6 +464,17 @@
]
}
]
+ },
+ {
+ "tab": "Changelog",
+ "groups": [
+ {
+ "group": "Changelog",
+ "pages": [
+ "changelog"
+ ]
+ }
+ ]
}
]
},
From 1013f25ac1f89780544e50a5a184106999562c3a Mon Sep 17 00:00:00 2001
From: Mo King
Date: Thu, 4 Dec 2025 16:28:49 -0500
Subject: [PATCH 3/7] Update description
---
changelog.mdx | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/changelog.mdx b/changelog.mdx
index c7252b13..e438d779 100644
--- a/changelog.mdx
+++ b/changelog.mdx
@@ -1,7 +1,7 @@
---
title: "Product updates"
sidebarTitle: "Product updates"
-description: "Product updates and announcements for Runpod"
+description: "New features, bug fixes, and improvements for Runpod"
---
@@ -16,7 +16,7 @@ description: "Product updates and announcements for Runpod"
## Hub revenue share for maintainers and new Pods UX
- [Runpod Hub revenue share model](/hub/revenue-sharing). Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue via monthly tiers (1/3/5/7%). Credits are auto-deposited into your account.
-- [Pods UX update](/pods/overview). Updated modern interface to interact with Runpod Pods.
+- [Pods UI updated](/pods/overview). Updated modern interface to interact with Runpod Pods.
From 647d0b548d02678aa3bae22bf47ab341e852dc9d Mon Sep 17 00:00:00 2001
From: Mo King
Date: Fri, 5 Dec 2025 09:22:55 -0500
Subject: [PATCH 4/7] Changelong -> release notes
---
docs.json | 6 +++---
changelog.mdx => release-notes.mdx | 4 ++--
2 files changed, 5 insertions(+), 5 deletions(-)
rename changelog.mdx => release-notes.mdx (98%)
diff --git a/docs.json b/docs.json
index 3fdd5e47..68970c66 100644
--- a/docs.json
+++ b/docs.json
@@ -466,12 +466,12 @@
]
},
{
- "tab": "Changelog",
+ "tab": "Release notes",
"groups": [
{
- "group": "Changelog",
+ "group": "Release notes",
"pages": [
- "changelog"
+ "release-notes"
]
}
]
diff --git a/changelog.mdx b/release-notes.mdx
similarity index 98%
rename from changelog.mdx
rename to release-notes.mdx
index e438d779..c5bf1be3 100644
--- a/changelog.mdx
+++ b/release-notes.mdx
@@ -1,11 +1,11 @@
---
title: "Product updates"
sidebarTitle: "Product updates"
-description: "New features, bug fixes, and improvements for Runpod"
+description: "New features, fixes, and improvements for the Runpod platform."
---
-## Slurm Clusters GA, cached models in beta, and new Public Endpoints
+## Slurm Clusters GA, cached models in beta, and Public Endpoints available
- [Slurm Clusters are now generally available](/instant-clusters/slurm-clusters) on Runpod. Deploy production-ready HPC clusters in seconds instead of hours or days. The clusters support multi-node performance for distributed training and large-scale simulations. Manage clusters from the web interface with pay-as-you-go billing and no idle costs.
- [Cached models are now in beta](/serverless/endpoints/model-caching). When enabled, this feature eliminates model download times when starting workers. Instead of embedding large models in Docker images or downloading them at startup, cached models are placed on host machines before workers start. When you specify a model from Hugging Face, the scheduler prioritizes hosts with your model already cached. If the model is cached, workers start instantly. If not, the system downloads the model before your worker starts billing.
From 840280ad4bf22f9d13b2d3f52894727767d951fb Mon Sep 17 00:00:00 2001
From: Mo King
Date: Fri, 5 Dec 2025 10:28:10 -0500
Subject: [PATCH 5/7] Improve writing & consistency for release notes
---
release-notes.mdx | 245 +++++++++++++++++++++++++++-------------------
1 file changed, 144 insertions(+), 101 deletions(-)
diff --git a/release-notes.mdx b/release-notes.mdx
index c5bf1be3..3b5a9e0f 100644
--- a/release-notes.mdx
+++ b/release-notes.mdx
@@ -5,188 +5,231 @@ description: "New features, fixes, and improvements for the Runpod platform."
---
-## Slurm Clusters GA, cached models in beta, and Public Endpoints available
+## Slurm Clusters GA, cached models in beta, and new Public Endpoints available
+
+- [Slurm Clusters are now generally available](/instant-clusters/slurm-clusters): Deploy production-ready HPC clusters in seconds. These clusters support multi-node performance for distributed training and large-scale simulations with pay-as-you-go billing and no idle costs.
+- [Cached models are now in beta](/serverless/endpoints/model-caching): Eliminate model download times when starting workers. The system places cached models on host machines before workers start, prioritizing hosts with your model already available for instant startup.
+- [New Public Endpoints available](/hub/public-endpoints): Wan 2.5 combines image and audio to create lifelike videos, while Nano Banana merges multiple images for composite creations.
-- [Slurm Clusters are now generally available](/instant-clusters/slurm-clusters) on Runpod. Deploy production-ready HPC clusters in seconds instead of hours or days. The clusters support multi-node performance for distributed training and large-scale simulations. Manage clusters from the web interface with pay-as-you-go billing and no idle costs.
-- [Cached models are now in beta](/serverless/endpoints/model-caching). When enabled, this feature eliminates model download times when starting workers. Instead of embedding large models in Docker images or downloading them at startup, cached models are placed on host machines before workers start. When you specify a model from Hugging Face, the scheduler prioritizes hosts with your model already cached. If the model is cached, workers start instantly. If not, the system downloads the model before your worker starts billing.
-- [New Public Endpoints](/hub/public-endpoints) are now available: Wan 2.5 combines an image and audio to create lifelike videos, and Nano Banana combines multiple images to create composite images for product placement, character development, and more.
-## Hub revenue share for maintainers and new Pods UX
+## Hub revenue sharing launches and Pods UI gets refreshed
+
+- [Hub revenue share model](/hub/revenue-sharing): Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue through monthly tiers with credits auto-deposited into your account.
+- [Pods UI updated](/pods/overview): Refreshed modern interface for interacting with Runpod Pods.
-- [Runpod Hub revenue share model](/hub/revenue-sharing). Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue via monthly tiers (1/3/5/7%). Credits are auto-deposited into your account.
-- [Pods UI updated](/pods/overview). Updated modern interface to interact with Runpod Pods.
-## Public Endpoints and one-click Slurm clusters
+## Public Endpoints arrive, Slurm Clusters in beta
+
+- [Public Endpoints](/hub/public-endpoints): Access state-of-the-art AI models through simple API calls with an integrated playground. Available endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, and Minimax Speech.
+- [Slurm Clusters (beta)](/instant-clusters/slurm-clusters): Create on-demand multi-node clusters instantly with full Slurm scheduling support.
-- [Public Endpoints](/hub/public-endpoints). Instant access to state-of-the-art AI models through simple API calls. An API playground is available through the Runpod Hub. Available public endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, Minimax Speech, and others.
-- [Slurm Clusters (beta)](/instant-clusters/slurm-clusters). Create on-demand multi-node clusters instantly with full Slurm scheduling support.
-## Upload and retrieve files without compute and updated referral program
+## S3-compatible storage and updated referral program
+
+- [S3-compatible API for network volumes](/storage/s3-api): Upload and retrieve files from your network volumes without compute using AWS S3 CLI or Boto3. Integrate Runpod storage into any AI pipeline with zero-config ease and object-level control.
+- [Referral program revamp](/referrals): Updated rewards and tiers with clearer dashboards to track performance.
-- [S3 API](/storage/s3-api). Upload and retrieve files without compute. Use AWS S3 CLI or Boto3 with zero-config ease. Integrate Runpod storage into any AI pipeline with no rewrites. Manage data with object-level control for teams, models, and apps.
-- [Referrals v2](/referrals). Updated rewards and tiers with clearer dashboards to track performance.
-## UX polish, further price relief, and a marketplace with new Python library
+## Port labeling, price drops, Runpod Hub, and Tetra beta test
+
+- [Port labeling](/pods/overview): Name exposed ports in the UI and API to help team members identify services like Jupyter or TensorBoard.
+- [Price drops](/pods/pricing): Additional price reductions on popular GPU SKUs to lower training and inference costs.
+- [Runpod Hub](/hub/overview): A curated catalog of one-click endpoints and templates for deploying community projects without starting from scratch.
+- **Tetra beta test**: A Python library for running code on GPU with Runpod. Add a `@remote()` decorator to functions that need GPU power while the rest of your code runs locally.
-- [Port labeling](/pods/overview). Name exposed ports in the UI and API for clearer collaboration. For example, label ports as "Jupyter" or "TensorBoard" to help team members identify what each port is for.
-- [Price drops](/pods/pricing). Additional price reductions on popular GPU SKUs to lower training and inference costs.
-- [Runpod Serverless Hub](/hub/overview). A curated marketplace of one-click endpoints and templates. Fork and deploy community projects without starting from scratch.
-- **Tetra beta test**. A Python library for running parts of your code on a GPU using Runpod. Add a `@remote()` decorator to functions that need GPU power, and Tetra handles offloading the work to Runpod while the rest of your code runs locally.
-## SSO convenience and wider global mesh
+## GitHub login, RTX 5090s, and global networking expansion
+
+- **Login with GitHub**: OAuth sign-in and linking for faster onboarding and repo-driven workflows.
+- **RTX 5090s on Runpod**: High-performance RTX 5090 availability for cost-efficient training and inference.
+- [Global networking expansion](/pods/networking): Rollout to additional data centers approaching full global coverage.
-- **Login with GitHub.** OAuth sign-in and linking for faster onboarding and repo-driven workflows.
-- **RTX 5090s on Runpod.** RTX 5090 availability for high performance and cost-efficiency on training and inference.
-- **Global networking expansion.** Rollout to many additional data centers approaching full global coverage.
-## Enterprise features: compliance, APIs, clusters, bare metal, and APAC expansion
+## Enterprise features arrive, REST API goes GA, Instant Clusters in beta, and APAC expansion
+
+- [CPU Pods get network storage access](/storage/network-volumes): GA support for network volumes on CPU Pods for persistent, shareable storage.
+- **SOC 2 Type I certification**: Independent attestation of security controls for enterprise readiness.
+- [REST API release](/api-reference/overview): REST API GA with broad resource coverage for full infrastructure-as-code workflows.
+- [Instant Clusters](/instant-clusters/overview): Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
+- **Bare metal**: Reserve dedicated GPU servers for maximum control, performance, and long-term savings.
+- **AP-JP-1**: New Fukushima region for low-latency APAC access and in-country data residency.
-- **CPU Pods get network storage access.** GA support for network volumes on CPU Pods for persistent, shareable storage.
-- **SOC 2 Type I certification.** Independent attestation of security controls for enterprise readiness.
-- [REST API release](/api-reference/overview). REST API GA with broad resource coverage for full IaC workflows.
-- [Instant Clusters](/instant-clusters/overview). Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
-- **Bare metal.** Reserve dedicated GPU servers for maximum control, performance, and long-term savings.
-- **AP-JP-1.** New Fukushima region for low-latency APAC access and in-country data residency.
-## Modern API surface in beta and stronger community investment
-- [REST API beta test](/api-reference/overview). RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
-- [Full time community manager hire](/community/community-manager). Dedicated programs, content, and faster community response.
-- [Serverless GitHub integration release](/serverless/workers/github-integration). GA for GitHub-based Serverless deploys with production-ready stability.
+## REST API enters beta with full-time community manager
+
+- [REST API beta test](/api-reference/overview): RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
+- [Full-time community manager hire](/community/community-manager): Dedicated programs, content, and faster community response.
+- [Serverless GitHub integration release](/serverless/workers/github-integration): GA for GitHub-based Serverless deploys with production-ready stability.
+
-## New silicon options and LLM-centric Serverless upgrades
-- **CPU Pods v2.** Docker runtime parity with GPU Pods for faster starts. Adds network volume support.
-- **H200s on Runpod.** NVIDIA H200 GPUs available for larger models and higher memory bandwidth.
-- **Serverless upgrades.** Higher GPU counts per worker, new quick-deploy runtimes, and simpler model selection.
+## New silicon and LLM-focused Serverless upgrades
+
+- **CPU Pods v2**: Docker runtime parity with GPU Pods for faster starts with network volume support.
+- [H200s on Runpod](/references/gpu-types): NVIDIA H200 GPUs available for larger models and higher memory bandwidth.
+- [Serverless upgrades](/serverless/overview): Higher GPU counts per worker, new quick-deploy runtimes, and simpler model selection.
+
-## Global networking rollout continues and GitHub deploys arrive in beta
-- **Global networking added to CA-MTL-3, US-GA-1, US-GA-2, US-KS-2.** Expanded data center coverage for the private mesh.
-- **Serverless GitHub integration beta test.** Deploy endpoints directly from GitHub repos with automatic builds.
-- **Scoped API keys.** Least-privilege tokens with fine-grained scopes and expirations for safer automation.
-- **Passkey auth.** Passwordless WebAuthn sign-in for phishing-resistant account access.
+## Global networking expands and GitHub deploys enter beta
+
+- [Global networking expansion](/pods/networking): Added to CA-MTL-3, US-GA-1, US-GA-2, and US-KS-2 for expanded private mesh coverage.
+- [Serverless GitHub integration beta test](/serverless/workers/github-integration): Deploy endpoints directly from GitHub repos with automatic builds.
+- **Scoped API keys**: Least-privilege tokens with fine-grained scopes and expirations for safer automation.
+- **Passkey auth**: Passwordless WebAuthn sign-in for phishing-resistant account access.
+
-## More storage coverage and private cross-DC connectivity
-- **US-GA-2 added to network storage.** Enable network volumes in US-GA-2.
-- **Global networking.** Private cross-data-center networking with internal DNS for secure service-to-service traffic.
+## Storage expansion and private cross-data-center connectivity
+
+- [US-GA-2 added to network storage](/storage/network-volumes): Enable network volumes in US-GA-2.
+- [Global networking](/pods/networking): Private cross-data-center networking with internal DNS for secure service-to-service traffic.
+
-## Storage coverage grows, major price cuts, and revamped referrals
-- **US-TX-3 and EUR-IS-1 added to network storage.** Network volumes available in more regions for local persistence.
-- **Runpod slashes GPU prices.** Broad GPU price reductions to lower training and inference TCO.
-- **Referral program revamp.** Updated commissions and bonuses with an affiliate tier and improved tracking.
+## Storage coverage grows with major price cuts and revamped referrals
+
+- **US-TX-3 and EUR-IS-1 added to network storage**: Network volumes available in more regions for local persistence.
+- **Runpod slashes GPU prices**: Broad GPU price reductions to lower training and inference total cost of ownership.
+- [Referral program revamp](/referrals): Updated commissions and bonuses with an affiliate tier and improved tracking.
+
-## $20M seed round, community event, and broader Serverless and accelerator options
-- **$20M seed by Intel Capital and Dell Technologies Capital.** Funds infrastructure expansion and product acceleration.
-- **First in person hackathon.** Community projects, workshops, and real-world feedback.
-- **Serverless CPU Pods.** Scale-to-zero CPU endpoints for services that don't need a GPU.
-- **AMD GPUs.** AMD ROCm-compatible GPU SKUs as cost and performance alternatives to NVIDIA.
+## $20M seed round, community event, and broader Serverless options
+
+- **$20M seed by Intel Capital and Dell Technologies Capital**: Funds infrastructure expansion and product acceleration.
+- **First in-person hackathon**: Community projects, workshops, and real-world feedback.
+- [Serverless CPU Pods](/references/cpu-types): Scale-to-zero CPU endpoints for services that don't need a GPU.
+- [AMD GPUs](/references/gpu-types): AMD ROCm-compatible GPU SKUs as cost and performance alternatives to NVIDIA.
+
-## Compute beyond GPUs and first-class automation tooling
-- **CPU Pods.** CPU-only instances with the same networking and storage primitives for cheaper non-GPU stages.
-- [runpodctl](/runpodctl/overview). Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows.
+## CPU compute and first-class automation tooling
+
+- **CPU Pods**: CPU-only instances with the same networking and storage primitives for cheaper non-GPU stages.
+- [runpodctl](/runpodctl/overview): Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows.
+
## Console navigation overhaul and documentation refresh
-- **New navigational changes to Runpod UI.** Consolidated menus, consistent action placement, and fewer clicks for common tasks.
-- **Docs revamp.** New information architecture, improved search, and more runnable examples and quickstarts.
-- **Zhen AMA.** Roadmap Q&A and community feedback session.
+
+- **New navigational changes to Runpod UI**: Consolidated menus, consistent action placement, and fewer clicks for common tasks.
+- **Docs revamp**: New information architecture, improved search, and more runnable examples and quickstarts.
+- **Zhen AMA**: Roadmap Q&A and community feedback session.
+
-## New regions and investment in community and support
-- **US-OR-1.** Additional US region for lower latency and more capacity in the Pacific Northwest.
-- **CA-MTL-1.** New Canadian region to improve latency and in-country data needs.
-- **Our first community manager hire.** Dedicated community programs and faster feedback loops.
-- **Building out the support team.** Expanded coverage and expertise for complex issues.
+## New regions and investment in community support
+
+- **US-OR-1**: Additional US region for lower latency and more capacity in the Pacific Northwest.
+- **CA-MTL-1**: New Canadian region to improve latency and meet in-country data needs.
+- **First community manager hire**: Dedicated community programs and faster feedback loops.
+- **Building out the support team**: Expanded coverage and expertise for complex issues.
+
-## Faster starts from templates and better multi-region hygiene
-- **Serverless quick deploy.** One-click deploy of curated model templates with sensible defaults.
-- **EU domain for Serverless.** EU-specific domain briefly offered for data residency, superseded by other region controls.
-- **Data-center filter for Serverless.** Filter and manage endpoints by region for multi-region fleets.
+## Faster template starts and better multi-region hygiene
+
+- **Serverless quick deploy**: One-click deploy of curated model templates with sensible defaults.
+- **EU domain for Serverless**: EU-specific domain briefly offered for data residency, superseded by other region controls.
+- **Data-center filter for Serverless**: Filter and manage endpoints by region for multi-region fleets.
+
## Self-service upgrades, clearer metrics, new pricing model, and cost visibility
-- **Self service worker upgrade.** Rebuild and roll workers from the dashboard without support tickets.
-- **Edit template from endpoint page.** Inline edit and redeploy the underlying template directly from the endpoint view.
-- **Improved Serverless metrics page.** Refinements to charts and filters for quicker root-cause analysis.
-- **Flex and active workers.** Discounted always-on "Active" capacity for baseline load. Burst with on-demand "Flex" workers.
-- **Billing explorer.** Inspect costs by resource, region, and time to identify optimization opportunities.
+
+- **Self-service worker upgrade**: Rebuild and roll workers from the dashboard without support tickets.
+- **Edit template from endpoint page**: Inline edit and redeploy the underlying template directly from the endpoint view.
+- **Improved Serverless metrics page**: Refinements to charts and filters for quicker root-cause analysis.
+- [Flex and active workers](/serverless/pricing): Discounted always-on "active" capacity for baseline load with on-demand "flex" workers for bursts.
+- **Billing explorer**: Inspect costs by resource, region, and time to identify optimization opportunities.
+
-## Team governance, storage expansion, and better debugging and health
-- **Teams.** Organization workspaces with role-based access control for Pods, endpoints, and billing.
-- **Savings plans.** Plans surfaced prominently in console with easier purchase and management for steady usage.
-- **Network storage to US-KS-1.** Enable network volumes in US-KS-1 for local, persistent data workflows.
-- **Serverless log view.** Stream worker stdout and stderr in the UI and API for real-time debugging.
-- **Serverless health endpoint.** Lightweight /health probe returning endpoint and worker status without creating a billable job.
-- **SOC 2 Type II compliant.** Security and compliance certification for enterprise customers.
+## Team governance, storage expansion, and better debugging
+
+- [Teams](/get-started/manage-accounts): Organization workspaces with role-based access control for Pods, endpoints, and billing.
+- [Savings plans](/pods/pricing): Plans surfaced prominently in console with easier purchase and management for steady usage.
+- **Network storage to US-KS-1**: Enable network volumes in US-KS-1 for local, persistent data workflows.
+- [Serverless log view](/serverless/development/logs): Stream worker stdout and stderr in the UI and API for real-time debugging.
+- **Serverless health endpoint**: Lightweight /health probe returning endpoint and worker status without creating a billable job.
+- **SOC 2 Type II compliant**: Security and compliance certification for enterprise customers.
+
## Observability, top-tier GPUs, and commitment-based savings
-- **Serverless metrics page.** Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning.
-- **H100 on Runpod.** NVIDIA H100 instances for higher throughput and larger model footprints.
-- **Savings plans.** Commitment-based discounts for predictable workloads to lower effective hourly rates.
+
+- **Serverless metrics page**: Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning.
+- **H100 on Runpod](/references/gpu-types): NVIDIA H100 instances for higher throughput and larger model footprints.
+- [Savings plans](/pods/pricing): Commitment-based discounts for predictable workloads to lower effective hourly rates.
+
## Smoother auth and multi-region Serverless with persistent storage
-- **The new and improved Runpod login experience.** Streamlined sign-in and team access for faster, more consistent auth flows.
-- **Network volumes added to Serverless.** Attach persistent storage to Serverless workers to retain models and artifacts across restarts and speed cold starts via caching.
-- **Serverless region support.** Pin or allow specific regions for endpoints to reduce latency and meet data-residency needs.
+
+- **The new and improved Runpod login experience**: Streamlined sign-in and team access for faster, more consistent auth flows.
+- [Network volumes added to Serverless](/storage/network-volumes): Attach persistent storage to Serverless workers to retain models and artifacts across restarts and speed cold starts through caching.
+- **Serverless region support**: Pin or allow specific regions for endpoints to reduce latency and meet data-residency needs.
+
## Deeper autoscaling controls, richer metrics, persistent storage, and job cancellation
-- **Serverless scaling strategies.** Scale by queue delay and/or concurrency with min/max worker bounds to balance latency and cost.
-- **Queue delay.** Expose time-in-queue as a first-class metric to drive autoscaling and SLO monitoring.
-- **Request count.** Track success and failure totals over windows for quick health checks and alerting.
-- **runsync.** Synchronous invocation path that returns results in the same HTTP call for short-running jobs.
-- **Network storage beta.** Region-scoped, attachable volumes shareable across Pods and endpoints for model caches and datasets.
-- **Job cancel API.** Programmatically terminate queued or running jobs to free capacity and enforce client timeouts.
+
+- **Serverless scaling strategies**: Scale by queue delay and/or concurrency with min/max worker bounds to balance latency and cost.
+- **Queue delay**: Expose time-in-queue as a first-class metric to drive autoscaling and SLO monitoring.
+- **Request count**: Track success and failure totals over windows for quick health checks and alerting.
+- **runsync**: Synchronous invocation path that returns results in the same HTTP call for short-running jobs.
+- **Network storage beta**: Region-scoped, attachable volumes shareable across Pods and endpoints for model caches and datasets.
+- **Job cancel API**: Programmatically terminate queued or running jobs to free capacity and enforce client timeouts.
+
-## Serverless platform hardens with a cleaner, more capable API
-- **Serverless API v2.** Revised request and response schema with improved error semantics. New endpoints provide better control over job lifecycle and observability.
+## Serverless platform hardens with cleaner API
+
+- **Serverless API v2**: Revised request and response schema with improved error semantics and new endpoints for better control over job lifecycle and observability.
+
-## Better control over notifications and GPU allocation during contention
-- **Notification preferences.** Configure which platform events trigger alerts to reduce noise for teams and CI systems.
-- **GPU priorities.** Influence scheduling by marking workloads as higher priority to reduce queue time for critical jobs.
+## Better control over notifications and GPU allocation
+
+- **Notification preferences**: Configure which platform events trigger alerts to reduce noise for teams and CI systems.
+- **GPU priorities**: Influence scheduling by marking workloads as higher priority to reduce queue time for critical jobs.
+
-## Security-first release enabling encryption for persistent data
-- **Runpod now offers encrypted volumes.** Enable at-rest encryption for persistent volumes with no application changes required. Keys are platform-managed, and encrypted volumes mount like standard volumes.
-
+## Encrypted volumes for persistent data
+
+- **Runpod now offers encrypted volumes**: Enable at-rest encryption for persistent volumes with no application changes required using platform-managed keys.
+
+
\ No newline at end of file
From a438c2c50578ea2d586eda5a1c6b46270fc1c07e Mon Sep 17 00:00:00 2001
From: Mo King
Date: Fri, 5 Dec 2025 10:41:25 -0500
Subject: [PATCH 6/7] Fix broken links
---
containers.mdx | 2 +-
pods/manage-pods.mdx | 2 +-
pods/overview.mdx | 2 +-
release-notes.mdx | 8 ++++----
runpodctl/reference/runpodctl-ssh-add-key.mdx | 1 -
serverless/endpoints/manage-endpoints.mdx | 2 +-
serverless/workers/overview.mdx | 2 +-
7 files changed, 9 insertions(+), 10 deletions(-)
diff --git a/containers.mdx b/containers.mdx
index 0d98be7d..deac9ed2 100644
--- a/containers.mdx
+++ b/containers.mdx
@@ -2,7 +2,7 @@
title: "Containers"
---
-
+
## 📄️ Overview
Learn how to build and deploy applications on the Runpod platform with this set of tutorials, covering tools, technologies, and deployment methods, including Containers, Docker, and Serverless implementation.
diff --git a/pods/manage-pods.mdx b/pods/manage-pods.mdx
index cab26478..cf349f88 100644
--- a/pods/manage-pods.mdx
+++ b/pods/manage-pods.mdx
@@ -149,7 +149,7 @@ With custom templates, you can:
* Install specific dependencies and packages.
* Configure your development environment.
-* Create [portable Docker images](/tutorials/introduction/containers/overview) that work consistently across deployments.
+* Create [portable Docker images](/tutorials/introduction/containers) that work consistently across deployments.
* Share environments with team members for collaborative work.
## Stop a Pod
diff --git a/pods/overview.mdx b/pods/overview.mdx
index 57726fd1..0d5263cd 100644
--- a/pods/overview.mdx
+++ b/pods/overview.mdx
@@ -42,7 +42,7 @@ Every Pod comes with a resizable **container disk** that houses the operating sy
**Volume disks** provide persistent storage that is preserved throughout the Pod's lease, functioning like a dedicated hard drive. Data stored in the volume disk directory (`/workspace` by default) persists when you stop the Pod, but is erased when the Pod is deleted.
-Optional [network volumes](/pods/storage/network-volumes) provide more flexible permanent storage that can be transferred between Pods, replacing the volume disk when attached. When using a Pod with network volume attached, you can safely delete your Pod without losing the data stored in your network volume directory (`/workspace` by default).
+Optional [network volumes](/storage/network-volumes) provide more flexible permanent storage that can be transferred between Pods, replacing the volume disk when attached. When using a Pod with network volume attached, you can safely delete your Pod without losing the data stored in your network volume directory (`/workspace` by default).
To learn more, see [Storage options](/pods/storage/types).
diff --git a/release-notes.mdx b/release-notes.mdx
index 3b5a9e0f..7da4c5b2 100644
--- a/release-notes.mdx
+++ b/release-notes.mdx
@@ -33,7 +33,7 @@ description: "New features, fixes, and improvements for the Runpod platform."
## S3-compatible storage and updated referral program
- [S3-compatible API for network volumes](/storage/s3-api): Upload and retrieve files from your network volumes without compute using AWS S3 CLI or Boto3. Integrate Runpod storage into any AI pipeline with zero-config ease and object-level control.
-- [Referral program revamp](/referrals): Updated rewards and tiers with clearer dashboards to track performance.
+- [Referral program revamp](/references/referrals): Updated rewards and tiers with clearer dashboards to track performance.
@@ -62,7 +62,7 @@ description: "New features, fixes, and improvements for the Runpod platform."
- [CPU Pods get network storage access](/storage/network-volumes): GA support for network volumes on CPU Pods for persistent, shareable storage.
- **SOC 2 Type I certification**: Independent attestation of security controls for enterprise readiness.
- [REST API release](/api-reference/overview): REST API GA with broad resource coverage for full infrastructure-as-code workflows.
-- [Instant Clusters](/instant-clusters/overview): Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
+- [Instant Clusters](/instant-clusters): Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing.
- **Bare metal**: Reserve dedicated GPU servers for maximum control, performance, and long-term savings.
- **AP-JP-1**: New Fukushima region for low-latency APAC access and in-country data residency.
@@ -72,7 +72,7 @@ description: "New features, fixes, and improvements for the Runpod platform."
## REST API enters beta with full-time community manager
- [REST API beta test](/api-reference/overview): RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL.
-- [Full-time community manager hire](/community/community-manager): Dedicated programs, content, and faster community response.
+- **Full-time community manager hire**: Dedicated programs, content, and faster community response.
- [Serverless GitHub integration release](/serverless/workers/github-integration): GA for GitHub-based Serverless deploys with production-ready stability.
@@ -109,7 +109,7 @@ description: "New features, fixes, and improvements for the Runpod platform."
- **US-TX-3 and EUR-IS-1 added to network storage**: Network volumes available in more regions for local persistence.
- **Runpod slashes GPU prices**: Broad GPU price reductions to lower training and inference total cost of ownership.
-- [Referral program revamp](/referrals): Updated commissions and bonuses with an affiliate tier and improved tracking.
+- [Referral program revamp](/references/referrals): Updated commissions and bonuses with an affiliate tier and improved tracking.
diff --git a/runpodctl/reference/runpodctl-ssh-add-key.mdx b/runpodctl/reference/runpodctl-ssh-add-key.mdx
index 0af48ef2..b5c42997 100644
--- a/runpodctl/reference/runpodctl-ssh-add-key.mdx
+++ b/runpodctl/reference/runpodctl-ssh-add-key.mdx
@@ -31,5 +31,4 @@ The path to a file containing the SSH public key to add. This is typically a `.p
## Related commands
-- [`runpodctl ssh`](/runpodctl/reference/runpodctl-ssh)
- [`runpodctl ssh list-keys`](/runpodctl/reference/runpodctl-ssh-list-keys)
diff --git a/serverless/endpoints/manage-endpoints.mdx b/serverless/endpoints/manage-endpoints.mdx
index 5e545b81..e430218d 100644
--- a/serverless/endpoints/manage-endpoints.mdx
+++ b/serverless/endpoints/manage-endpoints.mdx
@@ -21,7 +21,7 @@ To create a new Serverless endpoint through the Runpod web interface:
* **Endpoint Name**: The display name for your endpoint in the console.
* **Endpoint Type**: Select **Queue** for traditional queue-based processing or **Load balancer** for direct HTTP access (see [Load balancing endpoints](/serverless/load-balancing/overview) for details).
* **GPU Configuration**: Select the appropriate GPU types and configure worker settings.
- * **Model (optional)**: Enter a model URL from Hugging Face to optimize worker startup times. See [Pre-cached models](/storage/model-caching) for details.
+ * **Model (optional)**: Enter a model URL from Hugging Face to optimize worker startup times. See [Pre-cached models](/serverless/endpoints/model-caching) for details.
* **Container Configuration**: Edit the container start command, specify the [container disk size](/serverless/storage/overview), and expose HTTP/TCP ports.
* **Environment Variables**: Add [environment variables](/serverless/development/environment-variables) for your worker containers.
6. Click **Create Endpoint** to deploy.
diff --git a/serverless/workers/overview.mdx b/serverless/workers/overview.mdx
index 29f428b8..75b4e7ba 100644
--- a/serverless/workers/overview.mdx
+++ b/serverless/workers/overview.mdx
@@ -52,7 +52,7 @@ You can view the state of your workers using the **Workers** tab of a Serverless
## Debugging workers
-To debug issues in production, you can access [worker logs](/serverless/development/logs) and [SSH directly into running workers](/serverless/workers/ssh-into-workers) to inspect file systems and environment variables in real-time.
+To debug issues in production, you can access [worker logs](/serverless/development/logs) and [SSH directly into running workers](/serverless/development/ssh-into-workers) to inspect file systems and environment variables in real-time.
## Max worker limit
From 3aeb9aada2cafd4adf8c41517472f01613458195 Mon Sep 17 00:00:00 2001
From: Mo King
Date: Sat, 6 Dec 2025 22:32:50 -0500
Subject: [PATCH 7/7] Fix typo in release notes
---
release-notes.mdx | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/release-notes.mdx b/release-notes.mdx
index 7da4c5b2..79c8354e 100644
--- a/release-notes.mdx
+++ b/release-notes.mdx
@@ -186,7 +186,7 @@ description: "New features, fixes, and improvements for the Runpod platform."
## Observability, top-tier GPUs, and commitment-based savings
- **Serverless metrics page**: Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning.
-- **H100 on Runpod](/references/gpu-types): NVIDIA H100 instances for higher throughput and larger model footprints.
+- [H100s on Runpod](/references/gpu-types): NVIDIA H100 instances for higher throughput and larger model footprints.
- [Savings plans](/pods/pricing): Commitment-based discounts for predictable workloads to lower effective hourly rates.