diff --git a/containers.mdx b/containers.mdx index 0d98be7d..deac9ed2 100644 --- a/containers.mdx +++ b/containers.mdx @@ -2,7 +2,7 @@ title: "Containers" --- - + ## 📄️ Overview Learn how to build and deploy applications on the Runpod platform with this set of tutorials, covering tools, technologies, and deployment methods, including Containers, Docker, and Serverless implementation. diff --git a/docs.json b/docs.json index c54029ab..7fbb72d2 100644 --- a/docs.json +++ b/docs.json @@ -465,6 +465,17 @@ ] } ] + }, + { + "tab": "Release notes", + "groups": [ + { + "group": "Release notes", + "pages": [ + "release-notes" + ] + } + ] } ] }, diff --git a/pods/manage-pods.mdx b/pods/manage-pods.mdx index cab26478..cf349f88 100644 --- a/pods/manage-pods.mdx +++ b/pods/manage-pods.mdx @@ -149,7 +149,7 @@ With custom templates, you can: * Install specific dependencies and packages. * Configure your development environment. -* Create [portable Docker images](/tutorials/introduction/containers/overview) that work consistently across deployments. +* Create [portable Docker images](/tutorials/introduction/containers) that work consistently across deployments. * Share environments with team members for collaborative work. ## Stop a Pod diff --git a/pods/overview.mdx b/pods/overview.mdx index 57726fd1..0d5263cd 100644 --- a/pods/overview.mdx +++ b/pods/overview.mdx @@ -42,7 +42,7 @@ Every Pod comes with a resizable **container disk** that houses the operating sy **Volume disks** provide persistent storage that is preserved throughout the Pod's lease, functioning like a dedicated hard drive. Data stored in the volume disk directory (`/workspace` by default) persists when you stop the Pod, but is erased when the Pod is deleted. -Optional [network volumes](/pods/storage/network-volumes) provide more flexible permanent storage that can be transferred between Pods, replacing the volume disk when attached. When using a Pod with network volume attached, you can safely delete your Pod without losing the data stored in your network volume directory (`/workspace` by default). +Optional [network volumes](/storage/network-volumes) provide more flexible permanent storage that can be transferred between Pods, replacing the volume disk when attached. When using a Pod with network volume attached, you can safely delete your Pod without losing the data stored in your network volume directory (`/workspace` by default). To learn more, see [Storage options](/pods/storage/types). diff --git a/release-notes.mdx b/release-notes.mdx new file mode 100644 index 00000000..79c8354e --- /dev/null +++ b/release-notes.mdx @@ -0,0 +1,235 @@ +--- +title: "Product updates" +sidebarTitle: "Product updates" +description: "New features, fixes, and improvements for the Runpod platform." +--- + + +## Slurm Clusters GA, cached models in beta, and new Public Endpoints available + +- [Slurm Clusters are now generally available](/instant-clusters/slurm-clusters): Deploy production-ready HPC clusters in seconds. These clusters support multi-node performance for distributed training and large-scale simulations with pay-as-you-go billing and no idle costs. +- [Cached models are now in beta](/serverless/endpoints/model-caching): Eliminate model download times when starting workers. The system places cached models on host machines before workers start, prioritizing hosts with your model already available for instant startup. +- [New Public Endpoints available](/hub/public-endpoints): Wan 2.5 combines image and audio to create lifelike videos, while Nano Banana merges multiple images for composite creations. + + + + +## Hub revenue sharing launches and Pods UI gets refreshed + +- [Hub revenue share model](/hub/revenue-sharing): Publish to the Runpod Hub and earn credits when others deploy your repo. Earn up to 7% of compute revenue through monthly tiers with credits auto-deposited into your account. +- [Pods UI updated](/pods/overview): Refreshed modern interface for interacting with Runpod Pods. + + + + +## Public Endpoints arrive, Slurm Clusters in beta + +- [Public Endpoints](/hub/public-endpoints): Access state-of-the-art AI models through simple API calls with an integrated playground. Available endpoints include Whisper-V3-Large, Seedance 1.0 pro, Seedream 3.0, Qwen Image Edit, FLUX.1 Kontext, Deep Cogito v2 Llama 70B, and Minimax Speech. +- [Slurm Clusters (beta)](/instant-clusters/slurm-clusters): Create on-demand multi-node clusters instantly with full Slurm scheduling support. + + + + +## S3-compatible storage and updated referral program + +- [S3-compatible API for network volumes](/storage/s3-api): Upload and retrieve files from your network volumes without compute using AWS S3 CLI or Boto3. Integrate Runpod storage into any AI pipeline with zero-config ease and object-level control. +- [Referral program revamp](/references/referrals): Updated rewards and tiers with clearer dashboards to track performance. + + + + +## Port labeling, price drops, Runpod Hub, and Tetra beta test + +- [Port labeling](/pods/overview): Name exposed ports in the UI and API to help team members identify services like Jupyter or TensorBoard. +- [Price drops](/pods/pricing): Additional price reductions on popular GPU SKUs to lower training and inference costs. +- [Runpod Hub](/hub/overview): A curated catalog of one-click endpoints and templates for deploying community projects without starting from scratch. +- **Tetra beta test**: A Python library for running code on GPU with Runpod. Add a `@remote()` decorator to functions that need GPU power while the rest of your code runs locally. + + + + +## GitHub login, RTX 5090s, and global networking expansion + +- **Login with GitHub**: OAuth sign-in and linking for faster onboarding and repo-driven workflows. +- **RTX 5090s on Runpod**: High-performance RTX 5090 availability for cost-efficient training and inference. +- [Global networking expansion](/pods/networking): Rollout to additional data centers approaching full global coverage. + + + + +## Enterprise features arrive, REST API goes GA, Instant Clusters in beta, and APAC expansion + +- [CPU Pods get network storage access](/storage/network-volumes): GA support for network volumes on CPU Pods for persistent, shareable storage. +- **SOC 2 Type I certification**: Independent attestation of security controls for enterprise readiness. +- [REST API release](/api-reference/overview): REST API GA with broad resource coverage for full infrastructure-as-code workflows. +- [Instant Clusters](/instant-clusters): Spin up multi-node GPU clusters in minutes with private interconnect and per-second billing. +- **Bare metal**: Reserve dedicated GPU servers for maximum control, performance, and long-term savings. +- **AP-JP-1**: New Fukushima region for low-latency APAC access and in-country data residency. + + + + +## REST API enters beta with full-time community manager + +- [REST API beta test](/api-reference/overview): RESTful endpoints for Pods, endpoints, and volumes for simpler automation than GraphQL. +- **Full-time community manager hire**: Dedicated programs, content, and faster community response. +- [Serverless GitHub integration release](/serverless/workers/github-integration): GA for GitHub-based Serverless deploys with production-ready stability. + + + + +## New silicon and LLM-focused Serverless upgrades + +- **CPU Pods v2**: Docker runtime parity with GPU Pods for faster starts with network volume support. +- [H200s on Runpod](/references/gpu-types): NVIDIA H200 GPUs available for larger models and higher memory bandwidth. +- [Serverless upgrades](/serverless/overview): Higher GPU counts per worker, new quick-deploy runtimes, and simpler model selection. + + + + +## Global networking expands and GitHub deploys enter beta + +- [Global networking expansion](/pods/networking): Added to CA-MTL-3, US-GA-1, US-GA-2, and US-KS-2 for expanded private mesh coverage. +- [Serverless GitHub integration beta test](/serverless/workers/github-integration): Deploy endpoints directly from GitHub repos with automatic builds. +- **Scoped API keys**: Least-privilege tokens with fine-grained scopes and expirations for safer automation. +- **Passkey auth**: Passwordless WebAuthn sign-in for phishing-resistant account access. + + + + +## Storage expansion and private cross-data-center connectivity + +- [US-GA-2 added to network storage](/storage/network-volumes): Enable network volumes in US-GA-2. +- [Global networking](/pods/networking): Private cross-data-center networking with internal DNS for secure service-to-service traffic. + + + + +## Storage coverage grows with major price cuts and revamped referrals + +- **US-TX-3 and EUR-IS-1 added to network storage**: Network volumes available in more regions for local persistence. +- **Runpod slashes GPU prices**: Broad GPU price reductions to lower training and inference total cost of ownership. +- [Referral program revamp](/references/referrals): Updated commissions and bonuses with an affiliate tier and improved tracking. + + + + +## $20M seed round, community event, and broader Serverless options + +- **$20M seed by Intel Capital and Dell Technologies Capital**: Funds infrastructure expansion and product acceleration. +- **First in-person hackathon**: Community projects, workshops, and real-world feedback. +- [Serverless CPU Pods](/references/cpu-types): Scale-to-zero CPU endpoints for services that don't need a GPU. +- [AMD GPUs](/references/gpu-types): AMD ROCm-compatible GPU SKUs as cost and performance alternatives to NVIDIA. + + + + +## CPU compute and first-class automation tooling + +- **CPU Pods**: CPU-only instances with the same networking and storage primitives for cheaper non-GPU stages. +- [runpodctl](/runpodctl/overview): Official CLI for Pods, endpoints, and volumes to enable scripting and CI/CD workflows. + + + + +## Console navigation overhaul and documentation refresh + +- **New navigational changes to Runpod UI**: Consolidated menus, consistent action placement, and fewer clicks for common tasks. +- **Docs revamp**: New information architecture, improved search, and more runnable examples and quickstarts. +- **Zhen AMA**: Roadmap Q&A and community feedback session. + + + + +## New regions and investment in community support + +- **US-OR-1**: Additional US region for lower latency and more capacity in the Pacific Northwest. +- **CA-MTL-1**: New Canadian region to improve latency and meet in-country data needs. +- **First community manager hire**: Dedicated community programs and faster feedback loops. +- **Building out the support team**: Expanded coverage and expertise for complex issues. + + + + +## Faster template starts and better multi-region hygiene + +- **Serverless quick deploy**: One-click deploy of curated model templates with sensible defaults. +- **EU domain for Serverless**: EU-specific domain briefly offered for data residency, superseded by other region controls. +- **Data-center filter for Serverless**: Filter and manage endpoints by region for multi-region fleets. + + + + +## Self-service upgrades, clearer metrics, new pricing model, and cost visibility + +- **Self-service worker upgrade**: Rebuild and roll workers from the dashboard without support tickets. +- **Edit template from endpoint page**: Inline edit and redeploy the underlying template directly from the endpoint view. +- **Improved Serverless metrics page**: Refinements to charts and filters for quicker root-cause analysis. +- [Flex and active workers](/serverless/pricing): Discounted always-on "active" capacity for baseline load with on-demand "flex" workers for bursts. +- **Billing explorer**: Inspect costs by resource, region, and time to identify optimization opportunities. + + + + +## Team governance, storage expansion, and better debugging + +- [Teams](/get-started/manage-accounts): Organization workspaces with role-based access control for Pods, endpoints, and billing. +- [Savings plans](/pods/pricing): Plans surfaced prominently in console with easier purchase and management for steady usage. +- **Network storage to US-KS-1**: Enable network volumes in US-KS-1 for local, persistent data workflows. +- [Serverless log view](/serverless/development/logs): Stream worker stdout and stderr in the UI and API for real-time debugging. +- **Serverless health endpoint**: Lightweight /health probe returning endpoint and worker status without creating a billable job. +- **SOC 2 Type II compliant**: Security and compliance certification for enterprise customers. + + + + +## Observability, top-tier GPUs, and commitment-based savings + +- **Serverless metrics page**: Time-series charts for pXX latencies, queue delay, throughput, and worker states for faster debugging and tuning. +- [H100s on Runpod](/references/gpu-types): NVIDIA H100 instances for higher throughput and larger model footprints. +- [Savings plans](/pods/pricing): Commitment-based discounts for predictable workloads to lower effective hourly rates. + + + + +## Smoother auth and multi-region Serverless with persistent storage + +- **The new and improved Runpod login experience**: Streamlined sign-in and team access for faster, more consistent auth flows. +- [Network volumes added to Serverless](/storage/network-volumes): Attach persistent storage to Serverless workers to retain models and artifacts across restarts and speed cold starts through caching. +- **Serverless region support**: Pin or allow specific regions for endpoints to reduce latency and meet data-residency needs. + + + + +## Deeper autoscaling controls, richer metrics, persistent storage, and job cancellation + +- **Serverless scaling strategies**: Scale by queue delay and/or concurrency with min/max worker bounds to balance latency and cost. +- **Queue delay**: Expose time-in-queue as a first-class metric to drive autoscaling and SLO monitoring. +- **Request count**: Track success and failure totals over windows for quick health checks and alerting. +- **runsync**: Synchronous invocation path that returns results in the same HTTP call for short-running jobs. +- **Network storage beta**: Region-scoped, attachable volumes shareable across Pods and endpoints for model caches and datasets. +- **Job cancel API**: Programmatically terminate queued or running jobs to free capacity and enforce client timeouts. + + + + +## Serverless platform hardens with cleaner API + +- **Serverless API v2**: Revised request and response schema with improved error semantics and new endpoints for better control over job lifecycle and observability. + + + + +## Better control over notifications and GPU allocation + +- **Notification preferences**: Configure which platform events trigger alerts to reduce noise for teams and CI systems. +- **GPU priorities**: Influence scheduling by marking workloads as higher priority to reduce queue time for critical jobs. + + + + +## Encrypted volumes for persistent data + +- **Runpod now offers encrypted volumes**: Enable at-rest encryption for persistent volumes with no application changes required using platform-managed keys. + + \ No newline at end of file diff --git a/runpodctl/reference/runpodctl-ssh-add-key.mdx b/runpodctl/reference/runpodctl-ssh-add-key.mdx index 0af48ef2..b5c42997 100644 --- a/runpodctl/reference/runpodctl-ssh-add-key.mdx +++ b/runpodctl/reference/runpodctl-ssh-add-key.mdx @@ -31,5 +31,4 @@ The path to a file containing the SSH public key to add. This is typically a `.p ## Related commands -- [`runpodctl ssh`](/runpodctl/reference/runpodctl-ssh) - [`runpodctl ssh list-keys`](/runpodctl/reference/runpodctl-ssh-list-keys) diff --git a/serverless/endpoints/manage-endpoints.mdx b/serverless/endpoints/manage-endpoints.mdx index 5e545b81..e430218d 100644 --- a/serverless/endpoints/manage-endpoints.mdx +++ b/serverless/endpoints/manage-endpoints.mdx @@ -21,7 +21,7 @@ To create a new Serverless endpoint through the Runpod web interface: * **Endpoint Name**: The display name for your endpoint in the console. * **Endpoint Type**: Select **Queue** for traditional queue-based processing or **Load balancer** for direct HTTP access (see [Load balancing endpoints](/serverless/load-balancing/overview) for details). * **GPU Configuration**: Select the appropriate GPU types and configure worker settings. - * **Model (optional)**: Enter a model URL from Hugging Face to optimize worker startup times. See [Pre-cached models](/storage/model-caching) for details. + * **Model (optional)**: Enter a model URL from Hugging Face to optimize worker startup times. See [Pre-cached models](/serverless/endpoints/model-caching) for details. * **Container Configuration**: Edit the container start command, specify the [container disk size](/serverless/storage/overview), and expose HTTP/TCP ports. * **Environment Variables**: Add [environment variables](/serverless/development/environment-variables) for your worker containers. 6. Click **Create Endpoint** to deploy. diff --git a/serverless/workers/overview.mdx b/serverless/workers/overview.mdx index 29f428b8..75b4e7ba 100644 --- a/serverless/workers/overview.mdx +++ b/serverless/workers/overview.mdx @@ -52,7 +52,7 @@ You can view the state of your workers using the **Workers** tab of a Serverless ## Debugging workers -To debug issues in production, you can access [worker logs](/serverless/development/logs) and [SSH directly into running workers](/serverless/workers/ssh-into-workers) to inspect file systems and environment variables in real-time. +To debug issues in production, you can access [worker logs](/serverless/development/logs) and [SSH directly into running workers](/serverless/development/ssh-into-workers) to inspect file systems and environment variables in real-time. ## Max worker limit