Merged
13 changes: 13 additions & 0 deletions docs/serverless/endpoints/manage-endpoints.md
@@ -37,6 +37,19 @@ You can edit a running Endpoint in the Web interface after you've deployed it.
3. Select **Edit Endpoint** and make your changes.
4. Choose **Update**.

## Set GPU prioritization for an Endpoint

When creating or modifying a Worker Endpoint, list your preferred GPU models in descending order of priority.

RunPod attempts to allocate your first choice.
If that GPU isn't available, the system automatically falls back to the next available GPU in your priority list.
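
The fallback behavior described above can be sketched in a few lines of Python. This is only an illustration of the selection rule, not RunPod's actual scheduler or API: the `pick_gpu` function and the GPU names in the example list are hypothetical.

```python
from typing import Iterable, Optional, Sequence


def pick_gpu(priority: Sequence[str], available: Iterable[str]) -> Optional[str]:
    """Return the highest-priority GPU that is currently available, or None.

    Hypothetical helper: it mirrors the documented rule (first choice if
    available, otherwise the next available entry in the priority list).
    """
    in_stock = set(available)
    for gpu in priority:  # ordered from most to least preferred
        if gpu in in_stock:
            return gpu
    return None  # nothing in the priority list is available right now


if __name__ == "__main__":
    # Illustrative priority list and availability snapshot (assumed values).
    priority = ["A100 80GB", "RTX A6000", "RTX 4090"]
    print(pick_gpu(priority, ["RTX 4090", "RTX A6000"]))
```

In this sketch the first choice is unavailable, so the second entry in the priority list is selected even though a lower-priority GPU is also available.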

1. Navigate to [Serverless Endpoints](https://www.dev.runpod.io/console/serverless).
2. Select the Endpoint you'd like to update.
3. Set the priority order of the GPUs you'd like to use.
4. Choose **Update**.

:::note

You can force a configuration update by setting **Max Workers** to 0, selecting **Update**, then updating your max workers back to your needed value.
2 changes: 1 addition & 1 deletion docs/serverless/workers/overview.md
@@ -16,6 +16,6 @@ Workers run your code in the cloud.

To start using RunPod Workers:

-1. **Write Your Function**: Code your Handler Functions in a supported language.
+1. **Write your function**: Code your Handler Functions in a supported language.
2. **Deploy to RunPod**: Upload your Handler Functions to RunPod.
3. **Integrate and Execute**: Use the provided Endpoint to integrate with your application.
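
As a rough illustration of the first step, a Handler Function can be as small as the sketch below. It assumes the job arrives as a dictionary with an `input` key; the `name` field and the greeting payload are hypothetical, and the sketch deliberately does not import the RunPod SDK, so it only shows the shape of the function rather than a deployable Worker.

```python
from typing import Any, Dict


def handler(job: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical Handler Function: greet the name passed in the job input."""
    job_input = job.get("input", {})  # assumed layout: {"input": {...}}
    name = job_input.get("name", "World")
    return {"greeting": f"Hello, {name}!"}


if __name__ == "__main__":
    # Local smoke test only. A deployed Worker would hand `handler`
    # to the RunPod SDK instead of calling it directly.
    print(handler({"input": {"name": "RunPod"}}))
```

Keeping the handler a plain function makes it easy to test locally before uploading it.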