-
Notifications
You must be signed in to change notification settings - Fork 0
MAF-19288: docs(website): add heimdall plugin reference #57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| @@ -0,0 +1,284 @@ | ||||||||||||||||||
| --- | ||||||||||||||||||
| title: Heimdall Plugin | ||||||||||||||||||
| sidebar_label: Plugin | ||||||||||||||||||
| sidebar_position: 2 | ||||||||||||||||||
| --- | ||||||||||||||||||
|
Comment on lines
+1
to
+5
|
||||||||||||||||||
|
|
||||||||||||||||||
| ## Profile Handlers | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `single-profile-handler` | ||||||||||||||||||
|
|
||||||||||||||||||
| Handles a single profile which is always the primary profile. | ||||||||||||||||||
|
|
||||||||||||||||||
| No parameters. | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `pd-profile-handler` | ||||||||||||||||||
|
|
||||||||||||||||||
| Handles scheduler profiles for Prefill-Decode (PD) disaggregation. | ||||||||||||||||||
|
|
||||||||||||||||||
| | Parameter | Type | Default | Description | | ||||||||||||||||||
| | :----------------- | :------- | :---------------------- | :--------------------------------------------------- | | ||||||||||||||||||
| | `threshold` | `int` | `0` | Threshold for decoding operations. | | ||||||||||||||||||
| | `decodeProfile` | `string` | `"decode"` | Name of the profile to use for decode operations. | | ||||||||||||||||||
| | `prefillProfile` | `string` | `"prefill"` | Name of the profile to use for prefill operations. | | ||||||||||||||||||
| | `prefixPluginType` | `string` | `"prefix-cache-scorer"` | Type of the prefix cache plugin to use. | | ||||||||||||||||||
| | `prefixPluginName` | `string` | `"prefix-cache-scorer"` | Name of the prefix cache plugin to use. | | ||||||||||||||||||
| | `hashBlockSize` | `int` | `64` | Block size used for hashing tokens. | | ||||||||||||||||||
| | `primaryPort` | `int` | `0` | Port number of the primary container (0 to disable). | | ||||||||||||||||||
|
|
||||||||||||||||||
| ## Filters | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `by-label` | ||||||||||||||||||
|
|
||||||||||||||||||
| Filters out pods based on the values defined by the given label. | ||||||||||||||||||
|
|
||||||||||||||||||
| | Parameter | Type | Default | Description | | ||||||||||||||||||
| | :-------------- | :--------- | :------ | :------------------------------------------------------------------------------ | | ||||||||||||||||||
| | `label` | `string` | - | The label key to filter by. (Required) | | ||||||||||||||||||
| | `validValues` | `[]string` | - | List of allowed values for the label. (Required unless `allowsNoLabel` is true) | | ||||||||||||||||||
| | `allowsNoLabel` | `bool` | `false` | Whether to allow pods that do not have the specified label. | | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `by-label-selector` | ||||||||||||||||||
|
|
||||||||||||||||||
| Filters out pods that do not match the configured label selector criteria. | ||||||||||||||||||
|
|
||||||||||||||||||
| | Parameter | Type | Default | Description | | ||||||||||||||||||
| | :------------ | :------------------ | :------ | :----------------------------------------- | | ||||||||||||||||||
| | `matchLabels` | `map[string]string` | - | Key-value pairs of labels that must match. | | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `prefill-filter` | ||||||||||||||||||
|
|
||||||||||||||||||
| Filters for pods designated with the `prefill` role. It retains pods that have the label `mif.moreh.io/role` set to `prefill`. | ||||||||||||||||||
|
|
||||||||||||||||||
| No parameters. | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `decode-filter` | ||||||||||||||||||
|
|
||||||||||||||||||
| Filters for pods designated with the `decode` role. It retains pods that satisfy one of the following conditions: | ||||||||||||||||||
|
|
||||||||||||||||||
| - The label `mif.moreh.io/role` is set to `decode` or `both`. | ||||||||||||||||||
| - The label `mif.moreh.io/role` is not set. | ||||||||||||||||||
|
|
||||||||||||||||||
| No parameters. | ||||||||||||||||||
|
|
||||||||||||||||||
| ## Scorers | ||||||||||||||||||
|
|
||||||||||||||||||
| ### `active-request-scorer` | ||||||||||||||||||
|
|
||||||||||||||||||
| Scores pods based on the number of active requests being served. Scores are normalized from 0 to 1. | ||||||||||||||||||
|
||||||||||||||||||
| Scores pods based on the number of active requests being served. Scores are normalized from 0 to 1. | |
| Scores pods based on the number of active requests being served. The scores are normalized to the range [0, 1]. |
Copilot
AI
Feb 12, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
inMemoryConfig.size is documented as an int but the default is written as 1e8. Many configuration formats treat scientific notation as a float (or reject it for integers), so this default may be confusing/misleading. Prefer spelling this out as an integer literal (e.g., 100000000) if that’s the intended value.
| | Parameter | Type | Default | Description | | |
| | :------------- | :---- | :------ | :------------------------------------- | | |
| | `size` | `int` | `1e8` | Maximum number of keys in the index. | | |
| | `podCacheSize` | `int` | `10` | Maximum number of pod entries per key. | | |
| | Parameter | Type | Default | Description | | |
| | :------------- | :---- | :--------- | :------------------------------------- | | |
| | `size` | `int` | `100000000`| Maximum number of keys in the index. | | |
| | `podCacheSize` | `int` | `10` | Maximum number of pod entries per key. | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The frontmatter doesn’t follow the contributor guidance in
website/AGENTS.mdthat says to wraptitlein single quotes. Either update this page’s frontmatter to match the rule or adjust the rule to reflect the actual convention used across the docs.