Skip to content

DX-596-together-dedicated-endpoints: sync with mintlify-docs#936#32

Merged
zainhas merged 1 commit into
mainfrom
docs-sync/together-dedicated-endpoints/mintlify-docs-pr-936
Jun 9, 2026
Merged

DX-596-together-dedicated-endpoints: sync with mintlify-docs#936#32
zainhas merged 1 commit into
mainfrom
docs-sync/together-dedicated-endpoints/mintlify-docs-pr-936

Conversation

@zainhas

@zainhas zainhas commented Jun 8, 2026

Copy link
Copy Markdown
Collaborator

Syncs the together-dedicated-endpoints skill with the merged docs PR togethercomputer/mintlify-docs#936 (DX-642: Auto-sync dedicated endpoint hardware pricing).

Triggering docs files

  • docs/dedicated-endpoints/overview.mdx — auto-generated hardware pricing table and new "Scaling out" / "List hardware options" sections.

Skill changes

  • Replace the GPU Types table with the currently-offered families (H100 SXM, H200 SXM, B200 SXM) and call out A100 / L40 / L40S / RTX 6000 as deprecated for new dedicated endpoints.
  • Drop A100 rows from the Common Configurations table and add H200 (1x / 4x / 8x) and B200 (1x / 8x) rows with current hardware IDs.
  • Add a "Single-GPU on-demand rates" sub-section under Pricing Model that mirrors the upstream auto-generated table ($6.49, $7.89, $11.95/hour) plus the multi-GPU hardware-ID prefix rule and the tg endpoints hardware --model <MODEL_ID> discovery command.
  • Refresh the cents_per_minute example in both the hardware response object (in hardware-options.md) and the API reference's List Hardware example to a current H100 value.
  • Update the GPU Selection Guide to drop A100/L40 references and add H200/B200 guidance.

No SKILL.md, script, or agents/openai.yaml changes were needed.


Generated by the Sync Skills Cursor Automation. Please review before merging.

…ify-docs#936

Updates the dedicated-endpoints hardware reference to match the new
upstream pricing table generated from /v1/hardware:

- GPU Types and Common Configurations now list only H100/H200/B200
  (the currently-offered families); A100, L40, L40S, and RTX 6000 are
  called out as deprecated for new endpoints.
- Adds a single-GPU on-demand rate table (H100 $6.49/hr,
  H200 $7.89/hr, B200 $11.95/hr) and notes the multi-GPU hardware-ID
  prefix convention.
- Refreshes the cents_per_minute example in the hardware response
  object to reflect the current H100 rate.
- Updates the GPU Selection Guide to drop A100/L40 and add H200/B200
  guidance.

Generated by the Sync Skills Cursor Automation.
@zainhas zainhas requested a review from muhsinking June 8, 2026 16:58
@zainhas zainhas merged commit cc58d2b into main Jun 9, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants