[Klaud Cold] Update dsr1-fp8-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14 by functionstackx · Pull Request #1488 · SemiAnalysisAI/InferenceX

functionstackx · 2026-05-18T06:16:16Z

Summary

Update TensorRT-LLM image (off: v1.2.0rc6.post2 109d / mtp: v1.2.0rc6.post3 102d) to v1.3.0rc14 (latest pre-release)

dsr1-fp8-b200-trt: nvcr.io#nvidia/tensorrt-llm/release:1.2.0rc6.post2 → nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14
dsr1-fp8-b200-trt-mtp: nvcr.io#nvidia/tensorrt-llm/release:1.2.0rc6.post3 → nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14

Test plan

full-sweep-enabled sweep passes.

🤖 Generated with Claude Code

Update TensorRT-LLM image (off: v1.2.0rc6.post2 109d / mtp: v1.2.0rc6.post3 102d) to v1.3.0rc14 (latest pre-release) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-18T06:16:24Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-18T06:16:25Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-18T06:16:49Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26016871706
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26016871706

claude

LGTM, straightforward TRT-LLM image version bump with matching changelog entry.

Extended reasoning...

Overview

This PR bumps the TensorRT-LLM image for two configs (dsr1-fp8-b200-trt and dsr1-fp8-b200-trt-mtp) from v1.2.0rc6.post2/v1.2.0rc6.post3 to v1.3.0rc14 in .github/configs/nvidia-master.yaml, and adds a corresponding entry to perf-changelog.yaml.

Security risks

None. These are YAML config changes pointing to an official NVIDIA NGC image (nvcr.io#nvidia/tensorrt-llm/release), and there is no executable code being introduced.

Level of scrutiny

Low. This is a mechanical, auto-generated image version bump (Klaud Cold) of the kind that lands routinely in this repo — see recent commits like fc84ab7 (the previous TRT-LLM update for the same configs) and similar SGLang/vLLM image bumps. Correctness is enforced by the full-sweep-enabled label triggering a sweep job.

Other factors

The bug hunting system found no issues, the changelog entry correctly references PR #1488 with accurate before/after image tags, and the change pattern exactly mirrors prior approved image bumps in this repository.

github-actions · 2026-05-18T14:35:10Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26016874169
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26016874169

functionstackx · 2026-05-18T16:17:38Z

/reuse-sweep-run

github-actions · 2026-05-18T16:18:39Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26045823723
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26045823723

Update dsr1-fp8-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14

fc84ab7

Update TensorRT-LLM image (off: v1.2.0rc6.post2 109d / mtp: v1.2.0rc6.post3 102d) to v1.3.0rc14 (latest pre-release) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

functionstackx requested a review from a team May 18, 2026 06:16

functionstackx added the full-sweep-enabled label May 18, 2026

functionstackx requested review from jgangani and kedarpotdar-nv as code owners May 18, 2026 06:16

github-project-automation Bot added this to InferenceMAX Board May 18, 2026

chore: fill pr-link for #1488

fff5373

claude Bot reviewed May 18, 2026

View reviewed changes

Merge branch 'main' into update-dsr1-fp8-b200-trt-v1.3.0rc14

5d1b736

functionstackx merged commit e43ae2f into main May 18, 2026
3 of 5 checks passed

functionstackx deleted the update-dsr1-fp8-b200-trt-v1.3.0rc14 branch May 18, 2026 16:17

github-project-automation Bot moved this to Done in InferenceMAX Board May 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Klaud Cold] Update dsr1-fp8-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14#1488

[Klaud Cold] Update dsr1-fp8-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14#1488
functionstackx merged 3 commits into
mainfrom
update-dsr1-fp8-b200-trt-v1.3.0rc14

functionstackx commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

claude Bot left a comment

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

functionstackx commented May 18, 2026

Summary

Test plan

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Overview

Security risks

Level of scrutiny

Other factors

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant