feat: add CUJ2 inference demo chat UI and update CUJ2 instructions by yuanchen8911 · Pull Request #151 · NVIDIA/aicr

yuanchen8911 · 2026-02-19T04:46:34Z

Summary

Add a browser-based chat UI for testing Dynamo inference deployments and update CUJ2
documentation with chat UI instructions.

Motivation / Context

Provides a simple way to interactively test Dynamo vLLM inference deployments via a
browser chat interface, in addition to the existing curl-based approach.

Fixes: N/A
Related: N/A

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update
Refactoring (no functional changes)
Build/CI/tooling

Component(s) Affected

CLI (cmd/eidos, pkg/cli)
API server (cmd/eidosd, pkg/api, pkg/server)
Recipe engine / data (pkg/recipe)
Bundlers (pkg/bundler, pkg/component/*)
Collectors / snapshotter (pkg/collector, pkg/snapshotter)
Validator (pkg/validator)
Core libraries (pkg/errors, pkg/k8s)
Docs/examples (docs/, examples/)
Other: ____________

Implementation Notes

examples/demos/workloads/inference/chat.html — Browser chat UI targeting
Dynamo's OpenAI-compatible /v1/chat/completions endpoint
examples/demos/workloads/inference/chat-server.sh — Single script that starts
kubectl port-forward and a local proxy server, then serves the chat UI on port 9090
examples/demos/cuj2.md — Updated "Test the endpoint" section with chat UI as
Option 1 and curl as Option 2

Testing

make lint

Tested manually against Dynamo vLLM deployment on H100 EKS cluster.

Risk Assessment

Low — Isolated change, well-tested, easy to revert
Medium — Touches multiple components or has broader impact
High — Breaking change, affects critical paths, or complex rollout

Rollout notes: N/A

Checklist

Tests pass locally (make test with -race)
Linter passes (make lint)
I did not skip/disable tests to make CI green
I added/updated tests for new functionality
I updated docs if user-facing behavior changed
Changes follow existing patterns in the codebase
Commits are cryptographically signed (git commit -S) — GPG signing info

Signed-off-by: Yuan Chen <yuanchen97@gmail.com>

mchmarny

/lgtm

feat: add chat UI and update inference demo instructions

15872ed

Signed-off-by: Yuan Chen <yuanchen97@gmail.com>

yuanchen8911 requested a review from a team as a code owner February 19, 2026 04:46

github-actions bot added the size/L label Feb 19, 2026

yuanchen8911 requested review from dims and mchmarny February 19, 2026 04:48

yuanchen8911 changed the title ~~feat: add inference demo chat UI and update CUJ2 instructions~~ feat: add CUJ2 inference demo chat UI and update CUJ2 instructions Feb 19, 2026

yuanchen8911 added enhancement New feature or request area/docs size/S area/tests area/validator and removed size/S enhancement New feature or request labels Feb 19, 2026

mchmarny approved these changes Feb 19, 2026

View reviewed changes

mchmarny merged commit 9a96d23 into NVIDIA:main Feb 19, 2026
13 checks passed

mchmarny deleted the feat/inference-demo-chat-ui branch February 19, 2026 12:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add CUJ2 inference demo chat UI and update CUJ2 instructions#151

feat: add CUJ2 inference demo chat UI and update CUJ2 instructions#151
mchmarny merged 1 commit intoNVIDIA:mainfrom
yuanchen8911:feat/inference-demo-chat-ui

yuanchen8911 commented Feb 19, 2026

Uh oh!

mchmarny left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yuanchen8911 commented Feb 19, 2026

Summary

Motivation / Context

Type of Change

Component(s) Affected

Implementation Notes

Testing

Risk Assessment

Checklist

Uh oh!

mchmarny left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants