
Conversation

Contributor

@salonichf5 salonichf5 commented Oct 15, 2025

Proposed changes


Problem: Users want conformance tests enabled to ensure the correct checks are in place.

Solution: Enable the conformance tests for the Inference Extension.

Testing: Still being tested manually.


Closes #3842

Checklist

Before creating a PR, run through this checklist and mark each as complete.

  • I have read the CONTRIBUTING doc
  • I have added tests that prove my fix is effective or that my feature works
  • I have checked that all unit tests pass after adding my changes
  • I have updated necessary documentation
  • I have rebased my branch onto main
  • I will ensure my PR is targeting the main branch and pulling from my branch from my own fork

Release notes

If this PR introduces a change that affects users and needs to be mentioned in the release notes,
please add a brief note that summarizes the change.

Enable conformance tests for Inference Extension

sjberman and others added 9 commits October 15, 2025 11:20
Problem: To support the full Gateway API Inference Extension, we need to be able to extract the model name from the client request body in certain situations.

Solution: Add a basic NJS module to extract the model name. This module will be enhanced (I've added notes) to be included in the full solution. On its own, it is not yet used.
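The extraction step can be sketched in Go (the shipped module is NJS; this is only an illustration of the logic). The `model` field name follows the OpenAI-style request body commonly used by inference workloads, and the function name is an assumption, not the module's actual API:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// extractModelName pulls the "model" field out of a JSON request body,
// returning "" when the body is not valid JSON or the field is absent.
func extractModelName(body []byte) string {
	var req struct {
		Model string `json:"model"`
	}
	if err := json.Unmarshal(body, &req); err != nil {
		return ""
	}
	return req.Model
}

func main() {
	fmt.Println(extractModelName([]byte(`{"model": "llama-3-8b", "prompt": "hi"}`)))
}
```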
This commit adds support for the control plane to watch InferencePools. A feature flag has been added to enable/disable processing these resources. By default, it is disabled.

When an HTTPRoute references an InferencePool, we will create a headless Service associated with that InferencePool, and reference it internally in the graph config for that Route. This allows us to use all of our existing logic to get the endpoints and build the proper nginx config for those endpoints.

In a future commit, the nginx config will be updated to handle the proper load balancing for the AI workloads, but for now we just use our default methods by proxy_passing to the upstream.
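A minimal sketch of the kind of headless Service the control plane might generate for an InferencePool. All names, selectors, and ports here are hypothetical placeholders, not the controller's actual output:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: vllm-pool-headless   # hypothetical; derived from the InferencePool name
spec:
  clusterIP: None            # headless: endpoints are tracked, but no VIP is allocated
  selector:
    app: vllm                # hypothetical; would come from the InferencePool's selector
  ports:
    - port: 8000             # hypothetical port of the AI workload
```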
Problem: In order for NGINX to get the endpoint of the AI workload from the EndpointPicker, it needs to send a gRPC request using the proper protobuf protocol.

Solution: A simple Go server is injected as an additional container when the inference extension feature is enabled, that will listen for a request from our (upcoming) NJS module, and forward to the configured EPP to get a response in a header.
Problem: We need to connect NGINX to the Golang shim that talks to the EndpointPicker, and then pass client traffic to the proper inference workload.

Solution: Write an NJS module that will query the local Go server to get the AI endpoint to route traffic to. Then redirect the original client request to an internal location that proxies the traffic to the chosen endpoint.

The location building gets a bit complicated especially when using both HTTP matching conditions and inference workloads. It requires 2 layers of internal redirects. I added lots of comments to hopefully clear up how we build these locations to perform all the routing steps.
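The two-layer redirect described above might look roughly like the following nginx config. The location names, the NJS handler name, and the variable are hypothetical; `js_content` and `proxy_pass` with a variable are the real mechanisms:

```nginx
location /chat {
    # External match location: run the NJS function that queries the Go shim.
    js_content inference.route;            # hypothetical NJS handler name
}

location /_ngf_internal_inference_chat {
    # Internal redirect target: proxy to the endpoint chosen by the EPP.
    internal;
    proxy_pass http://$inference_backend;  # variable set by the NJS module (hypothetical)
}
```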
Update the inference extension design doc to specify the different statuses that need to be set on InferencePools to understand their state
…4006)

Update gateway inference extension proposal on inability to provide a secure TLS connection to EPP.
Add status to InferencePools

Problem: Users want to see the current status of their InferencePools.

Solution: Add status for InferencePools.
Proposed changes
Problem: We want to collect the number of referenced InferencePools in the cluster.

Solution: Collect the count of referenced InferencePools.

Testing: Unit tests and manually verified collection via debug logs.
@github-actions github-actions bot added the documentation Improvements or additions to documentation label Oct 15, 2025
@salonichf5 salonichf5 force-pushed the tests/conformance-inference-extension branch from 4f307c1 to 0bf576c Compare October 15, 2025 20:03
Contributor Author

salonichf5 commented Oct 15, 2025

Still need to address this comment and this one

@salonichf5 salonichf5 changed the title Do not review: Tests/conformance inference extension Enable conformance tests for Inference Extension Oct 15, 2025
@salonichf5 salonichf5 marked this pull request as ready for review October 15, 2025 20:06
@salonichf5 salonichf5 requested a review from a team as a code owner October 15, 2025 20:06
@salonichf5 salonichf5 force-pushed the tests/conformance-inference-extension branch from cb8e184 to 5a30751 Compare October 15, 2025 23:50
@salonichf5 salonichf5 force-pushed the tests/conformance-inference-extension branch from 5a30751 to ddb3ae3 Compare October 16, 2025 01:05
@shaun-nx shaun-nx mentioned this pull request Oct 16, 2025
Comment on lines 779 to 784
cmd.Flags().BoolVar(
&endpointPickerEnableTLS,
endpointPickerEnableTLSFlag,
false,
"Enable TLS for gRPC connections to the EndpointPicker extension.",
)
Contributor


We need to provide some mechanism to actually expose and set these flags. Since we deploy this as a sidecar in the NGINX pod, I think exposing these options in the NginxProxy makes the most sense. They can then be propagated to the CLI args in the pod spec in objects.go. We can do this in a separate PR, but at the very least, for now, we need to default this flag to true for the conformance tests.

Contributor Author


Yes, currently the flags disable TLS and enable InsecureSkipVerify. That seems like a better idea; let me do that.

Contributor Author


I have been trying to set the flags, but it's proving a bit trickier than I initially thought.

As per our offline discussion, I will create a story to make these configurable in the future. By default, I will enable TLS and InsecureSkipVerify.

Also, I will squash these commits into one and make sure I co-author you, so it's easier and faster to rebase.

@github-project-automation github-project-automation bot moved this from 🆕 New to 🏗 In Progress in NGINX Gateway Fabric Oct 16, 2025
@ciarams87 ciarams87 force-pushed the feat/inference-extension branch from 52fb31b to ba7bd6d Compare October 16, 2025 11:01
@ciarams87 ciarams87 requested a review from a team as a code owner October 16, 2025 11:01
@ciarams87 ciarams87 force-pushed the tests/conformance-inference-extension branch from ddb3ae3 to e13a1ec Compare October 16, 2025 11:10
@salonichf5 salonichf5 force-pushed the tests/conformance-inference-extension branch from e13a1ec to 2ccea14 Compare October 16, 2025 16:13
@github-actions github-actions bot added dependencies Pull requests that update a dependency file helm-chart Relates to helm chart labels Oct 16, 2025
@salonichf5 salonichf5 force-pushed the tests/conformance-inference-extension branch from 2ccea14 to 16cb93a Compare October 16, 2025 19:25
@salonichf5 salonichf5 closed this Oct 16, 2025
@github-project-automation github-project-automation bot moved this from 🏗 In Progress to ✅ Done in NGINX Gateway Fabric Oct 16, 2025

Labels

  • dependencies: Pull requests that update a dependency file
  • documentation: Improvements or additions to documentation
  • helm-chart: Relates to helm chart
  • release-notes

Projects

Status: Done

5 participants