diff --git a/docs/proposals/gateway-inference-extension.md b/docs/proposals/gateway-inference-extension.md index fb6abe3ae2..bf17bf51d4 100644 --- a/docs/proposals/gateway-inference-extension.md +++ b/docs/proposals/gateway-inference-extension.md @@ -76,20 +76,18 @@ flowchart TD A[Client Request] --> B[NGINX] subgraph NGINX Pod subgraph NGINX Container - B --1--> C[NJS Module: extract model name if needed] - C --2--> B - B --3--> D[NJS Module: Subrequest to Go App] + B --1--> C[NJS Module: Subrequest to Go App] end subgraph Go Application Container - E[Go Application] + D[Go Application] end - D -- 4. subrequest --> E + C -- 2. subrequest --> D end - E -- 5. gRPC ext_proc protocol --> F[Endpoint Picker Pod] - F -- 6. Endpoint in Header --> E - E --7--> D - D --8--> B - B --9--> G[AI Workload Endpoint] + D -- 3. gRPC ext_proc protocol --> E[Endpoint Picker Pod] + E -- 4. Endpoint in Header --> D + D --5--> C + C --6--> B + B --7--> F[AI Workload Endpoint] ``` ## API, Customer Driven Interfaces, and User Experience