Conversation

asimurka commented Oct 10, 2025

Description

Adds support for Azure inference.

Type of change

  • Refactor
  • New feature
  • Bug fix
  • CVE fix
  • Optimization
  • Documentation Update
  • Configuration Update
  • Bump-up service version
  • Bump-up dependent library
  • Bump-up library or tool used for development (does not change the final image)
  • CI configuration change
  • Konflux configuration change
  • Unit tests improvement
  • Integration tests improvement
  • End to end tests improvement

Related Tickets & Documents

Checklist before requesting a review

  • I have performed a self-review of my code.
  • PR has passed all pre-merge test jobs.
  • If it is a core feature, I have added thorough tests.

Testing

  • Please provide detailed steps to perform tests related to this code change.
  • How were the fix/results from this change verified? Please provide relevant screenshots or results.

Summary by CodeRabbit

  • New Features

    • Added Azure provider support across the stack with an example Azure run configuration and model entries.
  • Documentation

    • Updated compatibility table and provider docs to list new Azure models and provider details.
  • Tests

    • Added end-to-end Azure test configuration for E2E runs.
  • Chores

    • CI extended to run E2E on Azure, including secure acquisition and injection of an Azure access token.
    • Docker setup accepts Azure API key via environment.


coderabbitai bot commented Oct 10, 2025

Walkthrough

Adds Azure as an explicit E2E/stack target: CI workflow gains an Azure matrix entry and token acquisition step, docker-compose forwards AZURE_API_KEY to services, README and provider docs updated for Azure, and two new Azure-focused stack configs are added (example and E2E). (50 words)

Changes

Cohort / File(s) and Summary

  • CI workflow: Azure matrix & token acquisition (.github/workflows/e2e_tests.yaml)
    Adds azure to the workflow matrix, exposes Azure client secrets, and adds a conditional step that performs an OAuth2 client-credentials flow against Azure AD to extract access_token and export AZURE_API_KEY (see the sketch after this list).
  • Runtime env wiring for Azure key (docker-compose.yaml)
    Adds AZURE_API_KEY=${AZURE_API_KEY} to the environment of the llama-stack and lightspeed-stack services.
  • Documentation updates for Azure support (README.md, docs/providers.md)
    Adds Azure rows to the LLM compatibility table in README.md; updates docs/providers.md to mark Azure as supported and adjusts the pip deps, display name, and documentation link.
  • Azure example stack config (examples/azure-run.yaml)
    New example YAML defining a full Azure-backed stack: providers, stores, telemetry, models (Azure provider), and other stack settings.
  • E2E Azure config (tests/e2e/configs/run-azure.yaml)
    New minimal E2E YAML for Azure: provider entries, sqlite stores, server settings, and model mapping to the Azure provider.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  actor Dev as Developer
  participant GH as GitHub Actions
  participant AAD as Azure AD
  participant DC as Docker Compose
  participant Svc as Services (llama-stack / lightspeed-stack)
  participant T as E2E Tests

  Dev->>GH: Push/PR triggers E2E (matrix: ci, azure)
  alt environment == "azure"
    GH->>AAD: OAuth2 client_credentials (CLIENT_ID, TENANT_ID, CLIENT_SECRET)
    AAD-->>GH: JSON { access_token }
    GH->>GH: Extract & export AZURE_API_KEY
  else environment != "azure"
    GH->>GH: Skip Azure token step
  end

  GH->>DC: docker-compose up (env includes AZURE_API_KEY if set)
  DC->>Svc: Start containers with env vars
  Svc-->>T: Run E2E tests (inference may call Azure with AZURE_API_KEY)
  T-->>GH: Results (pass/fail)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes


Suggested reviewers

  • radofuchs

Poem

I thump my paws on clouds of blue,
A token nibble, fresh and new—
Compose hums softly, stacks awake,
Azure whispers keys to take.
With models purring, tests hop through 🥕✨

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)
  • Description Check: ✅ Passed. Check skipped; CodeRabbit’s high-level summary is enabled.
  • Title Check: ✅ Passed. The title clearly summarizes the primary change of adding Azure inference support and directly reflects the main intent of the changeset without extraneous detail or vague phrasing.
  • Docstring Coverage: ✅ Passed. No functions found in the changes; docstring coverage check skipped.

📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c26dffd and fcf2306.

📒 Files selected for processing (6)
  • .github/workflows/e2e_tests.yaml (2 hunks)
  • README.md (1 hunks)
  • docker-compose.yaml (2 hunks)
  • docs/providers.md (2 hunks)
  • examples/azure-run.yaml (1 hunks)
  • tests/e2e/configs/run-azure.yaml (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (3)
  • docker-compose.yaml
  • tests/e2e/configs/run-azure.yaml
  • examples/azure-run.yaml
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: build-pr
  • GitHub Check: e2e_tests (ci)


coderabbitai bot left a comment

Actionable comments posted: 2

🧹 Nitpick comments (2)
docker-compose.yaml (1)

15-15: Pass through AZURE_API_TYPE for Azure auth mode parity

If CI uses AAD tokens (Bearer) while other environments use API keys (api-key), pass AZURE_API_TYPE through compose as well so llama-stack can switch modes.

 services:
   llama-stack:
     environment:
       - OPENAI_API_KEY=${OPENAI_API_KEY}
       - AZURE_API_KEY=${AZURE_API_KEY}
+      - AZURE_API_TYPE=${AZURE_API_TYPE:-}
 ...
   lightspeed-stack:
     environment:
       - OPENAI_API_KEY=${OPENAI_API_KEY}
       - AZURE_API_KEY=${AZURE_API_KEY}
+      - AZURE_API_TYPE=${AZURE_API_TYPE:-}

Also applies to: 40-40
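
As a usage note, the variables forwarded above would typically come from the shell or a .env file next to docker-compose.yaml when running locally; a hypothetical example (values are placeholders, and AZURE_API_TYPE only matters if the AAD/Bearer mode is adopted):

  # .env (hypothetical values); docker compose substitutes these into
  # ${AZURE_API_KEY} and ${AZURE_API_TYPE:-} in docker-compose.yaml
  OPENAI_API_KEY=<openai-key>
  AZURE_API_KEY=<api-key-or-aad-access-token>
  AZURE_API_TYPE=azure_ad   # omit or leave empty when using a plain API key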

examples/azure-run.yaml (1)

72-79: Fix env substitution and generalize api_base in example

Use supported env syntax and avoid hard‑coded API base in examples.

       config: 
         api_key: ${env.AZURE_API_KEY}
-        api_base: https://ols-test.openai.azure.com/
+        api_base: ${env.AZURE_API_BASE}
         api_version: 2024-02-15-preview
-        api_type: ${env.AZURE_API_TYPE:=}
+        api_type: ${env.AZURE_API_TYPE}
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3ee967a and c26dffd.

📒 Files selected for processing (6)
  • .github/workflows/e2e_tests.yaml (2 hunks)
  • README.md (1 hunks)
  • docker-compose.yaml (2 hunks)
  • docs/providers.md (2 hunks)
  • examples/azure-run.yaml (1 hunks)
  • tests/e2e/configs/run-azure.yaml (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: build-pr
  • GitHub Check: e2e_tests (ci)
🔇 Additional comments (5)
docs/providers.md (1)

39-39: Azure provider doc updates look good

Azure marked supported and link updated. Please verify no extra pip deps are needed for the azure remote in upstream llama-stack.

Also applies to: 290-290

README.md (1)

128-130: Azure rows added to compatibility table look good

Aligns with new example and tests.

tests/e2e/configs/run-azure.yaml (1)

125-128: Confirm Azure model/deployment mapping

Azure OpenAI often requires a deployment name rather than a raw model id. Confirm whether llama‑stack's Azure provider expects provider_model_id=gpt-4o-mini or a deployment identifier; adjust if needed.
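
If the provider does resolve provider_model_id to an Azure OpenAI deployment, the mapping would presumably carry the deployment name rather than the model string; a hypothetical sketch (the deployment name is invented, and the exact field semantics should be confirmed against llama-stack's Azure provider):

  models:
    - model_id: gpt-4o-mini                          # name exposed to clients
      provider_id: azure
      provider_model_id: my-gpt-4o-mini-deployment   # hypothetical Azure deployment name
      model_type: llm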

examples/azure-run.yaml (1)

125-128: Verify deployment vs model id

As above, confirm whether provider_model_id should be the Azure deployment name rather than the model string.

.github/workflows/e2e_tests.yaml (1)

79-104: Export AZURE_API_TYPE for AAD/Bearer auth
Add immediately after setting AZURE_API_KEY:

 echo "AZURE_API_KEY=$ACCESS_TOKEN" >> $GITHUB_ENV
+echo "AZURE_API_TYPE=azure_ad" >> $GITHUB_ENV

Verify that the Azure provider reads api_type from the environment (and accepts azure_ad); adjust the key or value if needed.

Comment on lines 14 to 17
CLIENT_SECRET: ${{ secrets.CLIENT_SECRET }}
CLIENT_ID: ${{ secrets.CLIENT_ID }}
TENANT_ID: ${{ secrets.TENAND_ID }}


⚠️ Potential issue | 🔴 Critical

Fix secret name and restrict secrets to the Azure step (avoid broad exposure)

  • Typo: TENAND_ID → TENANT_ID.
  • Don’t export CLIENT_ID/CLIENT_SECRET/TENANT_ID at job scope; they leak to all steps (incl. third‑party actions). Keep them only in the Azure token step where they’re used.
     env:
       OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
-      CLIENT_SECRET: ${{ secrets.CLIENT_SECRET }}
-      CLIENT_ID: ${{ secrets.CLIENT_ID }}
-      TENANT_ID: ${{ secrets.TENAND_ID }}

They are already correctly provided again via the Azure token step’s env (keep that).

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

-      CLIENT_SECRET: ${{ secrets.CLIENT_SECRET }}
-      CLIENT_ID: ${{ secrets.CLIENT_ID }}
-      TENANT_ID: ${{ secrets.TENAND_ID }}
+    env:
+      OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
🤖 Prompt for AI Agents
.github/workflows/e2e_tests.yaml around lines 14 to 17: the job-level env has a
typo and exposes sensitive secrets broadly; change TENAND_ID to TENANT_ID and
remove CLIENT_SECRET, CLIENT_ID and TENANT_ID from the job-level env block so
they are not exported to all steps, leaving those secrets only in the specific
Azure token step’s env where they are already provided.

Comment on lines +72 to +79
    - provider_id: azure
      provider_type: remote::azure
      config:
        api_key: ${env.AZURE_API_KEY}
        api_base: https://ols-test.openai.azure.com/
        api_version: 2024-02-15-preview
        api_type: ${env.AZURE_API_TYPE:=}
  post_training:

⚠️ Potential issue | 🔴 Critical

Invalid env default syntax; parameterize api_base

  • llama-stack env substitution supports ${env.VAR}; the default form ${env.VAR:=} is likely unsupported and may be passed as a literal.
       config: 
         api_key: ${env.AZURE_API_KEY}
-        api_base: https://ols-test.openai.azure.com/
+        api_base: ${env.AZURE_API_BASE}
         api_version: 2024-02-15-preview
-        api_type: ${env.AZURE_API_TYPE:=}
+        api_type: ${env.AZURE_API_TYPE}

Set AZURE_API_BASE and (optionally) AZURE_API_TYPE in the environment (compose/workflow).

🤖 Prompt for AI Agents
In tests/e2e/configs/run-azure.yaml around lines 72 to 79, the file uses
unsupported env default syntax (${env.AZURE_API_TYPE:=}) and hardcodes api_base;
change api_base to use an environment variable (e.g. api_base:
${env.AZURE_API_BASE}) and replace the unsupported default form with a plain env
substitution (api_type: ${env.AZURE_API_TYPE}) or remove the api_type line if
optional, then ensure AZURE_API_BASE (and AZURE_API_TYPE if used) are set in the
environment/compose or workflow.
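
If the example is parameterized this way, AZURE_API_BASE also has to be supplied wherever the stack is started; a sketch of the implied wiring (the endpoint is a placeholder):

  # docker-compose.yaml (sketch): forward the base URL alongside the key
  services:
    llama-stack:
      environment:
        - AZURE_API_KEY=${AZURE_API_KEY}
        - AZURE_API_BASE=${AZURE_API_BASE}

  # CI workflow (sketch): export it next to AZURE_API_KEY
  #   echo "AZURE_API_BASE=https://<your-resource>.openai.azure.com/" >> "$GITHUB_ENV"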

asimurka force-pushed the azure-inference-provider branch from c26dffd to fcf2306 on October 10, 2025 at 12:41
radofuchs left a comment

LGTM

tisnik left a comment

LGTM

tisnik merged commit 5ad22fb into lightspeed-core:main on Oct 10, 2025
18 of 19 checks passed