You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copilot CLI supports BYOM providers since v1.0.32+, but the supported providers are limited to Anthropic-specific configurations. There is no configuration path for generic local inference endpoints (e.g., LM Studio, Ollama, llama.cpp) that serve OpenAI-compatible APIs.
This means:
Local models cannot be registered as BYOM providers for main session tool calls
Subagent tasks dispatched via runSubagent have no local routing path
Extensions like VSCode-LMStudio-Bridge cannot expose local models to the Copilot ecosystem
How Other Agents Handle This
Opencode (anomalyco/opencode)
Provider config system with explicit provider registration
Supports llama.cpp, Ollama, and generic OpenAI-compatible endpoints
Subagents inherit session model; provider configuration flows through all dispatch paths
Claude Code (anthropics/claude-code)
Defaults to cloud but overridable via SG_AGENTIC_MODEL env var
Model selection is per-request, not session-bound
Allows explicit model override for subagent tasks
Codex (openai/codex)
ModelProvider abstraction with routing layer (models_endpoint.rs)
Supports multiple providers including Amazon Bedrock and local endpoints
Clean separation between tool execution and model inference
What We Need
Generic BYOM provider registration: Allow configuration of arbitrary OpenAI-compatible endpoints as BYOM providers
Subagent model inheritance: Ensure subagent tasks respect the session model when it is a local endpoint
Provider priority/fallback: Support tiered routing (local first, cloud fallback with warning)
Security Implications
Data leakage: Workspace context transmitted to cloud without user consent when local models are available
Cost opacity: Untracked cloud API usage from subagent tasks
Problem Statement
Copilot CLI supports BYOM providers since v1.0.32+, but the supported providers are limited to Anthropic-specific configurations. There is no configuration path for generic local inference endpoints (e.g., LM Studio, Ollama, llama.cpp) that serve OpenAI-compatible APIs.
This means:
runSubagenthave no local routing pathHow Other Agents Handle This
Opencode (anomalyco/opencode)
Claude Code (anthropics/claude-code)
SG_AGENTIC_MODELenv varCodex (openai/codex)
ModelProviderabstraction with routing layer (models_endpoint.rs)What We Need
Security Implications
Related Issues
modeloverride both ignored) #3565 — multiplier guard silently downgrades subagent modelopencode web) #3301 — local model feature requestAcceptance Criteria