Gen3 Model Catalog
- Open Management → Model Catalog from the tenant sidebar (alongside Observability, Users, and Groups when those routes are visible to your role).
- Read the Default model by capability strip at the top to see which models are currently marked default for chat, embeddings, vision, speech-to-text, text-to-speech, and image generation.
- Use the shared filter toolbar (search with icon, provider/model-type/default/source/country dropdowns, match-count pill, Reset all) to narrow the catalog table.
- Click a column header to sort when you need a different table ordering.
- When you need to change what appears here, ask a Control Panel operator to update Models—this tenant page is browse-only.
Model Catalog is the read-only inventory of inference models published to your tenant. It helps you confirm which chat, embedding, vision, speech, and image models are available—and which ones are marked as deployment defaults—before you configure Agents or interpret model choices in chat and GT API.
For how to use a specific model in GT AI OS (for example Orpheus vocal directions in Read aloud), see Model Guides. For official vendor specifications and pricing, see Model Cards.
The tenant Model Catalog route (/model-catalog) lists inference models the deployment exposes to this tenant. The page does not add, remove, or edit models. The hero note states: Browse only. Catalog changes are made from the control panel.
All signed-in tenant roles (Tenant User, Tenant Manager, and Tenant Owner) can open Model Catalog when it appears under Management. Unlike Users, which only owners and managers see, Model Catalog is not hidden from the Management menu by role.

| Item | Value |
|---|---|
| Route | /model-catalog |
| Sidebar label | Model Catalog |
| Nav section | Management → Model Catalog (with Observability, Users, Groups) |
| Page eyebrow | Inference |
| Page title | Model catalog |
Management nav order in the sidebar: Observability (when your role can view it), Users (owners and managers only), Groups, then Model Catalog.
When models load successfully, the header shows summary chips:
- Total models — unique models in the catalog
- Providers — distinct provider count
- Rows with a default — models that carry at least one default badge
- Showing X of Y — appears while filters are active
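The chip values above can be derived from the loaded rows. The following is a minimal sketch, not the page's actual implementation: the `CatalogRow` shape, its field names, and the `summary_chips` helper are hypothetical and exist only to make the counting rules concrete.

```python
from dataclasses import dataclass


@dataclass
class CatalogRow:
    # Hypothetical row shape; field names are assumptions for illustration.
    model_id: str
    provider: str
    default_intents: tuple = ()  # intents this model is marked default for


def summary_chips(all_rows, visible_rows, filters_active):
    """Compute the header chips described above from catalog rows."""
    chips = {
        "Total models": len({row.model_id for row in all_rows}),
        "Providers": len({row.provider for row in all_rows}),
        "Rows with a default": sum(1 for row in all_rows if row.default_intents),
    }
    # The "Showing X of Y" chip only appears while filters are active.
    if filters_active:
        chips["Showing"] = f"Showing {len(visible_rows)} of {len(all_rows)}"
    return chips
```

For example, with three rows from two providers, one of which carries a default badge, and a filter that leaves one row visible, the sketch reports 3 total models, 2 providers, 1 row with a default, and `Showing 1 of 3`.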
Below the summary, a compact strip lists one row per catalog capability intent:
| Capability | Internal intent |
|---|---|
| Chat | chatText |
| Embeddings | embedText |
| Vision | visionDescribe |
| Speech-to-text | transcribeAudio |
| Text-to-speech | synthesizeSpeech |
| Image generation | generateImage |
Each cell shows the default model name(s) for that capability, formatted as [Provider] Model name when provider metadata exists. When no default is configured for that slice, the cell reads No default set.
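The cell formatting rule can be sketched as follows. This is illustrative only: the `format_default_cell` helper and the comma separator used for multiple defaults are assumptions, and the provider and model names in the usage note are made up.

```python
# Capability labels and their internal intents, as listed in the table above.
CAPABILITY_INTENTS = {
    "Chat": "chatText",
    "Embeddings": "embedText",
    "Vision": "visionDescribe",
    "Speech-to-text": "transcribeAudio",
    "Text-to-speech": "synthesizeSpeech",
    "Image generation": "generateImage",
}


def format_default_cell(defaults):
    """Format one strip cell.

    `defaults` is a list of (provider, model_name) pairs; provider may be
    None when no provider metadata exists. Joining multiple names with a
    comma is an assumption, not documented behavior.
    """
    if not defaults:
        return "No default set"
    return ", ".join(
        f"[{provider}] {name}" if provider else name
        for provider, name in defaults
    )
```

With these assumptions, `format_default_cell([("ExampleAI", "Example Chat")])` yields `[ExampleAI] Example Chat`, and an empty list yields `No default set`.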
Default badges follow this precedence:
- Control Panel deployment defaults: when the operator configured a default model id for that capability (`/deployment-default-models`), matching catalog rows receive a default badge for that intent.
- Resource-cluster slice default: when no Control Panel default id is set for an intent, a model marked `isDefault` in the per-intent inference list can receive the badge instead.
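A minimal sketch of this precedence, under stated assumptions: the `default_model_ids` helper, the dict shapes, and the `id` key are hypothetical. The sketch also assumes that a configured Control Panel id that matches no catalog row produces no badge rather than falling back to `isDefault`; the page may behave differently in that edge case.

```python
def default_model_ids(intent, deployment_defaults, intent_models):
    """Return the ids of models that receive the default badge for `intent`.

    deployment_defaults: dict mapping intent -> configured default model id
        (hypothetical shape of the Control Panel deployment defaults).
    intent_models: list of model dicts from the per-intent inference list,
        each with an "id" and an optional "isDefault" flag.
    """
    configured_id = deployment_defaults.get(intent)
    if configured_id:
        # 1. Control Panel default wins: badge only rows matching that id.
        return {m["id"] for m in intent_models if m["id"] == configured_id}
    # 2. No Control Panel id set: fall back to the slice's isDefault flag.
    return {m["id"] for m in intent_models if m.get("isDefault")}
```

For instance, if the chat list marks model `a` as `isDefault` but the Control Panel default for `chatText` is `b`, only `b` gets the badge; with no Control Panel default, `a` gets it.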
The table lists one row per unique model (deduplicated across intents). Columns:
| Column | Content |
|---|---|
| Model | Display name (with provider prefix when available) and default badges such as Default · Chat |
| Provider | Provider name or id |
| Model type | Intent chips (Chat, Embeddings, Vision, and so on); chips highlight when that intent is a default for the row |
| Model source | Published source metadata when present |
| Country of origin | Country metadata when present |
| Capabilities | Summarized input/output modalities plus flags such as streaming, tools, embeddings, STT, and TTS |
Empty metadata cells show an em dash (—).
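The deduplication across intents and the empty-cell placeholder can be sketched like this. The input shape (a dict of per-intent model lists), the field names, and both helper names are assumptions made for illustration.

```python
EM_DASH = "\u2014"  # placeholder shown in empty metadata cells


def build_rows(intent_lists):
    """Merge per-intent model lists into one row per unique model id.

    intent_lists: dict mapping intent name -> list of model dicts
    (hypothetical shape). Each row collects every intent the model
    appears under, in first-seen order.
    """
    rows = {}
    for intent, models in intent_lists.items():
        for model in models:
            row = rows.setdefault(model["id"], {
                "id": model["id"],
                "provider": model.get("provider"),
                "source": model.get("source"),
                "country": model.get("country"),
                "intents": [],
            })
            if intent not in row["intents"]:
                row["intents"].append(intent)
    return list(rows.values())


def display(value):
    """Render a metadata cell, using an em dash when the value is missing."""
    return value if value else EM_DASH
```

A model listed under both `chatText` and `visionDescribe` therefore produces a single row with two intent chips rather than two rows.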
Model Catalog uses the same gt2-agent-filter-toolbar pattern as Agents, Datasets, Users, and Conversations: one card-style toolbar row with consistent control sizing, adorned search, facet dropdowns, an active-filter count pill, and Reset all.
The search field matches (case-insensitive) against model name, model id, provider, model type labels, default badge text, capability summary, source, and country.
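A case-insensitive search over these fields might look like the sketch below. Substring matching, whitespace trimming, and the row keys are all assumptions; the source only states which fields are searched and that matching ignores case.

```python
# Hypothetical row keys covering the searchable fields listed above.
SEARCH_FIELDS = (
    "name", "id", "provider", "model_types",
    "default_badges", "capabilities", "source", "country",
)


def _searchable_texts(row):
    """Yield every searchable string in a row, flattening list fields."""
    for key in SEARCH_FIELDS:
        value = row.get(key)
        if value is None:
            continue
        if isinstance(value, (list, tuple)):
            for item in value:
                yield str(item)
        else:
            yield str(value)


def matches_search(row, query):
    """Return True when the query is empty or found in any searchable field."""
    needle = query.strip().casefold()
    if not needle:
        return True
    return any(needle in text.casefold() for text in _searchable_texts(row))
```

Each field is checked on its own, so a query cannot accidentally match text that spans two different columns.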
| Filter | Options |
|---|---|
| Provider | All providers, or one provider from the loaded catalog |
| Model type | All model types, or one intent (Chat, Embeddings, Vision, Speech-to-text, Text-to-speech, Image generation) |
| Default status | All rows, Defaults only, or No default badges |
| Model source | All sources, or one source from the loaded catalog |
| Country | All countries, or one country from the loaded catalog |
When any filter or search is active, Reset all clears every filter and the search box. A meta pill shows how many models match (N models match or N models listed).
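The dropdown filters and Reset all can be modeled as a small state object. This is a sketch under assumptions: the `Filters` class, the row keys, and the rule for choosing between the two pill texts (match while filters are active, listed otherwise) are not confirmed by the source. Search is left out to keep the example focused on the dropdowns.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Filters:
    # None means the "All ..." option for that dropdown.
    provider: Optional[str] = None
    model_type: Optional[str] = None
    default_status: str = "all"  # "all", "defaults", or "none"
    source: Optional[str] = None
    country: Optional[str] = None

    def is_active(self):
        """True when any dropdown differs from its reset value."""
        return self != Filters()


def apply_filters(rows, f):
    """Keep rows that satisfy every active dropdown filter."""
    def keep(row):
        if f.provider and row["provider"] != f.provider:
            return False
        if f.model_type and f.model_type not in row["intents"]:
            return False
        if f.default_status == "defaults" and not row["default_intents"]:
            return False
        if f.default_status == "none" and row["default_intents"]:
            return False
        if f.source and row.get("source") != f.source:
            return False
        if f.country and row.get("country") != f.country:
            return False
        return True
    return [row for row in rows if keep(row)]


def meta_pill(count, filters_active):
    """Pill text; which wording applies when is an assumption."""
    suffix = "models match" if filters_active else "models listed"
    return f"{count} {suffix}"
```

In this model, Reset all is simply replacing the current state with a fresh `Filters()`.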
Click a column header to sort by Model, Provider, Model type, Model source, Country of origin, or Capabilities. Click the same header again to reverse direction. The active column shows an ascending or descending indicator.
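The click-to-sort behavior can be sketched as a tiny state transition plus a sort. The helper names and row keys are hypothetical, and two details are assumptions: a newly clicked column starts ascending, and values are compared case-insensitively with empty cells treated as empty strings.

```python
def next_sort(current, clicked_column):
    """Return the new (column, ascending) state after a header click."""
    column, ascending = current
    if clicked_column == column:
        # Clicking the active header again reverses the direction.
        return (column, not ascending)
    # Assumption: a different column starts in ascending order.
    return (clicked_column, True)


def sort_rows(rows, sort_state):
    """Sort rows by the active column, ignoring case."""
    column, ascending = sort_state
    return sorted(
        rows,
        key=lambda row: str(row.get(column) or "").casefold(),
        reverse=not ascending,
    )
```

Starting from `("model", True)`, clicking Model again gives `("model", False)`, and clicking Provider afterwards gives `("provider", True)`.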
Models marked default for one or more intents show chips under the model name, for example Default · Chat or Default · Embeddings.
In the Model type column, intent chips use a highlighted style when that intent is a default for the row. Column meta text reads Has default badge or No default badge.
- Loading: `Loading model catalog…`
- No models published: `No inference models are available for this tenant yet.` with guidance that models appear after the operator publishes them.
- Filters match nothing: `No models match this filter.` with a hint to try another search or dropdown.
- Load failure: an error alert with the API message (for example `Failed to load models.`).
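The four states above can be expressed as one selection function. This is only a sketch: the `page_state_message` helper, its parameters, and the order in which the conditions are checked are assumptions.

```python
def page_state_message(loading, error, total_count, visible_count):
    """Pick the message for the current page state, or None for a normal table.

    error: the API error message, or None when the load succeeded.
    total_count: models published to the tenant.
    visible_count: models remaining after filters and search.
    """
    if loading:
        return "Loading model catalog…"
    if error is not None:
        # Show the API message; fall back to a generic one if it is empty.
        return error or "Failed to load models."
    if total_count == 0:
        return "No inference models are available for this tenant yet."
    if visible_count == 0:
        return "No models match this filter."
    return None
```

Checking the empty-catalog case before the empty-filter case keeps the two messages distinct: a tenant with no published models never sees the filter hint.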
| Surface | Role | What you do |
|---|---|---|
| Model Catalog (tenant app) | All signed-in tenant roles | Browse models, defaults, providers, and capabilities |
| Models (Control Panel) | Operators | Configure inference providers, register models, and set deployment-wide defaults |
| Agents (tenant app) | All signed-in tenant roles | Pick models inside agent configuration from the models available to the tenant, not from this catalog page |
Operators work on Models across three tabs—Inference Providers, Configured Models, and Default Models. Tenant users see the result in Model Catalog and in agent model pickers.
When an agent or chat workflow uses a model, it draws from the tenant inference registry. Model Catalog helps you verify what the registry contains and which capabilities are marked default before you assign models in Agents.
To find the default chat model, open Model Catalog and read the Chat cell in the default strip, or set Default status to Defaults only and look for Default · Chat badges.
To see the models from a single provider, set Provider to that provider and review the table. Add a Model type filter if you only need chat or embedding models.
To check what a model supports, sort or scan the Capabilities column. Summaries describe input and output modalities (text, image, audio) and note extras such as streaming, tool use, embeddings, STT, or TTS when the model supports them.
- Treat Model Catalog as reference inventory, not a configuration page.
- Check the default strip when troubleshooting unexpected model behavior in chat or agents.
- Route catalog changes (new providers, imports, default changes) to Control Panel operators via Models.
- Use filters instead of scrolling when the deployment publishes many models.