-
Notifications
You must be signed in to change notification settings - Fork 0
Python Subjects
В Python SDK есть четыре основные сущности.
ModelInfo - read-only metadata из каталога. Модель сама по себе ничего не запускает.
Она отвечает на вопросы: какой modelId, какой unit, какой backend, сколько disk/VRAM,
какие aliases, license, languages, use cases и quantizations.
Unit - выбранная capability/model пара. Примеры:
xlocllm.unit("LLM", "Qwen-3.5-0.8b")xlocllm.unit("embedding", "multilingual-e5-small")xlocllm.rag(emb=..., rerank=...)xlocllm.unit("model.onnx", type="regression", name="reg")
Unit хранит resolved model id, runtime mode, options, quantization и attached RAG unit, но обычно запускается через Runtime.
Runtime - набор units и controller lifecycle. Он выбирает bridge, порт, режим, ставит artifacts,
запускает модели, дает runtime.url, runtime.chat(), runtime.embed(), runtime.invoke().
Bridge / NativeBridge - локальный HTTP control plane на 127.0.0.1.
Он держит /v1 OpenAI-compatible API, /xlocllm/v1/* control endpoints, runtime status,
logs, model activation, cache deletion и invoke endpoints.
В обычном коде вы чаще работаете через Runtime: runtime сам создает нужный bridge.
Прямой bridge нужен, когда требуется manual control, registry lookup, health/status checks
или интеграция с уже поднятым local endpoint.
Типичный поток:
{code_block(''' model = xlocllm.model("Qwen-3.5-0.8b", unit="LLM") unit = xlocllm.unit(model) runtime = xlocllm.runtime([unit]) runtime.run() print(runtime.bridge.status()) ''')}
- xlocllm
- Quickstart
- About
- Functions Python
- Functions TypeScript
- Use cases
- Examples Python
- Examples TypeScript
- Shared GPU mode
-
Models catalog
- Models The best
- Models Full model list
- Models Use your model
- For native mode
- Models Native LLM tiny small
- Models Native LLM medium
- Models Native LLM large
- Models Native embedding
- Models Native reranker
- Models Native translator
- Models Native tts
- Models Native vlm
- Models Native asr
- Models Native ocr
- Models Native image-classification
- Models Native object-detection
- Models Native image-segmentation
- Models Native depth-estimation
- Models Native document-layout
- Models Native table-detection
- Models Native document-qa
- Models Native language-id
- Models Native audio-classification
- Models Native text-classification
- Models Native ner
- Models Native zero-shot-text
- Models Native summarization
- Models Native text2text
- Models Native code
- For webgpu mode
- For web mode
- Models Web LLM
- Models Web embedding
- Models Web reranker
- Models Web translator
- Models Web tts
- Models Web vlm
- Models Web asr
- Models Web ocr
- Models Web image-classification
- Models Web object-detection
- Models Web image-segmentation
- Models Web depth-estimation
- Models Web document-layout
- Models Web table-detection
- Models Web document-qa
- Models Web zero-shot-image
- Models Web language-id
- Models Web audio-classification
- Models Web text-classification
- Models Web ner
- Models Web zero-shot-text
- Models Web summarization
- Models Web text2text
- Models Web code
- Dev