Python Model Syntax
Mike edited this page May 28, 2026 · 1 revision

    import xlocllm

    # Look up a single catalog entry by model name, unit type, and mode.
    info = xlocllm.model("Qwen-3.5-0.8b", unit="LLM", mode="native")
    print(info.model_id)
    print(info.to_dict())

| Filter | Description |
|---|---|
| `unit` | unit type |
| `mode` | `native` or `web` |
| `runtime` | `native`, `mlc`, `transformers` |
| `task` | backend task |
| `task_group` | task category |
| `hardware_tier` | tiny/small/medium/large |
| `language` | language from `languages` |
| `provider` | provider from the catalog |
| `availability` | verified, candidate, unsupported, ... |
| `npu` | filter on `npuEligible` |
| `webgpu=False` | CPU/WASM fallback candidates only |
| `cpu=True` | alias for the CPU/WASM fallback |
| `available_without_webgpu=True` | explicit fallback filter |
| `installed` | filter on installed artifacts, if bridge/cache status is available |
| `hardware` | provider/hardware snapshot for native filtering |
| `include_unavailable` | include unsupported/unavailable entries |
| `quant` | selects GGUF quantization metadata |
| `subtype` | subtype: reasoning, cross-encoder, caption-vqa, ... |
| `modality` | text, image, audio, ... |
| `use_case` | rag, chat, ocr, search-ranking, ... |
| `license` | exact license |
| `min_context` | minimum context length |
| `search` | search across id/label/provider/unit/tags/aliases |
| `max_vram_mb` | VRAM limit |
| `max_disk_mb` | disk/cache limit |
| `max_size_gb` | size limit in GB |
| `max_parameters_b` | parameter count limit (B) |
| `limit_per_unit` | maximum number of entries per unit type |
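
To make the filter semantics in the table concrete, here is a minimal, self-contained sketch of how a few of them might behave on an in-memory catalog. It does not call xlocllm and is not the library's implementation: `CatalogEntry`, `filter_entries`, the field names (`vram_mb`, `context`, `webgpu_required`), and the sample entries are all hypothetical. It also assumes that filters combine with logical AND and that a filter left unset is ignored.

```python
from dataclasses import dataclass


@dataclass
class CatalogEntry:
    # Hypothetical record shape; the real catalog fields may differ.
    model_id: str
    unit: str
    mode: str
    vram_mb: int
    context: int
    webgpu_required: bool = False


def filter_entries(entries, unit=None, mode=None, max_vram_mb=None,
                   min_context=None, webgpu=None, limit_per_unit=None):
    """Keep entries that satisfy every filter that was set (assumed AND)."""
    kept = []
    per_unit = {}  # number of entries already kept for each unit type
    for entry in entries:
        if unit is not None and entry.unit != unit:
            continue
        if mode is not None and entry.mode != mode:
            continue
        if max_vram_mb is not None and entry.vram_mb > max_vram_mb:
            continue
        if min_context is not None and entry.context < min_context:
            continue
        # webgpu=False keeps only entries that can run without WebGPU.
        if webgpu is False and entry.webgpu_required:
            continue
        count = per_unit.get(entry.unit, 0)
        if limit_per_unit is not None and count >= limit_per_unit:
            continue
        per_unit[entry.unit] = count + 1
        kept.append(entry)
    return kept


# Made-up sample data, for illustration only.
CATALOG = [
    CatalogEntry("tiny-llm", "LLM", "native", 900, 32768),
    CatalogEntry("big-llm", "LLM", "native", 12000, 131072),
    CatalogEntry("web-llm", "LLM", "web", 1200, 4096, webgpu_required=True),
    CatalogEntry("web-embed", "embedding", "web", 300, 512),
]

small_native = filter_entries(CATALOG, unit="LLM", mode="native", max_vram_mb=1500)
cpu_web = filter_entries(CATALOG, mode="web", webgpu=False)
print([e.model_id for e in small_native])  # ['tiny-llm']
print([e.model_id for e in cpu_web])       # ['web-embed']
```

In this sketch `limit_per_unit` is applied last, so it caps the number of entries per unit type among those that already passed the other filters. Whether xlocllm applies it in the same order is not stated on this page.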

    small_llms = xlocllm.models(unit="LLM", max_vram_mb=1500)
    native_llms = xlocllm.models(unit="LLM", mode="native")
    cpu_web = xlocllm.models(mode="web", webgpu=False)
    rag_embeddings = xlocllm.models(unit="embedding", mode="native", use_case="rag")
    vlms = xlocllm.models(unit="vlm", mode="native", modality="image")
    long_context = xlocllm.models(unit="LLM", min_context=32768)