Models Web vlm

Режим: Web. Категория: VLM / image-to-text.

Всего моделей: 7.

SmolVLM 256M Instruct

Поле	Значение
Название	`HuggingFaceTB/SmolVLM-256M-Instruct`
Label	SmolVLM 256M Instruct
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.24 GB catalog size, ~243 MB disk/cache
Параметры	0.256B
Минимальные требования	tiny - очень слабое железо, CPU/WASM или небольшой GPU; disk >= 243 MB; memory/VRAM около 291 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 803 MB+ свободной VRAM/RAM и 755 MB+ disk cache
HF link	HuggingFaceTB/SmolVLM-256M-Instruct
Доступные quantizations	`auto`
Краткое описание	ONNX/WebGPU artifact marker
Best in	captioning, VQA, image-to-text

FastVLM 0.5B

Поле	Значение
Название	`onnx-community/FastVLM-0.5B-ONNX`
Label	FastVLM 0.5B
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.46 GB catalog size, ~475 MB disk/cache
Параметры	0.5B
Минимальные требования	small - ноутбук или небольшой GPU; disk >= 475 MB; memory/VRAM около 570 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 1082 MB+ свободной VRAM/RAM и 987 MB+ disk cache
HF link	onnx-community/FastVLM-0.5B-ONNX
Доступные quantizations	`auto`
Краткое описание	known browser-ready provider
Best in	captioning, VQA, image-to-text

Florence 2 base

Поле	Значение
Название	`onnx-community/Florence-2-base`
Label	Florence 2 base
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.63 GB catalog size, ~650 MB disk/cache
Параметры	не указаны
Минимальные требования	small - ноутбук или небольшой GPU; disk >= 650 MB; memory/VRAM около 780 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 1292 MB+ свободной VRAM/RAM и 1162 MB+ disk cache
HF link	onnx-community/Florence-2-base
Доступные quantizations	`auto`
Краткое описание	known browser-ready provider
Best in	captioning, VQA, image-to-text

Florence 2 large

Поле	Значение
Название	`onnx-community/Florence-2-large`
Label	Florence 2 large
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.63 GB catalog size, ~650 MB disk/cache
Параметры	не указаны
Минимальные требования	small - ноутбук или небольшой GPU; disk >= 650 MB; memory/VRAM около 780 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 1292 MB+ свободной VRAM/RAM и 1162 MB+ disk cache
HF link	onnx-community/Florence-2-large
Доступные quantizations	`auto`
Краткое описание	known browser-ready provider
Best in	captioning, VQA, image-to-text

Florence 2 large ft

Поле	Значение
Название	`onnx-community/Florence-2-large-ft`
Label	Florence 2 large ft
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.63 GB catalog size, ~650 MB disk/cache
Параметры	не указаны
Минимальные требования	small - ноутбук или небольшой GPU; disk >= 650 MB; memory/VRAM около 780 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 1292 MB+ свободной VRAM/RAM и 1162 MB+ disk cache
HF link	onnx-community/Florence-2-large-ft
Доступные quantizations	`auto`
Краткое описание	known browser-ready provider
Best in	captioning, VQA, image-to-text

ViT GPT-2 image captioning

Поле	Значение
Название	`Xenova/vit-gpt2-image-captioning`
Label	ViT GPT-2 image captioning
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.45 GB catalog size, ~460 MB disk/cache
Параметры	не указаны
Минимальные требования	small - ноутбук или небольшой GPU; disk >= 460 MB; memory/VRAM около 950 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 1462 MB+ свободной VRAM/RAM и 972 MB+ disk cache
HF link	Xenova/vit-gpt2-image-captioning
Доступные quantizations	`q8`
Краткое описание	Catalog entry for VLM / image-to-text.
Best in	captioning, VQA, image-to-text

Florence 2 base finetuned

Поле	Значение
Название	`onnx-community/Florence-2-base-ft`
Label	Florence 2 base finetuned
Категория	VLM / image-to-text
Runtime/backend	transformers
Вес	0.76 GB catalog size, ~780 MB disk/cache
Параметры	не указаны
Минимальные требования	medium - рабочая станция или заметный GPU; disk >= 780 MB; memory/VRAM около 1500 MB; браузер CPU/WASM; WebGPU полезен, если доступен
Оптимальные требования	modern CPU plus WebGPU/NPU when available; желательно 2025 MB+ свободной VRAM/RAM и 1292 MB+ disk cache
HF link	onnx-community/Florence-2-base-ft
Доступные quantizations	`q8`
Краткое описание	Catalog entry for VLM / image-to-text.
Best in	captioning, VQA, image-to-text

Models Web vlm

Models Web vlm

SmolVLM 256M Instruct

FastVLM 0.5B

Florence 2 base

Florence 2 large

Florence 2 large ft

ViT GPT-2 image captioning

Florence 2 base finetuned

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!