Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
Updated Oct 17, 2024 (Python)
A collection of guides and examples for the Gemma open models from Google.
MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.
Use PaliGemma to auto-label data for use in training fine-tuned vision models.
Example code for fine-tuning multimodal large language models (LLMs) with LLaMA-Factory
Vision language model fine-tuning notebooks and use cases (PaliGemma, Florence, ...)
PaliGemma fine-tuning
PaliGemma inference and fine-tuning
Notes for the Vision Language Model implementation by Umar Jamil
Using PaliGemma with 🤗 transformers