llava

Here are 101 public repositories matching this topic...

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot llama multimodal multi-modality gpt-4 foundation-models visual-language-learning chatgpt instruction-tuning vision-language-model llava llama2 llama-2

Updated Aug 12, 2024
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

cuda inference pytorch transformer moe llama vlm llm llm-serving llava llama2 llama3 llama3-1

Updated Nov 3, 2024
Python

Fanghua-Yu / SUPIR

Star

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

deep-learning pytorch super-resolution restoration diffusion-models pytorch-lightning stable-diffusion llava sdxl

Updated Jul 30, 2024
Python

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Updated Nov 2, 2024
Python

InternLM / xtuner

Star

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent chatbot conversational-ai peft baichuan msagent large-language-models llm supervised-finetuning llava llm-training chatglm2 internlm llama2 qwen chatglm3 mixtral llama3 phi3

Updated Oct 22, 2024
Python

modelscope / data-juicer

Star

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Updated Nov 1, 2024
Python

open-compass / VLMEvalKit

Star

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated Nov 3, 2024
Python

mbzuai-oryx / Video-ChatGPT

Star

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

chatbot llama clip mulit-modal vision-language vicuna gpt-4 vision-language-pretraining llava video-chatboat video-conversation

Updated Aug 27, 2024
Python

unum-cloud / uform

Star

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Updated Oct 1, 2024
Python

mbzuai-oryx / LLaVA-pp

Star

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

conversation lmms vision-language llm llava llama3 phi3 llava-llama3 llava-phi3 llama3-llava phi3-llava llama-3-vision phi3-vision llama-3-llava phi-3-llava llama3-vision phi-3-vision

Updated Jul 10, 2024
Python

jhc13 / taggui

Star

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava cogvlm florence-2

Updated Nov 1, 2024
Python

PsyChip / machina

Star

OpenCV+YOLO+LLAVA powered video surveillance system

python opencv camera rtsp yolo llava ollama-api

Updated Oct 22, 2024
Python

TinyLLaVA / TinyLLaVA_Factory

Star

A Framework of Small-scale Large Multimodal Models

nlp transformers llama vision-language llava large-multimodal-models tinyllama

Updated Oct 16, 2024
Python

SkalskiP / awesome-foundation-and-multimodal-models

Sponsor

Star

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

nlp computer-vision image-captioning clip blip multimodal zero-shot-detection foundational-models llava segment-anything open-vocabulary-detection open-vocabulary-segmentation grounding-dino

Updated Feb 29, 2024
Python

NVlabs / EAGLE

Star

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

demo eagle llama lmm nvdia huggingface gpt4 large-language-models llm mllm llava lvlm llama3

Updated Sep 19, 2024
Python

Blaizzy / mlx-vlm

Sponsor

Star

MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.

mlx vision-framework apple-silicon vision-transformer llm vision-language-model llava local-ai idefics paligemma

Updated Nov 3, 2024
Python

gokayfem / ComfyUI_VLM_nodes

Star

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

image-captioning nodes vlm custom-nodes img2text llm mllm llava comfyui siglip phi15 joytag img2sfx

Updated Nov 3, 2024
Python

nrl-ai / llama-assistant

Star

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.

personal-assistant llama owen llava moondream private-gpt llama3 llama-3-2

Updated Oct 11, 2024
Python

apocas / restai

Sponsor

Star

RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex & Langchain. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama/vLLM/etc. Precise embeddings usage and tuning. Image generation (Dall-E, SD, Flux).

python transformers embeddings openai llama rag fastapi llm stable-diffusion langchain openaiapi llava llamaindex ollama

Updated Nov 2, 2024
Python

jakobdylanc / llmcord

Sponsor

Star

A Discord LLM chat bot that supports any OpenAI compatible API (OpenAI, xAI, Mistral, Groq, OpenRouter, ollama, LM Studio and more)

bot discord chatbot openai llama gpt grok mistral xai groq gpt-4 llm chatgpt llava oobabooga ollama lmstudio llama3 gpt-4o

Updated Nov 3, 2024
Python

Improve this page

Add a description, image, and links to the llava topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llava topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llava

Here are 101 public repositories matching this topic...

haotian-liu / LLaVA

sgl-project / sglang

Fanghua-Yu / SUPIR

modelscope / ms-swift

InternLM / xtuner

modelscope / data-juicer

open-compass / VLMEvalKit

mbzuai-oryx / Video-ChatGPT

unum-cloud / uform

mbzuai-oryx / LLaVA-pp

jhc13 / taggui

PsyChip / machina

TinyLLaVA / TinyLLaVA_Factory

SkalskiP / awesome-foundation-and-multimodal-models

NVlabs / EAGLE

Blaizzy / mlx-vlm

gokayfem / ComfyUI_VLM_nodes

nrl-ai / llama-assistant

apocas / restai

jakobdylanc / llmcord

Improve this page

Add this topic to your repo