Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect for private AI conversations and custom voice assistants.
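The WebSocket-streaming pattern named in that description looks roughly like the minimal FastAPI sketch below: accept a connection, read a user message, and stream the reply back in chunks. The `/ws` route and the `stream_reply` helper are hypothetical names for illustration, not echoOLlama's actual code.

```python
# Minimal sketch of WebSocket streaming with FastAPI (illustrative; not echoOLlama's code).
# The /ws route and stream_reply helper are hypothetical names.
from fastapi import FastAPI, WebSocket, WebSocketDisconnect

app = FastAPI()


async def stream_reply(prompt: str):
    """Hypothetical stand-in for a local LLM backend: yields the reply in small chunks."""
    for chunk in ("You ", "said: ", prompt):
        yield chunk


@app.websocket("/ws")
async def chat(websocket: WebSocket):
    await websocket.accept()
    try:
        while True:
            prompt = await websocket.receive_text()       # wait for a user message
            async for chunk in stream_reply(prompt):      # stream the reply piece by piece
                await websocket.send_text(chunk)
            await websocket.send_text("[DONE]")           # simple end-of-turn marker
    except WebSocketDisconnect:
        pass  # client closed the connection
```

Served with `uvicorn`, a client can open the WebSocket, send a prompt, and receive the response incrementally instead of waiting for the full reply.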
Build a simple, basic multimodal large model from scratch. 🤖
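"A basic multimodal model from scratch" usually means a LLaVA-style design: image features are projected into the language model's embedding space and prepended to the text tokens. The PyTorch skeleton below is an illustrative sketch of that pattern; all class names and dimensions are assumptions, not this repository's architecture.

```python
# Illustrative LLaVA-style skeleton; not this repository's actual architecture.
import torch
import torch.nn as nn


class TinyMultimodalLM(nn.Module):
    def __init__(self, vision_dim=768, hidden_dim=512, vocab_size=32000):
        super().__init__()
        self.vision_encoder = nn.Linear(vision_dim, vision_dim)        # stand-in for a ViT
        self.projector = nn.Linear(vision_dim, hidden_dim)             # maps image features into LM space
        self.token_embed = nn.Embedding(vocab_size, hidden_dim)
        layer = nn.TransformerEncoderLayer(hidden_dim, nhead=8, batch_first=True)
        self.lm = nn.TransformerEncoder(layer, num_layers=2)           # stand-in LM (causal mask omitted)
        self.lm_head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, image_feats, input_ids):
        img = self.projector(self.vision_encoder(image_feats))         # (B, N_img, hidden)
        txt = self.token_embed(input_ids)                              # (B, N_txt, hidden)
        seq = torch.cat([img, txt], dim=1)                             # image tokens first, then text
        return self.lm_head(self.lm(seq))                              # logits over the vocabulary


model = TinyMultimodalLM()
logits = model(torch.randn(2, 16, 768), torch.randint(0, 32000, (2, 8)))
print(logits.shape)  # torch.Size([2, 24, 32000])
```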
A multimodal image search engine built on the GME model, capable of handling diverse input types. Whether you query with text, an image, or both, it provides powerful and flexible image retrieval. Perfect for research and demos.
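The retrieval idea behind such an engine is typically embedding-based: every gallery image and every query (text, image, or both) is mapped into one shared vector space, and search is a nearest-neighbour lookup by cosine similarity. In the sketch below, `gme_embed()` is a hypothetical stand-in for the actual GME model call.

```python
# Embedding-based retrieval sketch; gme_embed() is a hypothetical stand-in for the GME model.
import numpy as np


def gme_embed(text=None, image=None) -> np.ndarray:
    """Hypothetical stand-in: returns a unit-norm embedding for text, an image, or both."""
    rng = np.random.default_rng(abs(hash((text, image))) % (2**32))  # deterministic fake embedding
    v = rng.standard_normal(128)
    return v / np.linalg.norm(v)


def build_index(image_paths):
    # Embed every gallery image once; shape (num_images, dim).
    return np.stack([gme_embed(image=p) for p in image_paths])


def search(index, image_paths, text=None, image=None, k=5):
    query = gme_embed(text=text, image=image)        # works for text-only, image-only, or both
    scores = index @ query                           # cosine similarity (embeddings are unit-norm)
    top = np.argsort(-scores)[:k]
    return [(image_paths[i], float(scores[i])) for i in top]


paths = ["cat.png", "dog.png", "sunset.png"]
index = build_index(paths)
print(search(index, paths, text="a dog playing outside", k=2))
```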
🎉 The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
This repository showcases a collection of innovative projects by Charan H U, focusing on cutting-edge technologies such as facial emotion recognition, fitness tracking, and multi-model applications. Each project demonstrates practical implementations of advanced AI/ML techniques, making it a valuable resource for developers and researchers.
An AI multi-model application built with RAG (retrieval-augmented generation) and LangChain.
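A typical shape for such a RAG pipeline is: index documents, retrieve the most relevant ones for a question, then pass them to the LLM as context. The sketch below is generic, not this repository's code, and assumes the `langchain-openai` and `langchain-community` packages with an in-memory FAISS index; the example documents are invented.

```python
# Generic RAG sketch with LangChain; not this repository's code.
# Assumes `pip install langchain-openai langchain-community faiss-cpu` and OPENAI_API_KEY set.
from langchain_openai import ChatOpenAI, OpenAIEmbeddings
from langchain_community.vectorstores import FAISS

docs = [
    "Seg-Zero couples a reasoning chain with a segmentation model.",
    "GME embeds text and images into a shared retrieval space.",
]

vectorstore = FAISS.from_texts(docs, OpenAIEmbeddings())            # 1. index the documents

question = "What does GME do?"
hits = vectorstore.similarity_search(question, k=2)                 # 2. retrieve relevant chunks
context = "\n".join(d.page_content for d in hits)

llm = ChatOpenAI(model="gpt-4o-mini")                               # 3. generate an answer from context
answer = llm.invoke(f"Answer using only this context:\n{context}\n\nQuestion: {question}")
print(answer.content)
```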
Evaluating ‘Graphical Perception’ with Multimodal Large Language Models
This repo contains the integration of LangChain with the Google Gemini LLM.
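A minimal sketch of what that integration typically looks like, assuming the `langchain-google-genai` package and a `GOOGLE_API_KEY` environment variable; the model name used here is an assumption:

```python
# Minimal LangChain + Gemini sketch; assumes `pip install langchain-google-genai`
# and GOOGLE_API_KEY set in the environment. The model name is an assumption.
from langchain_google_genai import ChatGoogleGenerativeAI

llm = ChatGoogleGenerativeAI(model="gemini-1.5-flash", temperature=0)
reply = llm.invoke("In one sentence, what is a multimodal large language model?")
print(reply.content)
```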
Uses MAIRA-2, a multimodal transformer designed to generate grounded or non-grounded radiology reports from chest X-rays.
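MAIRA-2 is distributed on Hugging Face and loads through the standard `transformers` Auto classes with remote code enabled. The sketch below covers loading only; the model id and the report-generation preprocessing should be checked against the model card, so treat both as assumptions.

```python
# Loading sketch for MAIRA-2 (verify the model id and preprocessing against the model card).
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/maira-2"  # assumed Hugging Face model id
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model.eval()

# Report generation then follows the processor's chest-X-ray preprocessing utilities
# described in the model card, followed by model.generate(...).
```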
Create a tool that uses a multimodal LLM to generate testing instructions for any digital product's features based on screenshots.
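One straightforward way to build such a tool is to send each screenshot to a vision-capable chat model as a base64 data URL together with a prompt asking for test steps. The sketch below assumes the OpenAI Python SDK (openai>=1.x) as the multimodal backend; any vision LLM with an equivalent API would do, and the model name is an assumption.

```python
# Screenshot -> testing-instructions sketch; assumes the OpenAI Python SDK (openai>=1.x)
# and OPENAI_API_KEY set. The model name is an assumption.
import base64
from openai import OpenAI

client = OpenAI()


def testing_instructions(screenshot_path: str) -> str:
    with open(screenshot_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode()        # encode the screenshot inline
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Write step-by-step testing instructions for the feature shown in this screenshot."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content
```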