ORYX

Awesome Reasoning LLM Tutorial/Survey/Guide

Video-ChatGPT Public

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1.4k 111

groundingLMM Public

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 867 47

LLaVA-pp Public

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 836 61

MobiLlama Public

MobiLlama : Small Language Model tailored for edge devices

Python 635 48

GeoChat Public

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 555 48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Popular repositories Loading

Repositories

People

Top languages

Most used topics