multi-modal

Language Modeling Research Hub: a compendium for enthusiasts and scholars studying language models (LMs), with a particular focus on large language models (LLMs).

open-source api-wrapper accelerate multi-modal pretraining large-language-models llm rlhf instruction-tuning

Updated Jun 7, 2024
Python

docarray / docarray

Represent, send, store and search multimodal data

elasticsearch machine-learning deep-learning protobuf pytorch data-structures nearest-neighbor-search cross-modal multi-modal semantic-search multimodal nested-data weaviate dataclass pydantic fastapi neural-search qdrant docarray

Updated Jun 6, 2024
Python

vercel / modelfusion

The TypeScript library for building AI applications.

Updated Jun 6, 2024
TypeScript

valhalla / valhalla

Open Source Routing Engine for OpenStreetMap

directions openstreetmap routing astar traveling-salesman dijkstra routing-engine isochrones multi-modal tiled

Updated Jun 8, 2024
C++

colurw / temporal_CNN

Time-series forecasting of market price data using a multi-modal Convolutional Neural Network

numpy pandas multi-modal time-series-forecasting tensorflow2

Updated Jun 6, 2024
Jupyter Notebook

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated Jun 6, 2024
Python

modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Updated Jun 6, 2024
Python

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model that approaches GPT-4V performance.

image-classification gpt multi-modal semantic-segmentation video-classification mme image-text-retrieval llm vision-language-model gpt-4v vit-6b vit-22b gpt-4o

Updated Jun 6, 2024
Python

zjysteven / VLM-Visualizer

Visualizing the attention of vision-language models

attention multi-modal attention-mechanism vision-language vision-language-model llava

Updated Jun 6, 2024
Jupyter Notebook

open-compass / VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated Jun 5, 2024
Python

howard-hou / VisualRWKV

VisualRWKV is the visually enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.

multi-modal large-language-models rwkv

Updated Jun 4, 2024
Python

Yuan-ManX / ai-multimodal-timeline

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥

ai multi-modal ai-agents deeplearning-ai multimodal multimodal-deep-learning llm

Updated Jun 4, 2024

IntelLabs / fastRAG

Efficient Retrieval Augmentation and Generation Framework

nlp benchmark information-retrieval transformers knowledge-graph question-answering summarization multi-modal semantic-search diffusion sentence-transformers colbert llm generative-ai

Updated Jun 4, 2024
Python

Here are 272 public repositories matching this topic...

SciSharp / LLamaSharp

Lizhecheng02 / MultiModal

modelscope / agentscope

OpenBMB / MiniCPM-V

THUDM / CogVLM2

deep-symbolic-mathematics / Multimodal-Math-Pretraining

marqo-ai / marqo

patrick-tssn / LM-Research-Hub
