Stars
Open-source framework for voice and multimodal conversational AI
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An open source framework for programming photonic quantum computers
A high-throughput and memory-efficient inference and serving engine for LLMs
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
Sample code for Azure Communication Services Python quickstarts
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and…
OCR, layout analysis, reading order, table recognition in 90+ languages
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Enforce the output format (JSON Schema, Regex, etc.) of a language model
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
An open source implementation of CLIP.
This repository contains demos I made with the Transformers library by HuggingFace.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
swagger-codegen contains a template-driven engine to generate documentation, API clients and server stubs in different languages by parsing your OpenAPI / Swagger definition.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Abstraction for local and remote filesystems
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.