Stars
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
A code-driven low-code builder: develop low-code apps on your own codebase.
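⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
text2vec, text to vector. A text vector representation tool that converts text into vector matrices; implements Word2Vec, RankBM25, Sentence-BERT, CoSENT, and other text representation and text similarity models, ready to use out of the box.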
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Instant voice cloning by MIT and MyShell. Audio foundation model.
🔊 Text-Prompted Generative Audio Model
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
🔥🔥A PyTorch implementation of head pose estimation (yaw, roll, pitch) and emotion detection with SOTA performance in real time. Easy to deploy, easy to use, and highly accurate. Solve all problems of fac…
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
Python Decision Tree + D3 Visualizer
Code developed in D3 to visualize an XGBoost decision tree interactively
Eye blink (closed/open) detection using a CNN (Keras)
Trains and implements liveness detection with video input in Python code
Semantic segmentation for hair, face and background