Stars
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.01935
Fully open reproduction of DeepSeek-R1
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Example distributed app composed of multiple containers for Docker, Compose, Swarm, and Kubernetes
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation
RAGChecker: A Fine-grained Framework For Diagnosing RAG
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]
📚Open Source Curriculum for CNCF Certification Courses