Lists (21)
Sort Name ascending (A-Z)
3DFace
audio
chatgpt
data selection
detection
diffusion
Face Edit
faceswap
GPT
great vision model
inpainting
matting
network compression
palm
pet
program skill
self supervised
transportation
utilities
video
vpn
Stars
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Pioneering Multimodal Reasoning with CoT
Understanding R1-Zero-Like Training: A Critical Perspective
[NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
No fortress, purely open ground. OpenManus is Coming.
AI models trained by Google to classify species in images from motion-triggered widlife cameras.
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
like jq but for Markdown: find specific elements in a md doc
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
#1 Locally hosted web application that allows you to perform various operations on PDF files
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
A simple, easy-to-hack GraphRAG implementation
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
🤗 smolagents: a barebones library for agents that think in python code.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
SGLang is a fast serving framework for large language models and vision language models.
Local Ollama and OpenAI-like GPT's assistance for maximum privacy and offline access
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)