Lists (2)
Sort Name ascending (A-Z)
Stars
mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas…
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights from hours of content in seconds with semantic search and preci…
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
No fortress, purely open ground. OpenManus is Coming.
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A Gradio web UI for Large Language Models with support for multiple inference backends.
Toolkit for linearizing PDFs for LLM datasets/training
A simple screen parsing tool towards pure vision based GUI agent
Wan: Open and Advanced Large-Scale Video Generative Models
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
DeepEP: an efficient expert-parallel communication library