Lists (1)
Sort Name ascending (A-Z)
Stars
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Python tool for converting files and office documents to Markdown.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
SkyReels V1: The first and most advanced open-source human-centric video foundation model
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
A simple screen parsing tool towards pure vision based GUI agent
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Make websites accessible for AI agents
No fortress, purely open ground. OpenManus is Coming.
😎简单易用、🧩丰富生态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram 等平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI Grok、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot、ChatGL…
微信机器人框架,个人微信二次开发,最简单易用的免费二开框架,微信ipad登录(非HOOK破解桌面端)
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
微信机器人,可接入DeepSeek、Gemini、ChatGPT、ChatGLM、讯飞星火、Tigerbot等大模型。微信 hook WeChat Robot Hook.
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
This is the official source code of FreeCAD, a free and opensource multiplatform 3D parametric modeler.
Python package for 3D geometry CAD/BIM/CAM
A NodeJS RAG framework to easily work with LLMs and embeddings
AutoHotkey - macro-creation and automation-oriented scripting utility for Windows.