
Starred repositories
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
AI 助手全套开源解决方案,自带运营管理后台,开箱即用。集成了 ChatGPT, Azure, ChatGLM,讯飞星火,文心一言等多个平台的大语言模型。支持 MJ AI 绘画,Stable Diffusion AI 绘画,微博热搜等插件工具。采用 Go + Vue3 + element-plus 实现。
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
A versatile Unity scroll view component that enables highly flexible animations.
Implementation of the Slay the Spire Map in Unity3d
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
A set of web-based tools for generating graphics and other assets that would eventually be in an Android application's res/ directory.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
IDM Activation & Trail Reset Script
MFCC-based LipSync plug-in for Unity using Job System and Burst Compiler
# Edge-TTS Web 一个基于 Microsoft Edge 浏览器 TTS 引擎的在线语音合成系统,提供简单易用的 Web 界面。 特性 🌍 支持多语言:中文(简体、繁体、粤语)、英语、日语等 74 种语言 - 🎭 丰富音色:提供 318 种不同的声音选项 - 🎛️ 灵活调节:支持语速调整(0.25x-4x) - 📝 字幕支持:自动生成 SRT 格式字幕 - 🎯 精准同步:音频与字…
An open-source cross-platform alternative to AirDrop
zero-shot voice conversion & singing voice conversion, with real-time support
TikTok 发布/喜欢/合辑/直播/视频/图集/音乐;抖音发布/喜欢/收藏/收藏夹/视频/图集/实况/直播/音乐/合集/评论/账号/搜索/热榜数据采集工具
Android 动画各种实现,包括帧动画、补间动画和属性动画的总结分享
Edit lottie animation colors --> https://magna25.github.io/lottie-editor/
Real-Time Animation Editor with Collaboration
🌝 MLKit是一个强大易用的工具包。通过ML Kit您可以很轻松的实现文字识别、条码识别、图像标记、人脸检测、对象检测等功能。
Android filters based on OpenGL (idea from GPUImage for iOS)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accurate text in an instant!
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
🔥🔥🔥自定义Android相机(仿抖音 TikTok),其中功能包括视频人脸识别贴纸,美颜,分段录制,视频裁剪,视频帧处理,获取视频关键帧,视频旋转,添加滤镜,添加水印,合成Gif到视频,文字转视频,图片转视频,音视频合成,音频变声处理,SoundTouch,Fmod音频处理。 Android camera(imitation Tik Tok), which includes video e…
Starred topics
