Stars
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Privacy-Preserving Computing Platform 由密码学专家团队打造的开源隐私计算平台,支持多方安全计算、联邦学习、隐私求交、匿踪查询等。
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
For releasing code related to compression methods for transformers, accompanying our publications
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
A generative speech model for daily dialogue.
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
TinyChatEngine: On-Device LLM Inference Library
Offsite-Tuning: Transfer Learning without Full Model
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
High-Resolution Image Synthesis with Latent Diffusion Models
[ACL 2020] Generating Diverse and Consistent QA pairs from Contexts with Information-Maximizing Hierarchical Conditional VAEs
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。
An easy-to-use federated learning platform
An Industrial Grade Federated Learning Framework
Open-Sora: Democratizing Efficient Video Production for All
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!