An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,370 178 Updated Feb 14, 2025

SciPhi-AI / R2R

The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.

Python 5,342 399 Updated Mar 10, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 19,855 1,539 Updated Mar 3, 2025

astramind-ai / Auralis

A Fast TTS Engine

Python 463 34 Updated Jan 23, 2025

github / gitignore

A collection of useful .gitignore templates

164,911 83,099 Updated Mar 3, 2025

lifeiteng / OmniSenseVoice

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 806 32 Updated Mar 7, 2025

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 5,176 584 Updated Mar 10, 2025

microsoft / vscode-jupyter

VS Code Jupyter extension

TypeScript 1,339 304 Updated Mar 7, 2025

chonkie-ai / chonkie

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

Python 2,725 119 Updated Mar 10, 2025

Delgan / loguru

Python logging made (stupidly) simple

Python 21,009 720 Updated Mar 1, 2025

vllm-project / flash-attention

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 48 49 Updated Mar 5, 2025

clash-verge-rev / clash-verge-rev

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 50,181 3,926 Updated Mar 9, 2025

mtdvio / every-programmer-should-know

A collection of (mostly) technical things every software developer should know about

86,528 7,956 Updated Aug 6, 2024

jina-ai / late-chunking

Code for explaining and evaluating late chunking (chunked pooling)

Python 340 35 Updated Dec 23, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,618 1,782 Updated Mar 8, 2025

gusye1234 / nano-graphrag

A simple, easy-to-hack GraphRAG implementation

Python 2,573 256 Updated Jan 15, 2025

adbar / trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 4,014 284 Updated Feb 17, 2025

Vahe1994 / AQLM

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1,220 183 Updated Mar 3, 2025

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,845 194 Updated Nov 14, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,832 417 Updated Mar 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Will Liang liangmingxin

Block or report liangmingxin

Stars

sgl-project / sglang

SesameAILabs / csm

github / docs

deepseek-ai / awesome-deepseek-integration

deepseek-ai / FlashMLA

deepseek-ai / DeepSeek-V3

huggingface / open-r1

jendrikseipp / vulture

gaogaotiantian / viztracer

ray-project / llmperf

modelscope / ClearerVoice-Studio