Lists (8)
Sort Name ascending (A-Z)
- All languages
- Astro
- Blade
- C
- C#
- C++
- CMake
- CSS
- CUE
- Clojure
- CodeQL
- ColdFusion
- Dart
- Dockerfile
- EJS
- Elixir
- Erlang
- F#
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Lua
- MDX
- Makefile
- Markdown
- Nix
- OCaml
- Objective-C
- PHP
- PLpgSQL
- Pascal
- PowerShell
- Processing
- Python
- QML
- R
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- Scheme
- Shell
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
- YAML
- Zig
Starred repositories
Web Extension for saving a faithful copy of a complete web page in a single HTML file
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
Converts SRT subtitle file to SSML file with speech durations
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.
AI Vtuber for Streaming on Youtube/Twitch
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.
Make your AirPlay devices as TTS speakers
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Transform PDFs into AI podcasts for engaging on-the-go audio content.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Foundational model for human-like, expressive TTS
MARS5 speech model (TTS) from CAMB.AI
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
SDK & Sample to do speech recognition using websockets in Javascript
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
JeForceX / OpenTTS
Forked from synesthesiam/openttsOpen Text to Speech Server
Simple, unified interface to multiple Generative AI providers
This is a project that unifies the management of LLM APIs. It can call multiple backend services through a unified API interface, convert them to the OpenAI format uniformly, and support load balan…