speech-to-speech

Here are 52 public repositories matching this topic...

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

speech-to-text speech-to-speech large-language-models multimodal-large-language-models speech-language-model speech-interaction

Updated Nov 14, 2024
Python

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Mar 9, 2025
Python

opendilab / CleanS2S

High-quality, streaming speech-to-speech interactive agent in a single file. A streaming, full-duplex voice-interaction agent prototype implemented in just one file!

python machine-learning streaming ai speech-synthesis speech-recognition speech-to-speech gpt-4o

Updated Mar 4, 2025
Python

VITA-MLLM / Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

speech speech-synthesis speech-recognition speech-to-speech large-language-models multimodal-large-language-models

Updated Jan 2, 2025
Python

SamirPaulb / real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Updated Jan 22, 2024
Tcl

MooreThreads / MooER

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation, and speech recognition.

speech-recognition speech-to-text speech-translation speech-to-speech large-language-models chatgpt gpt-4o speech-interaction

Updated Jan 8, 2025
Python

amanvirparhar / weebo

A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.

llama whisper kokoro speech-to-speech

Updated Jan 20, 2025
Python

dqqcasia / awesome-speech-translation

natural-language-processing machine-translation speech speech-synthesis speech-recognition speech-processing text-translation disfluency-detection speech-translation multimodal-machine-learning multimodal-machine-translation punctuation-restoration speech-to-speech simultaneous-translation cascaded-speech-translation non-autoregressive-translation speech-to-subtitles

Updated Nov 10, 2021

jesuscopado / samantha-os1

Samantha OS1 is a conversational AI assistant powered by the Realtime API from OpenAI

agent openai realtime-api speech-to-speech ai-agent

Updated Dec 27, 2024
Python

OpenBMB / UltraEval-Audio

An easy-to-use, fast, and easily integrable tool for evaluating audio LLMs

evaluation speech-recognition speech-to-text speech-to-speech

Updated Mar 6, 2025
Python

flo-bit / svelte-openai-realtime-api

Svelte component for using the OpenAI Realtime API

svelte openai realtime-api speech-to-speech sveltekit

Updated Dec 21, 2024
Svelte

ictnlp / DASpeech

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

machine-translation speech-translation speech-to-speech speech-to-speech-translation

Updated Jul 22, 2024
Python

jofizcd / Soul-of-Waifu

If you've ever wished you could talk to your AI waifu with high-quality characters and voices, try Soul of Waifu. Don't miss the opportunity to touch your dream!

text-to-speech ai chatbot artificial-intelligence tts speech-to-text waifu stt aichatbot aigirl speech-to-speech characterai aigirlfriend aiwaifu

Updated Dec 2, 2024
Python

hparcells / rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

api website web ai interactive transcription voice-synthesis voice-cloning speech-to-speech voicecloning elevenlabs

Updated Mar 1, 2023
TypeScript

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.

tts vad audio-processing asr voice-assistant conversational-ai speech-to-speech ollama kokoro-tts

Updated Feb 16, 2025
Python

liamdugan / speech-to-speech

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Jan 14, 2025
Python

lugia19 / Echo-XI

Speech-to-text-to-speech using ElevenLabs

python voice speech tts speech-synthesis speech-recognition speech-to-text speech-to-speech elevenlabs

Updated Jul 2, 2023
Python

rryam / SakuraKit

Swift SDK for Prototyping AI Speech Generation

swift ios text-to-speech speech-to-speech

Updated Nov 13, 2024
Swift

codename0og / codename-rvc-fork-3

Codename's rvc fork version 3, based on Applio.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits applio retrieval-based-voice-conversion

Updated Mar 12, 2025
Python

Ankur2606 / Low-latency-AI-Voice-Assistant

End-to-End AI Voice Assistant pipeline with Whisper for Speech-to-Text, Hugging Face LLM for response generation, and Edge-TTS for Text-to-Speech. Features include Voice Activity Detection (VAD), tunable parameters for pitch, gender, and speed, and real-time response with latency optimization.

ai versatile jarvis hacktoberfest voice-assistant multimodel speech-to-speech

Updated Mar 3, 2025
Jupyter Notebook
