AI Voice Chat

Overview

This is a simple React app that lets you chat with an AI assistant using your voice.

It uses Whisper large v3 for transcription, openchat 3.5 AWQ for the AI assistant, and XTTS v2 for text-to-speech.

Its main feature is low speech-to-speech latency: it more than halves the latency shown in the ChatGPT with voice demo video. The whole stack runs on a single RTX 3090 GPU.

Concurrency is not supported, and the project is neither optimized nor production-ready. It is also probably riddled with bugs, so if you run into any, please open an issue or send a PR.

Demo

ai_voice_chat_demo_3.mp4

Installation

Requisites

Have an NVIDIA GPU with more than 16 GB of VRAM and the latest drivers
Have docker-compose installed
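
The two requisites above can be checked from a terminal before going further. The following is a minimal sketch, not part of the repo: the function names (vram_ok, have_cmd, check_requisites) are hypothetical, and it assumes nvidia-smi ships with your driver and that only the first GPU matters.

```shell
# Minimum VRAM in MiB; the README asks for more than 16 GB (16 * 1024 MiB).
MIN_VRAM_MIB=16384

# Succeeds when the given VRAM amount (in MiB) exceeds the minimum.
vram_ok() {
    [ "$1" -gt "$MIN_VRAM_MIB" ]
}

# Succeeds when the named command is available on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Hypothetical wrapper: checks both requisites using the first GPU reported.
check_requisites() {
    have_cmd docker-compose || { echo "docker-compose not found"; return 1; }
    have_cmd nvidia-smi || { echo "nvidia-smi not found"; return 1; }
    # Total memory of the first GPU, in MiB, with whitespace stripped.
    vram=$(nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits \
        | head -n 1 | tr -d '[:space:]')
    vram_ok "$vram" || { echo "only ${vram} MiB of VRAM"; return 1; }
    echo "requisites look fine (${vram} MiB of VRAM)"
}
```

After pasting the functions into a shell, run check_requisites. It does not check the driver version, so "latest drivers" still has to be verified by hand.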

Steps

Clone the repo

git clone https://github.com/WeberJulian/AI-voice-chat.git

Build the react app

cd AI-voice-chat
cd web-app
npm install && npm run build

Start everything 🚀

cd ..
docker-compose up
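
The services may need some time before the app responds (an assumption; it depends on image builds and model loading on your machine). As a rough sketch, a small retry helper can poll the app from a second terminal. The wait_for function is hypothetical and not part of the repo; the URL is the one given under Usage.

```shell
# Retries a command until it succeeds or the attempts run out.
# Usage: wait_for ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
wait_for() {
    attempts=$1
    delay=$2
    shift 2
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # Stop as soon as the command succeeds.
        if "$@"; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    return 1
}

# Example, run from a second terminal because docker-compose up stays in the foreground:
# wait_for 60 5 curl -fsS -o /dev/null http://localhost:5000 && echo "app is up"
```

The example polls every 5 seconds for up to 60 attempts; both numbers are arbitrary and can be adjusted.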

Usage

Open the app in your browser at http://localhost:5000
Allow microphone access
Push to talk either with the Shift ⇧ key or the circle
Enjoy!

To reset the conversation, refresh the page.

Custom models

If you fine-tune XTTS and want to use your own model, add the following volume to the tts service in the docker-compose.yml file:

services:
    ...
    tts:
        ...
        volumes:
            - /path/to/your/model:/app/tts_models

In the /path/to/your/model folder, you must have the following files:

config.json
model.pth
vocab.json
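
Before mounting the folder, it may help to confirm that the three files listed above are present. This is a minimal sketch: missing_model_files is a hypothetical helper, not part of the repo, and it only checks that the files exist, not that they are valid.

```shell
# Prints each required XTTS file that is missing from the given folder.
# Returns 0 when all three files are present, 1 otherwise.
missing_model_files() {
    dir=$1
    rc=0
    for f in config.json model.pth vocab.json; do
        if [ ! -f "$dir/$f" ]; then
            echo "$f"
            rc=1
        fi
    done
    return "$rc"
}

# Example:
# missing_model_files /path/to/your/model || echo "files listed above are missing"
```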
