Talker

A super simple tool for a chatbot with voice control

How it works

The whole code contains close to no logic in itself, rather it is mostly glue code between:

getUserMedia and MediaRecorder to record the user's audio
OpenAI's Whisper to convert the user audio into a question text
Google's Gemma as an LLM to compute a answer text
Huggingface's Transformers python lib to wrap around the LLM, or any model you want to use (just replace the checkpoint string)
SpeechSynthesis to convert the answer text into audio

As of now it's way too basic to be practically used on a daily basis, but it serves as a POC for future applications (eg: LLM-powered local vocal chat in video games). It's also a surprisingly small repository: 85 lines for the python server, 58 lines for the web app

Installation

Install torch with GPU support

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Install dependencies

pip install -r requirements.txt

Run the server

python server.py

Navigate to localhost:8080 when ready

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
index.html		index.html
index.js		index.js
requirements.txt		requirements.txt
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Talker

How it works

Installation

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Talker

How it works

Installation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages