Jarvis-Local

Firmware for local AI assistant hardware. Usage from a ESP32 microchip, with attached speakers and microphone. Server on desktop will prompt a llama3 model and the various voice models.

ESP32

The firmware runs on a five stage loop:

Wait for button press to start recording.
Record audio while the button is down.
Send the Audio.
Wait to receive audio.
Play Audio The ESP32's integrated Wifi-module is used to make a connection over your local Wifi, using WebSockets.

Server Code

The server runs from a python script, loading the models and opening the websocket for listening. The server starts running it's code when it has succesfully received the raw microphone data. First the audio is converted to text with a Speech-to-Text whisper model. The llama3 (or whatever you want to use) local LLM is prompted with the converted text. Finally the reply from the prompted AI is also converted to voice and sent back to the ESP32.

More on how the system is setup on my blog: https://www.techdebtblog.com/esp32ai

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
PPH.py		PPH.py
README.md		README.md
esp32_ai_assist.ino		esp32_ai_assist.ino

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Jarvis-Local

ESP32

Server Code

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

sgull-dev/esp32_ai_assistant

Folders and files

Latest commit

History

Repository files navigation

Jarvis-Local

ESP32

Server Code

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages