TransformedModel/ai-rocky
OmniVoice chat (Rocky conversation)

Setup

Backend env vars

Create omnivoice-chat/backend/.env:

OPENROUTER_API_KEY=your_key_here
OPENROUTER_MODEL=openrouter/auto
# Optional: how long to wait for the model to finish streaming a reply (default 120 seconds).
OPENROUTER_READ_TIMEOUT=120
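
As a minimal sketch, the backend can read these variables with stdlib `os.environ` and the documented defaults (the real app may instead use python-dotenv or pydantic-settings; the variable handling below is illustrative):

```python
import os

# Hypothetical settings loader mirroring the .env keys above.
OPENROUTER_API_KEY = os.environ.get("OPENROUTER_API_KEY", "")
OPENROUTER_MODEL = os.environ.get("OPENROUTER_MODEL", "openrouter/auto")
# Streaming read timeout in seconds; falls back to the documented 120s default.
OPENROUTER_READ_TIMEOUT = float(os.environ.get("OPENROUTER_READ_TIMEOUT", "120"))
```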

Rocky persona (LLM)

Edit markdown under Rocky/:

  • Rocky-Life.md: backstory and personality
  • Rocky-Speech-Style.md: how Rocky writes and speaks in dialogue

The backend injects both into Rocky’s system prompt when non-empty. It reloads them automatically when you save a file (mtime-based cache; no restart needed). Very long files are truncated; override the per-file cap with ROCKY_PROMPT_MAX_CHARS (default 12000) in .env if you need more.
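
The reload-on-save behavior described above can be sketched as an mtime-keyed cache with a character cap (a hypothetical helper, not the backend's actual code; the 12000 default mirrors ROCKY_PROMPT_MAX_CHARS):

```python
import os

_cache = {}  # path -> (mtime, text)

def load_persona_file(path, max_chars=12000):
    """Return the file's text, re-reading from disk only when its mtime changes.

    Truncates to max_chars, mirroring the ROCKY_PROMPT_MAX_CHARS cap.
    A missing file yields "" so the section is simply skipped in the prompt.
    """
    try:
        mtime = os.path.getmtime(path)
    except OSError:
        return ""
    cached = _cache.get(path)
    if cached and cached[0] == mtime:
        return cached[1]  # unchanged since last read: serve from cache
    with open(path, encoding="utf-8") as f:
        text = f.read()[:max_chars]
    _cache[path] = (mtime, text)
    return text
```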

The system prompt also states up front that this Rocky is the Eridian from Project Hail Mary, not Rocky Balboa, so models don’t default to the boxer persona. If Rocky-Life.md is missing on the machine running uvicorn, check the server log for a one-time warning; the markdown must live next to backend/ under omnivoice-chat/Rocky/.
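
A hypothetical sketch of how such a prompt might be assembled from the two markdown files (the function name and section headers are illustrative, not the backend's actual code):

```python
def build_system_prompt(life_md: str, speech_md: str) -> str:
    # The preamble pins Rocky to the Project Hail Mary Eridian so models
    # do not default to the Rocky Balboa boxer persona.
    parts = [
        "You are Rocky, the Eridian from Project Hail Mary "
        "(not Rocky Balboa the boxer)."
    ]
    if life_md.strip():  # only injected when non-empty
        parts.append("## Backstory\n" + life_md)
    if speech_md.strip():
        parts.append("## Speech style\n" + speech_md)
    return "\n\n".join(parts)
```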

OmniVoice still clones timbre from assets/voices/Rocky-1.wav and its reference transcript; a short optional instruct tag is also passed for Rocky at TTS time.

Speech input (no server STT)

The backend does not run Whisper or any other server-side transcription. Web Speech dictation plus typing fill the composer; if dictation comes back empty after a mic take, the UI asks you to type what you said and send again, keeping the pending clip for playback on the user bubble.

Browser dictation (Web Speech API)

Voice turns use on-device dictation in parallel with the recorder (SpeechRecognition / webkitSpeechRecognition) when the browser exposes it.

  • Support varies: Chromium-based browsers usually work; Safari differs; Firefox support is spotty.
  • Dictation is not always offline. The engine may send audio to the OS or browser vendor; treat it like any cloud-adjacent speech feature for privacy expectations.

Hosting on Railway

Step-by-step deploy (Docker, one public URL, same-origin /api) is in RAILWAY.md.

OmniVoice TTS is CPU-heavy on Railway; use enough RAM for the model and expect slower speech than on a local GPU. There is no server-side STT in this stack; dictation quality depends on the user’s browser.

Run

Backend:

cd omnivoice-chat/backend
source .venv/bin/activate
uvicorn app.main:app --reload --port 8001

Frontend:

cd omnivoice-chat/frontend
npm run dev

Open the Vite URL. A server session starts automatically the first time you record or send a typed message.

Chat avatars: add rocky.png and user.png under frontend/public/avatars/ (see that folder’s README.txt).
