Demo Video: https://youtube.com/shorts/SjxQXo5ecP4
1. If you did not clone with submodules, initialize the backend submodule:

   ```sh
   git submodule update --init --recursive
   ```

2. In the repository root, install frontend dependencies and build:

   ```sh
   npm i
   npm run build
   ```

3. Copy the generated `dist/` folder into `backend/dist/`:

   ```sh
   cp -r dist backend/dist
   ```

4. Go into the backend folder and install backend dependencies:

   ```sh
   cd backend
   poetry install
   ```

5. Activate the virtual environment:

   ```sh
   source .venv/bin/activate
   ```

6. Start the backend development server:

   ```sh
   poe dev
   ```
The webapp will be hosted at http://localhost:3000/index.html.
The environment description feature requires an OpenAI-compatible backend server running a multimodal model.
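As a sketch, a request to such a server typically follows the OpenAI chat-completions format, with the camera frame passed inline as a base64 data URL. The function name, model name, and prompt below are illustrative assumptions, not the project's actual code:

```python
import base64
import json

def build_vision_request(prompt: str, jpeg_bytes: bytes,
                         model: str = "local-multimodal") -> dict:
    """Build an OpenAI-compatible chat-completions payload with an inline image.

    `model` is a placeholder; a llama.cpp server accepts whatever model
    name it was launched with.
    """
    data_url = "data:image/jpeg;base64," + base64.b64encode(jpeg_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": data_url}},
                ],
            }
        ],
    }

payload = build_vision_request("What obstacles are ahead?", b"\xff\xd8\xff")
print(json.dumps(payload)[:40])
```

POSTing this payload to the server's `/v1/chat/completions` endpoint returns the description text in the standard completions response shape.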
EchoPath was built to address a gap in accessibility tools: many systems assume visual interaction first. This project treats audio as the primary interface for blind and low-vision users.
EchoPath is a real-time voice-and-vision navigation assistant that:
- Streams live camera frames to the backend for perception.
- Listens for the wake phrase “hey john.”
- Captures a spoken command after wake-word detection.
- Sends the command plus the current frame to the backend (`query_llm`).
- Speaks concise, non-visual responses from the backend (`query_llm_response`).
- Plays spatial audio cues for nearby obstacles using 3D position data.
- Frontend: React + TypeScript + Capacitor camera integration.
- Transport: WebSocket for continuous frame and message streaming.
- Voice loop: Browser speech recognition + speech synthesis.
- Spatial audio: Custom Web Audio directional cue engine.
- Backend: FastAPI orchestration, Hugging Face Transformers (depth), Ultralytics YOLO (detection), and llama.cpp exposed via an OpenAI-compatible API.
- Message contracts: `image`, `query_llm`, `query_llm_response`.
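These contracts can be pictured as small JSON envelopes over the WebSocket. Only the `type` tags come from the project; the other field names here are illustrative assumptions:

```python
import json

# Hypothetical envelope shapes for the three message types; field names
# other than "type" are assumptions, not the project's actual schema.
def image_msg(jpeg_b64: str) -> str:
    return json.dumps({"type": "image", "data": jpeg_b64})

def query_llm_msg(command: str) -> str:
    return json.dumps({"type": "query_llm", "command": command})

def query_llm_response_msg(text: str) -> str:
    return json.dumps({"type": "query_llm_response", "text": text})

print(query_llm_msg("describe my surroundings"))
```

Keeping the contract this small is what lets the frontend stream `image` frames continuously while `query_llm`/`query_llm_response` pairs ride the same socket.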
- Stable wake-word behavior in continuous recognition.
- Avoiding stale transcript and repeated-command state bugs.
- Recovery and retries after speech/WebSocket failures.
- Keeping responses actionable without visual-only language.
- Meeting real-time timing constraints across capture, network, inference, and TTS.
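A common safeguard for the speech and WebSocket failures above is a bounded retry with exponential backoff. This generic sketch is not the project's actual recovery code:

```python
import time

def with_retries(fn, attempts: int = 3, base_delay: float = 0.5):
    """Call fn(), retrying on exception with exponential backoff.

    Re-raises the last error once the attempt budget is exhausted, so the
    caller can fall back to a spoken error message instead of going silent.
    """
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise
            time.sleep(base_delay * (2 ** attempt))
```

Bounding the attempts matters here: an unbounded retry loop would silently stall the voice loop, which for a non-visual interface is worse than an explicit spoken failure.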
- End-to-end wake-word → command → backend → spoken-response loop.
- Live camera streaming integrated with backend vision + LLM querying.
- Spatial audio cues for obstacle direction and proximity.
- Improved robustness with timeout and retry safeguards.
- Hands-free, accessibility-first interaction flow.
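The directional cues reduce to mapping an obstacle's camera-relative 3D position to an azimuth and loudness for the audio engine; in the browser this would feed Web Audio panning, but the geometry itself is simple. The coordinate convention and falloff curve below are assumptions for illustration:

```python
import math

def obstacle_cue(x: float, y: float, z: float) -> tuple[float, float]:
    """Map a camera-relative obstacle position to (azimuth_deg, gain).

    Assumes +x is right and +z is forward of the camera (a convention,
    not necessarily the project's). Azimuth is degrees clockwise from
    straight ahead; gain falls off with distance so closer obstacles
    sound louder.
    """
    azimuth = math.degrees(math.atan2(x, z))
    distance = math.sqrt(x * x + y * y + z * z)
    gain = 1.0 / (1.0 + distance)  # simple inverse falloff, max 1.0 at zero distance
    return azimuth, gain

print(obstacle_cue(1.0, 0.0, 1.0))  # obstacle 45 degrees to the right
```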
- Accessibility-first design changes system architecture, not only UI text.
- Reliability and state handling are as important as model quality.
- Clear protocol contracts accelerate iteration in real-time systems.
- Helpful guidance for blind users should be concise, actionable, and sensory-aware.
- Stronger on-device/offline fallback for voice commands.
- Better personalization (voice style, verbosity, route preferences).
- Expanded route safety signals (surface, curb, and crossing cues).
- Confidence-aware responses when model certainty is low.
- Broader testing with blind and low-vision participants.
