EdgeSpeech

Web developers can add listening, speaking, or both to a React Native app with EdgeSpeech, without writing any native audio code. Voice Activity Detection, Speech-to-Text, and Text-to-Speech all run on-device through the Switchboard SDK. Your JavaScript works entirely with text.

import { SwitchboardVoiceModule, initialize, start, speak } from '@synervoz/edgespeech';

initialize('YOUR_APP_ID', 'YOUR_APP_SECRET');

SwitchboardVoiceModule.addListener('onTranscript', async ({ text, isFinal }) => {
  if (isFinal) {
    const response = await chat(text);
    await speak(response);
  }
});

await start();

The example app shows the complete voice loop running end-to-end.

Cost Savings: 99% Cheaper Than Cloud Speech-to-Speech

The real advantage of on-device voice processing is cost.

The Math

Consider a voice AI assistant handling 1,000 conversations per day, each lasting 5 minutes.

OpenAI Realtime API (cloud speech-to-speech):

Component	Calculation	Cost
Audio input	150 sec × 80 tokens/sec × $100/1M	$1.20
Audio output	150 sec × 80 tokens/sec × $200/1M	$2.40
Per conversation		$3.60
1,000 conversations/day		$3,600/day
Monthly (30 days)		$108,000

EdgeSpeech + ChatGPT API (text only):

Component	Calculation	Cost
Text input	~750 tokens × $5/1M	$0.004
Text output	~750 tokens × $20/1M	$0.015
Per conversation		$0.02
1,000 conversations/day		$20/day
Monthly (30 days)		$600

Installation

npm install @synervoz/edgespeech

iOS Setup

The Switchboard SDK frameworks are downloaded automatically on npm install.
Add microphone permission to your Info.plist:

<key>NSMicrophoneUsageDescription</key>
<string>This app needs microphone access for voice input</string>

Build your app:

npx expo run:ios

Quick Start

import {
  SwitchboardVoiceModule,
  initialize,
  configure,
  start,
  speak,
  requestMicrophonePermission,
} from '@synervoz/edgespeech';

// 1. Initialize with your Switchboard credentials
initialize('YOUR_SWITCHBOARD_APP_ID', 'YOUR_SWITCHBOARD_APP_SECRET');

// 2. (Optional) tune settings
configure({ vadSensitivity: 0.5 });

// 3. Set up event listeners
SwitchboardVoiceModule.addListener('onTranscript', ({ text, isFinal }) => {
  console.log(isFinal ? 'Final:' : 'Interim:', text);
  if (isFinal) handleUserSpeech(text);
});

SwitchboardVoiceModule.addListener('onStateChange', ({ state }) => {
  console.log('State:', state); // 'idle' | 'listening' | 'speaking'
});

SwitchboardVoiceModule.addListener('onInterrupted', () => {
  console.log('User interrupted playback');
});

SwitchboardVoiceModule.addListener('onError', ({ code, message }) => {
  console.error('Voice error:', code, message);
});

// 4. Request permission and start
const granted = await requestMicrophonePermission();
if (granted) {
  await start();
}

// 5. Speak responses
await speak('Hello! How can I help you today?');

API Reference

Configuration

await EdgeSpeech.configure({
  appId: string,           // Required: Switchboard app ID
  appSecret: string,       // Required: Switchboard app secret
  sttModel?: string,       // Optional: STT model (default: 'whisper-base-en')
  ttsVoice?: string,       // Optional: TTS voice (default: 'en_GB')
  vadSensitivity?: number, // Optional: VAD sensitivity 0.0-1.0 (default: 0.5)
});

Methods

Method	Description
`configure(config)`	Initialize with credentials and settings
`start()`	Start listening for voice input
`stop()`	Stop listening
`speak(text)`	Speak text using TTS
`stopSpeaking()`	Stop current TTS playback
`requestMicrophonePermission()`	Request microphone access

Events

Listen via SwitchboardVoiceModule.addListener(eventName, handler).

Event	Payload	Description
`onTranscript`	`{ text: string, isFinal: boolean }`	Speech recognized
`onStateChange`	`{ state: string }`	State changed (`idle`, `listening`, `speaking`)
`onSpeechStart`	`{}`	VAD detected voice activity
`onSpeechEnd`	`{}`	VAD detected end of speech
`onTTSComplete`	`{}`	TTS finished playing
`onInterrupted`	`{}`	TTS interrupted by user speech
`onError`	`{ code: string, message: string }`	Error occurred

States

idle -> listening -> processing -> idle
                 \              /
                   -> speaking -

Example App

The example/ directory contains a minimal demo showing the complete voice loop:

cd example
npm install
npx expo run:ios

Architecture

flowchart TB
    mic["🎤 Microphone"]
    spk["🔊 Speaker"]

    subgraph Engines["EdgeSpeech"]
        subgraph JS["JavaScript API"]
            subgraph Controls["Controls"]
                start["start()"]
                stop["stop()"]
                stopSpeaking["stopSpeaking()"]
            end
            speak["speak(text)"]
            onTranscript["onTranscript"]
            onInterrupted["onInterrupted"]
        end
        subgraph ListenGraph["Listening Graph"]
            direction LR
            MCtoMono["MultiChannelToMono"] --> Split["BusSplitter"]
            Split --> VAD["SileroVAD"]
            Split --> STT["Whisper STT"]
            VAD -.-> STT
        end

        subgraph SpeakingGraph["Speaking Graph"]
            TTS["Sherpa TTS"]
        end
    end

    SDK["Switchboard SDK (Runtime)"]

    mic --> ListenGraph
    SpeakingGraph --> spk
    ListenGraph -- "executed by" --> SDK
    SpeakingGraph -- "executed by" --> SDK

    start --> ListenGraph
    stop --> ListenGraph
    speak --> SpeakingGraph
    stopSpeaking --> SpeakingGraph

    STT -.-> onTranscript
    ListenGraph -.-> onInterrupted
    onInterrupted --> stopSpeaking

    onTranscript --> LLM
    LLM --> speak

    LLM["🤖 Your LLM Pipeline"]:::external

    classDef external fill:#f5f5f5,stroke:#999,stroke-dasharray: 5 5

    style ListenGraph fill:#fff,stroke:#999,stroke-dasharray: 5 5
    style SpeakingGraph fill:#fff,stroke:#999,stroke-dasharray: 5 5

Platform Support

Platform	Status
iOS	Supported
Android	Coming soon

Requirements

React Native 0.74+
iOS 13.4+
Node.js 20+

Get Switchboard Credentials

Sign up at switchboard.audio
Create a new app in the dashboard
Copy your App ID and App Secret

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
__tests__		__tests__
assets		assets
example		example
ios		ios
scripts		scripts
shared		shared
specs		specs
src		src
.eslintrc.js		.eslintrc.js
.gitignore		.gitignore
.prettierrc.js		.prettierrc.js
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Gemfile		Gemfile
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
app.json		app.json
babel.config.js		babel.config.js
edgespeech.podspec		edgespeech.podspec
expo-module.config.json		expo-module.config.json
jest.config.js		jest.config.js
jest.integration.config.js		jest.integration.config.js
metro.config.js		metro.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EdgeSpeech

Cost Savings: 99% Cheaper Than Cloud Speech-to-Speech

The Math

Installation

iOS Setup

Quick Start

API Reference

Configuration

Methods

Events

States

Example App

Architecture

Platform Support

Requirements

Get Switchboard Credentials

License

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EdgeSpeech

Cost Savings: 99% Cheaper Than Cloud Speech-to-Speech

The Math

Installation

iOS Setup

Quick Start

API Reference

Configuration

Methods

Events

States

Example App

Architecture

Platform Support

Requirements

Get Switchboard Credentials

License

Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages