@imcooder/opuslib

Opus 1.6 audio encoding for React Native and Expo

Fork Notice: This project is forked from Scdales/opuslib. We've made the following enhancements:

Threading & Stability

  • Dedicated encoding thread — Audio capture and Opus encoding run on separate threads (copy-and-post pattern), so the capture thread is never blocked by encoding. All encoder operations run on a single serial queue — no locks, no cross-thread crash risk. This fixes an iOS crash caused by encoding on the real-time audio thread.
  • Flush on stop — Remaining PCM samples are padded with silence and encoded on stop, so no audio is lost at the end of a session.
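The flush-on-stop padding can be sketched in a few lines. This is a hypothetical helper for illustration, not the module's actual native internals: Opus only encodes complete frames, so the final partial frame is zero-padded (silence) up to the frame boundary before the last opus_encode call.

```typescript
// Sketch of flush-on-stop padding (hypothetical helper, not the module's
// internals). Zero-fills the tail so the last partial frame still encodes.
function padToFrame(pending: Int16Array, samplesPerFrame: number): Int16Array {
  if (pending.length % samplesPerFrame === 0) return pending;
  const frames = Math.ceil(pending.length / samplesPerFrame);
  const padded = new Int16Array(frames * samplesPerFrame); // zero-filled = silence
  padded.set(pending);
  return padded;
}

// 16 kHz mono, 20 ms frames → 320 samples per frame:
// 500 leftover samples round up to 2 frames (640 samples).
const tail = padToFrame(new Int16Array(500), 320);
```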

New Events

  • audioStarted event — Emitted from the encoding thread when streaming starts. Includes actual audio config and Opus encoder preSkip (OPUS_GET_LOOKAHEAD), so decoders know how many samples to skip.
    Opuslib.addListener('audioStarted', (event) => {
      // event.timestamp: 1711000000000    (ms since epoch)
      // event.sampleRate: 16000           (Hz)
      // event.channels: 1                 (mono)
      // event.bitrate: 24000              (bps)
      // event.frameSize: 20               (ms)
      // event.preSkip: 312                (samples, decoder should skip)
    });
  • audioEnd event — Emitted from the encoding thread when streaming stops. Includes session summary.
    Opuslib.addListener('audioEnd', (event) => {
      // event.timestamp: 1711000005000    (ms since epoch)
      // event.totalDuration: 5000         (ms, session length)
      // event.totalPackets: 250           (total encoded packets)
    });
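On the receiving side, preSkip is consumed like this — a decoder-side sketch with a hypothetical helper (not part of this module): the first preSkip decoded samples are encoder warm-up and should be dropped, and since the skip can span more than one decoded packet, the remaining count is carried across calls.

```typescript
// Decoder-side sketch (hypothetical helper): drop the first `preSkip`
// decoded samples of the stream; the skip may span multiple packets.
function makePreSkipTrimmer(preSkip: number) {
  let remaining = preSkip;
  return (decoded: Float32Array): Float32Array => {
    if (remaining === 0) return decoded;
    const drop = Math.min(remaining, decoded.length);
    remaining -= drop;
    return decoded.subarray(drop); // empty if the whole packet is skipped
  };
}

// With preSkip=312 and 20 ms packets at 16 kHz (320 samples each),
// only the last 8 samples of the first decoded packet are kept.
const trim = makePreSkipTrimmer(312);
```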

framesPerCallback — batch multiple frames to reduce data transfer overhead

  • Multiple independently-encoded Opus frames can be batched into a single audioChunk callback via framesPerCallback, reducing JS bridge calls and data transfer overhead. Each frame in frames[] is a complete, independently decodable Opus packet (with its own TOC byte) — no illegal byte concatenation.
  • Example: frameSize=20ms, framesPerCallback=5 → 5 frames encoded individually, returned as frames: OpusFrame[] in one audioChunk event (80% fewer bridge calls).
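The bridge-call reduction quoted above is simple arithmetic, sketched here for checking your own settings (the formula is an illustration; note that batching also adds delivery latency of frameSize × (framesPerCallback − 1) ms for the first frame in each batch):

```typescript
// Events per second reaching the JS bridge for a given frame size and
// batch factor. 20 ms frames: framesPerCallback=1 → 50 events/s,
// framesPerCallback=5 → 10 events/s, i.e. 80% fewer bridge calls.
function callbacksPerSecond(frameSizeMs: number, framesPerCallback: number): number {
  return 1000 / (frameSizeMs * framesPerCallback);
}
```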

New audioChunk fields

  • frames — Array of OpusFrame objects. Each frame is an independent, decodable Opus packet (with its own TOC byte). No illegal byte concatenation.
  • OpusFrame.audioLevel — Per-frame normalized audio level (0.0~1.0), computed via RMS with dBFS-to-linear mapping. Only present when enableAudioLevel: true. Consumers can average neighboring frames for smoothing.
  • duration — Duration of all frames in milliseconds (frameSize * frameCount).
  • frameCount — Number of Opus frames in this callback (= frames.length).
  • preSkip — (in audioStarted event) Opus encoder lookahead in samples. Decoders should skip this many samples at the beginning of the stream.
    Opuslib.addListener('audioChunk', (event) => {
      // event.frames: OpusFrame[]           (independent Opus packets)
      //   each frame: { data: ArrayBuffer, audioLevel?: number }
      // event.timestamp: 1711000000100     (ms since epoch)
      // event.sequenceNumber: 5            (callback counter)
      // event.duration: 100               (ms, = frameSize * frameCount)
      // event.frameCount: 5               (= frames.length)
    });
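The per-frame audioLevel described above (RMS with dBFS-to-linear mapping) can be sketched as follows. The exact dB range the module uses is an assumption — here −60 dBFS..0 dBFS maps linearly onto 0.0..1.0:

```typescript
// Sketch of a per-frame level meter: RMS of the frame, converted to dBFS,
// then mapped linearly onto 0.0..1.0. The -60 dBFS floor is an assumed
// tuning value, not necessarily what the module uses internally.
function frameAudioLevel(samples: Int16Array, minDb = -60): number {
  let sumSquares = 0;
  for (let i = 0; i < samples.length; i++) {
    const s = samples[i] / 32768; // normalize 16-bit PCM to -1..1
    sumSquares += s * s;
  }
  const rms = Math.sqrt(sumSquares / samples.length);
  if (rms === 0) return 0;
  const db = 20 * Math.log10(rms); // dBFS, always <= 0
  return Math.min(1, Math.max(0, 1 - db / minDb));
}
```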

New Config Options

  • enableAudioLevel — Enable per-frame audio level calculation (default: false). When enabled, each OpusFrame includes audioLevel (0.0~1.0). Disabled by default to save computation.
    await Opuslib.startStreaming({
      sampleRate: 16000,
      channels: 1,
      bitrate: 24000,
      frameSize: 20,
      framesPerCallback: 5,  // batch 5 independent Opus frames per event
      enableAudioLevel: true, // enable per-frame audio level
    });
  • iosAudioSession — Configurable iOS AudioSession (iOS only)
    • Problem: AudioSession was hardcoded as .record + .measurement + no options, which means: record-only (no simultaneous playback), system audio processing disabled (no AGC, no echo cancellation), no Bluetooth support, audio defaults to earpiece (not speaker).
    • Solution: New optional iosAudioSession parameter in AudioConfig lets callers customize AVAudioSession category, mode, and options. Omitting it preserves the original default behavior. Android/Web ignore this parameter.
    await Opuslib.startStreaming({
      sampleRate: 24000,
      channels: 1,
      bitrate: 24000,
      frameSize: 20,
      framesPerCallback: 8,
      enableAudioLevel: true,
      // Customize iOS AudioSession for voice chat scenarios
      iosAudioSession: {
        category: 'playAndRecord',    // record + play simultaneously
        mode: 'default',              // enable system audio processing (AGC, echo cancellation)
        options: ['mixWithOthers', 'defaultToSpeaker', 'allowBluetooth', 'allowAirPlay'],
      },
    });

Real-time audio capture and encoding using the latest Opus 1.6 codec, built from source with full native integration for iOS and Android.

License: MIT


Features

  • Opus 1.6 - Latest codec version compiled from the official source
  • Low Latency - Real-time encoding with minimal overhead
  • Native Performance - Direct C/C++ integration, no JavaScript encoding
  • Thread-safe Encoding - Dedicated encoding thread, capture thread never blocked
  • Audio Level Metering - Optional per-frame 0~1 audio level via RMS (enable with enableAudioLevel: true)
  • Lifecycle Events - audioStarted / audioEnd events with session metadata
  • High Quality - Excellent speech quality at just 24 kbps
  • Cross-Platform - iOS and Android with a consistent API
  • Zero Dependencies - Self-contained with vendored Opus source
  • Configurable - Bitrate, sample rate, frame size
  • Event-Based - Stream encoded audio chunks via events

Why Opus 1.6?

Opus is the gold standard for real-time voice applications:

  • Better compression than AAC, MP3, or Vorbis at low bitrates
  • Lower latency than other codecs (as low as 5ms)
  • Royalty-free and open source
  • Internet standard (RFC 6716) used by Discord, WhatsApp, WebRTC

Installation

# Using npm
npm install @imcooder/opuslib

# Using yarn
yarn add @imcooder/opuslib

# Using pnpm
pnpm add @imcooder/opuslib

Additional Setup

For Expo Projects

npx expo install @imcooder/opuslib
npx expo prebuild

For React Native CLI

# iOS
cd ios && pod install && cd ..

# Android - no additional steps needed

Quick Start

import Opuslib from '@imcooder/opuslib';
import { Platform, PermissionsAndroid } from 'react-native';

// Request microphone permission (Android)
async function requestPermission() {
  if (Platform.OS === 'android') {
    const granted = await PermissionsAndroid.request(
      PermissionsAndroid.PERMISSIONS.RECORD_AUDIO
    );
    return granted === PermissionsAndroid.RESULTS.GRANTED;
  }
  return true; // iOS prompts for microphone permission automatically on first use
}

// Start recording and encoding
async function startRecording() {
  const hasPermission = await requestPermission();
  if (!hasPermission) {
    console.error('Microphone permission denied');
    return;
  }

  // Listen for session lifecycle
  Opuslib.addListener('audioStarted', (event) => {
    console.log(`Started: ${event.sampleRate}Hz, preSkip=${event.preSkip}`);
  });

  Opuslib.addListener('audioEnd', (event) => {
    console.log(`Ended: ${event.totalDuration}ms, ${event.totalPackets} packets`);
  });

  // Listen for encoded audio chunks
  const subscription = Opuslib.addListener('audioChunk', (event) => {
    const { frames, timestamp, sequenceNumber } = event;
    for (const frame of frames) {
      console.log(`Opus packet: ${frame.data.byteLength} bytes, level=${frame.audioLevel?.toFixed(2) ?? 'N/A'}`);
      // Send each independent Opus packet to your backend, save to file, etc.
    }
  });

  // Start streaming
  await Opuslib.startStreaming({
    sampleRate: 16000,      // 16 kHz
    channels: 1,            // Mono
    bitrate: 24000,         // 24 kbps
    frameSize: 20,          // 20ms frames
    framesPerCallback: 1,   // 1 frame per callback (default)
  });
}

// Stop recording
async function stopRecording() {
  await Opuslib.stopStreaming();
}

API Reference

Methods

startStreaming(config: AudioConfig): Promise<void>

Start audio capture and Opus encoding.

Parameters:

interface AudioConfig {
  sampleRate: number;               // Sample rate in Hz (8000, 16000, 24000, 48000)
  channels: number;                 // Number of channels (1 = mono, 2 = stereo)
  bitrate: number;                  // Target bitrate in bits/second (e.g., 24000)
  frameSize: number;                // Frame duration in ms (2.5, 5, 10, 20, 40, 60)
  framesPerCallback?: number;       // Frames per callback (default 1), batching reduces bridge calls
  dredDuration?: number;            // Reserved for future DRED support (default: 0)
  enableAudioLevel?: boolean;       // Enable per-frame audio level (default: false)
  enableAmplitudeEvents?: boolean;  // Enable amplitude monitoring (default: false)
  amplitudeEventInterval?: number;  // Amplitude update interval in ms (default: 16)
  iosAudioSession?: {               // iOS AudioSession config (iOS only, ignored on Android/Web)
    category: 'record' | 'playAndRecord' | 'playback' | 'ambient';
    mode: 'default' | 'voiceChat' | 'measurement' | 'spokenAudio';
    options?: Array<'mixWithOthers' | 'defaultToSpeaker' | 'allowBluetooth' | 'allowAirPlay' | 'allowBluetoothA2DP'>;
  };
}

Recommended Settings for Speech:

{
  sampleRate: 16000,     // 16 kHz - optimal for speech
  channels: 1,           // Mono - sufficient for voice
  bitrate: 24000,        // 24 kbps - excellent quality
  frameSize: 20,         // 20ms - standard for real-time
  framesPerCallback: 1,  // 1 frame per callback - low latency
}
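These settings also make bandwidth easy to reason about. A back-of-envelope estimate of average packet size (Opus is variable bitrate, so real packets fluctuate around this figure):

```typescript
// Average Opus packet size for a given bitrate and frame duration:
// bits/s × seconds-per-frame ÷ 8 bits-per-byte.
// At 24 000 bps and 20 ms frames: 24000 × 0.020 / 8 = 60 bytes per packet
// (plus event/transport overhead; actual packets vary with VBR).
function avgPacketBytes(bitrate: number, frameSizeMs: number): number {
  return (bitrate * frameSizeMs) / 1000 / 8;
}
```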

Throws: Error if streaming is already in progress or if microphone permission is denied


stopStreaming(): Promise<void>

Stop audio capture and encoding, flush remaining audio, release resources.


pauseStreaming(): void

Pause audio capture (keeps resources allocated). Call resumeStreaming() to continue.


resumeStreaming(): void

Resume audio capture after calling pauseStreaming().


Events

audioStarted

Emitted when audio streaming successfully starts. Fired from the encoding thread so all values (including preSkip) are read without cross-thread risk.

Opuslib.addListener('audioStarted', (event: AudioStartedEvent) => {
  console.log(`Streaming started at ${event.sampleRate}Hz, preSkip=${event.preSkip}`);
});

Event Data:

interface AudioStartedEvent {
  timestamp: number;    // Milliseconds since epoch
  sampleRate: number;   // Actual sample rate in Hz
  channels: number;     // Number of channels
  bitrate: number;      // Configured bitrate in bits/second
  frameSize: number;    // Frame duration in milliseconds
  preSkip: number;      // Opus encoder lookahead in samples (decoder should skip these)
}

audioChunk

Emitted when encoded Opus packets are ready. With framesPerCallback > 1, each event carries a batch of frames.

Opuslib.addListener('audioChunk', (event: AudioChunkEvent) => {
  // event.frames: OpusFrame[] - Independent Opus packets (each decodable on its own)
  //   frame.audioLevel?: number - Per-frame level 0.0~1.0 (when enableAudioLevel is true)
  // event.duration: number - Duration in ms (frameSize * frameCount)
  // event.frameCount: number - Number of Opus frames (= frames.length)
  for (const frame of event.frames) {
    websocket.send(frame.data);  // each frame is an independent Opus packet
  }
});

Event Data:

interface OpusFrame {
  data: ArrayBuffer;         // Independent Opus packet (one opus_encode() output with its own TOC byte)
  audioLevel?: number;       // Per-frame audio level 0.0~1.0 (only when enableAudioLevel is true)
}

interface AudioChunkEvent {
  frames: OpusFrame[];       // Array of independent Opus packets
  timestamp: number;         // Milliseconds since epoch
  sequenceNumber: number;    // Incrementing callback counter
  duration: number;          // Total duration in ms (frameSize * frameCount)
  frameCount: number;        // Number of Opus frames (= frames.length)
}
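Because Opus packets are variable-length and not self-delimiting, raw byte concatenation is ambiguous — any transport or file format needs its own framing. A minimal length-prefix scheme, shown as an illustration (this is not something the module provides):

```typescript
// Minimal transport framing sketch (illustration, not part of the module):
// prefix each Opus packet with a 2-byte big-endian length so the receiver
// can split the byte stream back into individual packets.
function framePackets(frames: { data: ArrayBuffer }[]): ArrayBuffer {
  const total = frames.reduce((n, f) => n + 2 + f.data.byteLength, 0);
  const out = new Uint8Array(total);
  const view = new DataView(out.buffer);
  let offset = 0;
  for (const f of frames) {
    view.setUint16(offset, f.data.byteLength); // big-endian length prefix
    out.set(new Uint8Array(f.data), offset + 2);
    offset += 2 + f.data.byteLength;
  }
  return out.buffer;
}
```

For file storage, the standard container is Ogg Opus (RFC 7845), which also carries preSkip in its header.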

audioEnd

Emitted when audio streaming stops. Fired from the encoding thread after flushing remaining audio.

Opuslib.addListener('audioEnd', (event: AudioEndEvent) => {
  console.log(`Session ended: ${event.totalDuration}ms, ${event.totalPackets} packets`);
});

Event Data:

interface AudioEndEvent {
  timestamp: number;      // Milliseconds since epoch
  totalDuration: number;  // Total session duration in milliseconds
  totalPackets: number;   // Total number of packets encoded
}

amplitude

Emitted periodically with audio amplitude data (requires enableAmplitudeEvents: true).

Opuslib.addAmplitudeListener((event: AmplitudeEvent) => {
  // event.rms: number - Root mean square amplitude (0.0 - 1.0)
  // event.peak: number - Peak amplitude (0.0 - 1.0)
  // event.timestamp: number - Milliseconds since epoch
});
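Raw per-event rms values jump around too much to drive a UI meter directly. A simple exponential moving average smooths them (an illustration; the alpha value is a tuning choice, not something the module prescribes):

```typescript
// Exponential moving average for smoothing amplitude values before
// rendering a UI meter. Higher alpha reacts faster, lower alpha is
// smoother; 0.3 is just an example tuning value.
function makeSmoother(alpha = 0.3) {
  let value: number | null = null;
  return (rms: number): number => {
    value = value === null ? rms : alpha * rms + (1 - alpha) * value;
    return value;
  };
}
```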

error

Emitted when an error occurs during recording.

Opuslib.addErrorListener((event: ErrorEvent) => {
  console.error(`Error: ${event.message}`);
});

Architecture

Capture Thread                  Encoding Thread (serial queue)
  |                               |
  | AVAudioEngine tap (iOS)       |
  | AudioRecord.read() (Android)  |
  |                               |
  | format convert + copy PCM     |
  |---- post(samples) ----------->| pendingSamples.append(samples)
  |                               | while (enough samples) {
  |                               |   opus_encode()
  |                               |   per-frame audioLevel (if enabled)
  |                               |   emit audioChunk event
  |                               | }
  |                               |
  | (stop)                        |
  |---- syncFlush() ------------->| pad silence + encode last frame
  |                               | emit audioEnd event
  |                               | destroy encoder
  |<---- done --------------------|

iOS: DispatchQueue (serial) as encoding thread, AVAudioEngine tap for capture

Android: HandlerThread + Handler as encoding thread, AudioRecord loop for capture

All encoder state (samples buffer, Opus encoder, audio level, sequence number) is only accessed on the encoding thread. No locks needed.
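The serial-queue guarantee above can be mimicked in plain TypeScript with promise chaining — an illustration of the pattern only (the native side uses DispatchQueue on iOS and HandlerThread on Android):

```typescript
// Illustration of the serial-queue pattern: tasks posted from anywhere run
// strictly one after another, so state touched only inside tasks needs no
// locks. Mirrors the copy+post design, not the actual native code.
function makeSerialQueue() {
  let tail: Promise<void> = Promise.resolve();
  return (task: () => void | Promise<void>): Promise<void> => {
    tail = tail.then(task); // each task starts only after the previous one
    return tail;
  };
}
```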

Opus Build Configuration

The module compiles Opus 1.6 from source with the following CMake flags:

-DCMAKE_BUILD_TYPE=Release
-DOPUS_DRED=OFF                    # DRED disabled (future feature)
-DOPUS_BUILD_SHARED_LIBRARY=OFF    # Static linking
-DOPUS_BUILD_TESTING=OFF           # No tests
-DOPUS_BUILD_PROGRAMS=OFF          # No CLI tools

iOS: Built as universal binary (arm64 + x86_64) for device and simulator

Android: Built for arm64-v8a, armeabi-v7a, and x86_64


Platform Notes

iOS

  • Minimum iOS Version: 15.1+

  • Audio Session: Configurable via iosAudioSession parameter. Default: .record + .measurement + no options (pure recording, system audio processing disabled). For voice chat or playback scenarios, pass a custom config:

    Category        Description
    record          Pure recording (default)
    playAndRecord   Record + play simultaneously
    playback        Playback only
    ambient         Mix with other audio, no interruption

    Mode            Description
    measurement     Disable system audio processing (default)
    default         Enable AGC, echo cancellation, etc.
    voiceChat       Optimized for voice calls
    spokenAudio     Optimized for spoken content

    Option              Description
    mixWithOthers       Allow mixing with other audio apps
    defaultToSpeaker    Route audio to speaker (not earpiece)
    allowBluetooth      Allow Bluetooth HFP devices
    allowAirPlay        Allow AirPlay output
    allowBluetoothA2DP  Allow Bluetooth A2DP (high-quality audio)
  • Permissions: Add to app.json:

    {
      "expo": {
        "ios": {
          "infoPlist": {
            "NSMicrophoneUsageDescription": "This app needs microphone access to record audio."
          }
        }
      }
    }

Android

  • Minimum SDK: API 24 (Android 7.0)

  • Permissions: Automatically added to manifest, request at runtime:

    import { PermissionsAndroid } from 'react-native';
    
    const granted = await PermissionsAndroid.request(
      PermissionsAndroid.PERMISSIONS.RECORD_AUDIO
    );

Troubleshooting

iOS: "Microphone permission not granted"

Add NSMicrophoneUsageDescription to your Info.plist or app.json.

Android: "Microphone permission not granted"

Request permission at runtime before calling startStreaming().

Build Errors on iOS

Clean and reinstall pods:

cd ios
rm -rf Pods Podfile.lock opus-build
pod install
cd ..

Build Errors on Android

Clean Gradle caches:

cd android
./gradlew clean
rm -rf .cxx build
cd ..

Contributing

Contributions are welcome! Please read our Contributing Guidelines before submitting PRs.

Development Setup

git clone https://github.com/imcooder/opuslib.git
cd opuslib
npm install
npm run build

cd example
npm install
npx expo run:ios    # or run:android

License

MIT License - see LICENSE file for details


Credits


Support
