@rxtk/toDeepSpeech

👂 An RxJS operator for real-time speech-to-text (STT/S2T) streaming using the open-source DeepSpeech library.

npm i @rxtk/stt-deepspeech

yarn add @rxtk/stt-deepspeech

⚠️ To run the DeepSpeech pipeline, you must download the corresponding DeepSpeech model, unzip it, and pass the model directory to the toDeepSpeech operator, like this: toDeepSpeech({modelDir: 'path/to/deepspeech-models-0.7.0'}).

⚠️ Node.js only. This has not been tested in browsers, but it might be possible to make it work. If you get it working, please open a PR!

API

`toDeepSpeech`

Stream audio speech data to DeepSpeech and get transcripts back:

import {map} from 'rxjs/operators';
import {toDeepSpeech} from '@rxtk/stt-deepspeech';

// The pipeline takes a stream of audio chunks encoded as LINEAR16 (PCM, 16-bit integers).
// Chunks may be a Buffer, String, Blob or Typed Array.
const transcript$ = pcmChunkEncodedAs16BitIntegers$.pipe(
  map(chunk => Buffer.from(chunk, 'base64')), // decode base64 strings into Buffers
  toDeepSpeech({modelDir: '/path/to/deepspeech-models-0.7.0'})
);
transcript$.subscribe(console.log); // log transcript output
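
If your audio is already in memory as a single Buffer of 16-bit PCM, one way to produce the stream of chunks used above is to slice it into fixed-size pieces first. The following is a minimal sketch, not part of this package: `chunkPcm` and its default chunk size are hypothetical, and keeping chunk sizes even (so no 2-byte sample is split across chunks) is a cautious assumption rather than a documented requirement of the operator.

```javascript
// Hypothetical helper (not part of @rxtk/stt-deepspeech).
// Splits a Buffer of 16-bit PCM into chunks of at most `chunkBytes` bytes.
// `chunkBytes` must be even so that a 2-byte sample never straddles two chunks.
function chunkPcm(pcm, chunkBytes = 4096) {
  if (!Number.isInteger(chunkBytes) || chunkBytes <= 0 || chunkBytes % 2 !== 0) {
    throw new RangeError('chunkBytes must be a positive even integer');
  }
  const chunks = [];
  for (let offset = 0; offset < pcm.length; offset += chunkBytes) {
    // subarray returns a view on the same memory, so no audio data is copied
    chunks.push(pcm.subarray(offset, offset + chunkBytes));
  }
  return chunks;
}
```

The resulting array can be turned into an observable with `from` from `rxjs` and piped into `toDeepSpeech`. Because the chunks are already Buffers, the base64 `map` step would not be needed in that case.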

⚠️ Pay attention to the encoding of the audio data. The operator only accepts PCM data encoded as 16-bit integers. For example, LINEAR16 encoding usually works.
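
Some audio sources (for example, decoded audio held in a Float32Array) produce floating-point samples in the range [-1, 1] rather than 16-bit integers. The sketch below shows one common way to convert such samples to 16-bit signed little-endian PCM before streaming them. `floatTo16BitPcm` is a hypothetical helper, not part of this package, and it assumes the audio is already at the sample rate and channel count your DeepSpeech model expects.

```javascript
// Hypothetical helper (not part of @rxtk/stt-deepspeech).
// Converts float samples in [-1, 1] to a Buffer of 16-bit signed
// little-endian PCM. Out-of-range values are clamped.
function floatTo16BitPcm(samples) {
  const pcm = Buffer.alloc(samples.length * 2); // 2 bytes per sample
  for (let i = 0; i < samples.length; i++) {
    const clamped = Math.max(-1, Math.min(1, samples[i]));
    // Negative values scale to -32768, positive values to 32767
    const int16 = clamped < 0
      ? Math.round(clamped * 0x8000)
      : Math.round(clamped * 0x7fff);
    pcm.writeInt16LE(int16, i * 2);
  }
  return pcm;
}
```

This does not resample or downmix the audio. If the source has a different sample rate or more than one channel, that conversion has to happen separately.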

Guides

Introduction to audio data
