DeepSpeech2 Online (Real-time) Decoder

This project works as an extension to this DeepSpeech2 implementation.

Getting Started

Prerequisites

Since this project is an extension for DeepSpeech2, you need to follow the installation instructions mentioned there.

Installing

Simply, copy the content of this folder and paste it inside deepspeech.pytorch folder.

How To Run it?

I'll illustrate it on this pretrained acoustic model and this ARPA language model. You can find other models here.

You need to edit some files before you run the server application.

Open the run_decoder_server.sh file and change the following variables:
--lm-path: the path of the language model.
--model-path: the path of the acoustic model.
--port: the port that the server will be listening on.

python decoder_server.py --host 0.0.0.0 \
                         --port 8888 \
                         --lm-path /volume/3-gram.pruned.3e-7.arpa \
                         --decoder beam --alpha 1.97 --beta 4.36 \
                         --model-path /volume/librispeech_pretrained_v2.pth \
                         --beam-width 1024 \ 
                         --cuda

Open js/app.js and find the following variables and change them:
X_seconds: record and send data each X_seconds seconds.

var X_seconds = 3;

ws_ip: the IP address of the computer that runs therun_decoder_server.sh script.
ws_port: the port that you use in therun_decoder_server.sh script.

var ws_ip = '0.0.0.0'
var ws_port = '8888'

Copy data/extended_data_loader.py from this project to the data folder in the deepspeech.pytorch folder.

Finally, run the following in different terminals:

> python website_server.py

> ./run_decoder_server.sh

Authors

Faris Alasmary - farisalasmary

License

This project is licensed under the MIT License - see the LICENSE file for details

Acknowledgments

Many thanks for those who made it possible for this project to be realized! This project uses the functionalities of different open-source projects that are mentioned below.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
css		css
data		data
js		js
websocket_server		websocket_server
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
decoder_server.py		decoder_server.py
index.html		index.html
lzstring.py		lzstring.py
run_decoder_server.sh		run_decoder_server.sh
website_server.py		website_server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

css

css

data

data

js

js

websocket_server

websocket_server

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

decoder_server.py

decoder_server.py

index.html

index.html

lzstring.py

lzstring.py

run_decoder_server.sh

run_decoder_server.sh

website_server.py

website_server.py

Repository files navigation

DeepSpeech2 Online (Real-time) Decoder

Getting Started

Prerequisites

Installing

How To Run it?

Authors

License

Acknowledgments

About

Releases

Packages

Languages

License

farisalasmary/deepspeech2-online-decoder

Folders and files

Latest commit

History

Repository files navigation

DeepSpeech2 Online (Real-time) Decoder

Getting Started

Prerequisites

Installing

How To Run it?

Authors

License

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Languages