Keras DeepSpeech

Repository for experimenting with different CTC based model designs for ASR. Supports live recording and testing of speech and quickly creates customised datasets using own-voice dataset creation scripts!

OVERVIEW

Recommended > use virtualenv installed with python2.7 (3.x untested and will not work with Core ML)
git clone https://github.com/robmsmt/KerasDeepSpeech
pip install -r requirements.txt
Get the data using the import/download scripts in the folder, LibriSpeech is a good example.
Download the language model (large file) run ./lm/get_lm.sh

To Train, simply run python run-train.py In order to specify training/validation files use python run-train.py --train_files <csvfile> --valid_files <csvfile> (see run-train for complete arguments list)
To Test, run python run-test.py --test_files <datacsvfile>

Have a question? Like the tool? Don't like it? Open an issue and let's talk about it! Pull requests are appreciated!

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
checkpoints		checkpoints
data		data
lm		lm
mobile		mobile
preproc		preproc
tensorboard		tensorboard
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE.md		LICENSE.md
README.md		README.md
char_map.py		char_map.py
data.py		data.py
generator.py		generator.py
model.py		model.py
report.py		report.py
requirements.txt		requirements.txt
run-test.py		run-test.py
run-train.py		run-train.py
text.py		text.py
utils.py		utils.py