Listen, Attend and Spell - PyTorch Implementation

My first project of Speech recognition. This is a PyTorch implementation of Listen, Attend and Spell (LAS) and based on Alexander-H-Liu' repository .

Requirements

Python 3
PyTorch 1.0.0
python_speech_features
editdistance

Chinese Mandarin corpus

Pretrained models (not supported)

Setup

Download four datasets and preprocessing

├── audio_data
│   ├── data_thchs30
│   │   ├── data
│   │   ├── train
│   │   │   ├── ...
│   ├── data_aishell
│   │   ├── transcript
│   │   ├── wav
│   │   │   ├── ...
│   ├── primewords_md_2018_set1
│   │   ├── audio_files
│   │   ├── set1_transcript.json
│   ├── ST-CMDS-20170001_1-OS
│   │   │   ├── ...
│   ├── ...

we should invoke the util/dict_zh_words.py script first, generating Chinese Dict. we can now invoke the util/preprocess_all_datasets.py script, which will read all of this in and create four pickle files. Then, invoke the util/load_datasets.py script.

 $ python util/dict_zh_words.py
 $ python util/preprocess_all_datasets.py 
 $ python util/load_datasets.py

Start training

bash train.sh

Evaluate on test split

Acknowledgements

Thanks the original LAS, Alexander-H-Liu and awesome PyTorch team.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
config		config
data		data
log		log
model		model
util		util
README.md		README.md
load_data.sh		load_data.sh
test_demo.ipynb		test_demo.ipynb
train.sh		train.sh
train_all.py		train_all.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

data

data

log

log

model

model

util

util

README.md

README.md

load_data.sh

load_data.sh

test_demo.ipynb

test_demo.ipynb

train.sh

train.sh

train_all.py

train_all.py

Repository files navigation

Listen, Attend and Spell - PyTorch Implementation

Requirements

Chinese Mandarin corpus

Pretrained models (not supported)

Setup

Download four datasets and preprocessing

Start training

Evaluate on test split

Acknowledgements

About

Releases

Packages

Languages

Xiaoxiaohuangg/LAS-Chinese-pytorch

Folders and files

Latest commit

History

Repository files navigation

Listen, Attend and Spell - PyTorch Implementation

Requirements

Chinese Mandarin corpus

Pretrained models (not supported)

Setup

Download four datasets and preprocessing

Start training

Evaluate on test split

Acknowledgements

About

Resources

Stars

Watchers

Forks

Languages