NOTE!

This repository is currently being refactored, and therefore files may change.

Automatic Speech-to-Text (AST)

Sequence-to-sequence model to train speech-to-text systems.

Reference: Pre-training on high-resource speech recognition improves low-resource speech-to-text translation, Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater

Fisher data

We preprocessed the English translations released by:

Improved Speech-to-Text Translation with the Fisher and Callhome Spanish–English Speech Translation Corpus, Matt Post, Gaurav Kumar, Adam Lopez, Damianos Karakos, Chris Callison-Burch and Sanjeev Khudanpur, IWSLT 2013

and make them available here.

Fisher Spanish speech data is available from LDC (LDC2010S01)

Installation

We use Chainer as our deep learning framework

Installation:

create a conda environment with Python 3:

conda create --name ast python=3

activate new environment:

source activate ast

install CuPy

pip install cupy-cuda91

install chainer

pip install chainer

check if Chainer detects GPU support. Launch python:

$ python

Python 3.7.1
[GCC 7.3.0] :: Anaconda, Inc. on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import chainer
>>> chainer.backends.cuda.available
True
>>> chainer.backends.cuda.cudnn_enabled
True
>>>

install NLTK. Used to extract stop word lists for target languages, and for computing evaluation metrics such as BLEU score.

conda install nltk

install tqdm for progress bar support

conda install tqdm

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data/fisher		data/fisher
experiments		experiments
linking_files		linking_files
preprocessing		preprocessing
README.md		README.md
README.pdf		README.pdf
__init__.py		__init__.py
beam.py		beam.py
config.py		config.py
copy_params.py		copy_params.py
dataloader.py		dataloader.py
enc_dec.py		enc_dec.py
eval.py		eval.py
nmt_run.py		nmt_run.py
nn.py		nn.py
run_exp.bat		run_exp.bat
seq2seq.py		seq2seq.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NOTE!

Automatic Speech-to-Text (AST)

Fisher data

Installation

About

Releases

Packages

Contributors 2

Languages

0xSameer/ast

Folders and files

Latest commit

History

Repository files navigation

NOTE!

Automatic Speech-to-Text (AST)

Fisher data

Installation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages