Attention-based-End-to-End-Speech-to-Text-Deep-Neural-Network

Implements attention-based speech-to-text transcription using Recurrent Neural Networks (RNNs) / Convolutional Neural Networks (CNNs) and dense networks. The end-to-end system transcribes a given speech utterance to its corresponding transcript. This project implements the paper Listen, Attend and Spell, using LAS Variant 1. The final model achieved a perplexity of less than 12 by incorporating teacher forcing and Gumbel noise.
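
The following is a minimal, framework-free sketch of the three ideas named above: teacher forcing, Gumbel noise, and perplexity. It is not taken from this repository's code. The function names (`gumbel_argmax`, `next_decoder_input`, `perplexity`) and the `teacher_forcing_rate` parameter are hypothetical, and it assumes Gumbel noise is added to the decoder logits before picking the next token, which is one common way to use it.

```python
import math
import random


def gumbel_noise(rng):
    """Draw one sample from the standard Gumbel(0, 1) distribution."""
    # rng.random() lies in [0, 1); clamp away from 0 so log(u) stays finite.
    u = max(rng.random(), 1e-12)
    return -math.log(-math.log(u))


def gumbel_argmax(logits, rng):
    """Pick an index by adding Gumbel noise to each logit and taking the argmax.

    This is the Gumbel-max trick: the result is a sample from softmax(logits).
    """
    noisy = [x + gumbel_noise(rng) for x in logits]
    return max(range(len(noisy)), key=noisy.__getitem__)


def next_decoder_input(target_token, logits, teacher_forcing_rate, rng):
    """Choose the token fed to the decoder at the next time step.

    With probability teacher_forcing_rate the ground-truth token is used
    (teacher forcing); otherwise the model's own noisy prediction is used.
    """
    if rng.random() < teacher_forcing_rate:
        return target_token
    return gumbel_argmax(logits, rng)


def perplexity(token_log_probs):
    """Perplexity = exp of the mean negative log-likelihood per token."""
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


if __name__ == "__main__":
    rng = random.Random(0)
    logits = [0.1, 2.0, -1.0]  # hypothetical scores over a 3-symbol vocabulary
    print(next_decoder_input(target_token=2, logits=logits,
                             teacher_forcing_rate=0.9, rng=rng))
    print(perplexity([math.log(0.25)] * 4))
```

In a real training loop the logits would come from the attention decoder at each time step, and the log-probabilities passed to `perplexity` would be those the model assigns to the reference transcript tokens.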

CMU Academic Integrity Policy

If you are currently enrolled in this course, please refer to the Carnegie Mellon University Policy on Academic Integrity before referring to any of the repository contents.

README.md
dataloader.py
main.py
models.py
train_test.py

pparmesh/Attention-based-End-to-End-Speech-to-Text-Deep-Neural-Network

Folders and files

Latest commit

History

Repository files navigation

Attention-based-End-to-End-Speech-to-Text-Deep-Neural-Network

CMU Academic Integrity Policy

About

Resources

Stars

Watchers

Forks

Languages