Project on Automatic Speech Recogition for course Deep Learning Speech Processing
- LibriSpeech100 (100 hours) - https://www.openslr.org/12
- Download LibriSpeech100 Dataset from the link mentioned above.
- Run CreatePickleFiles.ipynb - Creates Pickle Files of the Input Data
- Create train_all, dev_all and test_all folders in train, dev and test folders respectively which lie inside the LibriSpeech100 folder. We will gather all our input and output files in these directories.
- Run GatherPickleAndTextFiles.ipynb - Gathers all the input pickle and output text files and places them in their appropriate folders
- Run speech2text.py on terminal or Speech2Text.ipynb in a jupyter environment
- Hori, Takaaki, Jaejin Cho, and Shinji Watanabe. "End-to-end speech recognition with word-based RNN language models." 2018 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2018.
- https://web.stanford.edu/class/cs224s/lectures/224s.17.lec8.pdf
- https://arxiv.org/pdf/1803.10225.pdf
- Miniconda
- Tensorflow 2 Setup Tensorflow 2 on Conda