Skip to content

notes for speech recognition,speech synthesis, signal process...

Notifications You must be signed in to change notification settings

Ella77/awesome-speech

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

awesome-speech

curated paper, notes, codes storage for speech recognition,speech synthesis, signal process...

Challenge for accuracy

SOTA https://paperswithcode.com/task/speech-recognition

Papers

Deep Speech: Scaling up end-to-end speech recognition https://arxiv.org/pdf/1412.5567v2.pdf

  • noise environment

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin https://arxiv.org/pdf/1512.02595v1.pdf

  • integrated in tensorflow model

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/pdf/1904.08779v3.pdf

English Conversational Telephone Speech Recognition by Humans and Machines https://arxiv.org/pdf/1703.02136v1.pdf

STATE-OF-THE-ART SPEECH RECOGNITION USING MULTI-STREAM SELF-ATTENTION WITH DILATED 1D CONVOLUTIONS https://arxiv.org/pdf/1910.00716v1.pdf

Purely sequence-trained neural networks for ASR based on lattice-free MMI https://www.danielpovey.com/files/2016_interspeech_mmi.pdf

Kaldi recipe

한국어 preprocess

http://speech.cbnu.ac.kr/srhome/technology/korean_recog.html

About

notes for speech recognition,speech synthesis, signal process...

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages