Join GitHub today
GitHub is home to over 36 million developers working together to host and review code, manage projects, and build software together.Sign up
CTC decode very slow when training Mandarin Model #1831
I got the alphabet which contains 4333 Chinese characters, vocab, 3gram language model with lm.binary and trie.
Training Acoustic model is is normal, but the ctc-decode step is very very slowly(three batch(batch size is 12) of test data take two days to decode)!!!
My training environment is:
This is not surprising as the number of elements in the alphabet increases dramatically from the case of English, where the problem is not noticeable.
What were are starting to experiment is an implementation of Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes which would have the CTC always output to 256 elements independent of language.