Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
scipywavread error #194
When I run the following command on a Czech text and audio:
I get the following error:
I would guess from the error message, an that installation of a decoding library may be broken; however, if I use
Version: aeneas (18.104.22.168) (from pip)
Hi, it looks like espeak(-ng) was not able to synthesize the text correctly with the Czech voice. You can add -v and/or -l to your command line to see some verbose log and/or save it to file. Have you tried using the $ python -m aeneas.tools.synthesize_text tool to check that the synthesis takes place correctly. HTH, AP
Thank you for your reply. This is log obtained by "-l" argument while using command that produces the error: output.log
When I use
If I read correctly the log, your audio.mp3 file is long ~1120s:
However, the WAV file generated synthesizing the input text appears to be ~90310s long:
This is already quite strange, because usually the duration of the synthetic wave is within a factor of 2 the duration of the real audio. Can you double check that
If not, then there is a problem either with eSpeak(ng) or with cew --- albeit that would be strange, because the aeneas.tools.synthesize_text uses the same code.
If so, then maybe what fails is the subsequent step, i.e. ffmpeg converting the synthesized PCM16LE mono 22050 Hz WAVE file into PCM16LE mono 16000 Hz WAVE file:
Again, it would be helpful checking whether the conv16.wav you can obtain from your output.wav:
has the right duration. If not, there is a problem with your version of ffmpeg.
Also, what's the output of your:
? For reference, mine is:
I have found the source of the problem. There is a bug in my export script causing that transcript in "text.txt" is repeated many times for this particular example. So at the first glance, beginning, middle, and end of text.txt looked ok. The same was repeated when I have checked the synthesized text.
I have fixed the text.txt and everything works as expected.
Thank you for your help.
Thanks for letting me know, that definitely sounds like a plausible explanation. I believe the proximal cause was that such a big audio file exhausted your RAM, leading to the scipywavread error. (I need to find time to test and enable the C extension cwave, adding support for resampling, so that we will not have to use scipywavread anymore, or just for fallback.) AP…
On 01/12/2018 01:33 PM, Stanislav Böhm wrote: I have found the source of the problem. There is a bug in my export script causing that transcript in "text.txt" is repeated /many/ times for this particular example. So at the first glance, beginning, middle, and end of text.txt looked ok. The same was repeated when I have checked the synthesized text. I have fixed the text.txt and everything works as expected. Thank you for your help.