Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem running diarization and returning csv transcriptions #1

Open
RicardoGrayson opened this issue Dec 9, 2021 · 1 comment
Open

Comments

@RicardoGrayson
Copy link

Hi I'm trying to follow the video on youtube and I keep running into this issue when I start running my wav files (which i converted to mono). I'm running python 3.7 and dearpygui v0.6.415 on a windows OS and using google cloud services:

Uploading C:\Users\Robin\TTS-dataset-tools\sultansupreme-source\22050/sultan_18.wav to google cloud storage bucket C:\Users\Robin\PycharmProjects\pythonProject\venv\lib\site-packages\pydub\utils.py:198: RuntimeWarning: Couldn't find ffprobe or avprobe - defaulting to ffprobe, but may not work warn("Couldn't find ffprobe or avprobe - defaulting to ffprobe, but may not work", RuntimeWarning) Traceback (most recent call last): File "C:/Users/Robin/TTS-dataset-tools/tools.py", line 70, in run_google_speech_call builder.diarization(get_value("label_wav_file_transcribe"), get_value("input_storage_bucket"), get_value("input_project_name")) File "C:\Users\Robin\TTS-dataset-tools\dataset_builder.py", line 397, in diarization info = mediainfo(wavfile) File "C:\Users\Robin\PycharmProjects\pythonProject\venv\lib\site-packages\pydub\utils.py", line 334, in mediainfo res = Popen(command, stdout=PIPE) File "C:\Users\Robin\AppData\Local\Programs\Python\Python37\lib\subprocess.py", line 756, in __init__ restore_signals, start_new_session) File "C:\Users\Robin\AppData\Local\Programs\Python\Python37\lib\subprocess.py", line 1155, in _execute_child startupinfo) FileNotFoundError: [WinError 2] The system cannot find the file specified

Any help would be appreciated. Thanks!

@RicardoGrayson
Copy link
Author

So I got the diarization to work, but as soon as it needs to start the transcription process and after splitting all the audio files, it crashes saying:
Traceback (most recent call last): File "tools.py", line 79, in run_dataset_builder_call builder.build_dataset() File "C:\Users\Robin\TTS-dataset-tools\dataset_builder.py", line 203, in build_dataset text = text.replace("%", " percent") UnboundLocalError: local variable 'text' referenced before assignment

I don't know how to assign the 'text' local variable in dataset_builder.py without conflicting with google cloud speech-to-text.
All help appreciated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant