Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gradio Live , Create Dataset gives an error : ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation. #3790

Open
Rakshasv18 opened this issue Jun 13, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@Rakshasv18
Copy link

Describe the bug

Dataset building + XTTS finetuning and inference in google colab

it requires : pip install transformers -U along with other packages to run smoothly.

When i try to upload my data one was 7.9mb and other was 157 mb data of mp3 and wav resp.
The first step is to create dataset , when i try to run i get the below error :

Traceback (most recent call last):
File "/content/TTS/TTS/demos/xtts_ft_demo/xtts_demo.py", line 215, in preprocess_dataset
train_meta, eval_meta, audio_total_size = format_audio_list(audio_path, target_language=language, out_path=out_path, gradio_progress=progress)
File "/content/TTS/TTS/demos/xtts_ft_demo/utils/formatter.py", line 56, in format_audio_list
asr_model = WhisperModel("large-v2", device=device, compute_type="float16")
File "/usr/local/lib/python3.10/dist-packages/faster_whisper/transcribe.py", line 128, in init
self.model = ctranslate2.models.Whisper(
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
Loading Whisper Model!
Traceback (most recent call last):
File "/content/TTS/TTS/demos/xtts_ft_demo/xtts_demo.py", line 215, in preprocess_dataset
train_meta, eval_meta, audio_total_size = format_audio_list(audio_path, target_language=language, out_path=out_path, gradio_progress=progress)
File "/content/TTS/TTS/demos/xtts_ft_demo/utils/formatter.py", line 56, in format_audio_list
asr_model = WhisperModel("large-v2", device=device, compute_type="float16")
File "/usr/local/lib/python3.10/dist-packages/faster_whisper/transcribe.py", line 128, in init
self.model = ctranslate2.models.Whisper(
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.

To Reproduce

Try the google colab notebook

Dataset building + XTTS finetuning and inference
Running the demo
To start the demo run the first two cells (ignore pip install errors in the first one)

Then click on the link Running on public URL: when the demo is ready.

Downloading the results
You can run cell [3] to zip and download default dataset path

You can run cell [4] to zip and download the latest model you trained

Expected behavior

Dataset along with transcriptions to fine tune the model

Logs

No response

Environment

Google Colab

Additional context

No response

@Rakshasv18 Rakshasv18 added the bug Something isn't working label Jun 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant