CUFFT_INTERNAL_ERROR for certain hardware #40

Closed
SamuelBroughton opened this issue May 31, 2024 · 1 comment

Comments


SamuelBroughton commented May 31, 2024

Installing the repo as described installs torch built with CUDA 11.7.

On a device such as the RTX 4090, you will then get an error when trying to run diarization with NeMo.

For example, if you just run python run_inference.py you will get an error similar to this when torch.stft is called:

NOTSOFAR1-Challenge/diarization/word_based_diarization.py", line 105, in extract_speaker_embedding_for_words
    _, tmp_embedding = spk_model.forward(input_signal=word_wavs, input_signal_length=word_lens)

...

RuntimeError: cuFFT error: CUFFT_INTERNAL_ERROR

As pytorch/pytorch#88038 suggests, cuda 11.8 fixes the issue.

Upgrading pytorch to the latest stable version built with CUDA 11.8 fixes the issue:

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
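For anyone unsure whether their setup is affected, here is a minimal sketch of a check. It is not part of the repo. The helper names `parse_cuda_version` and `likely_affected` are hypothetical, and the rule it encodes is an assumption drawn from this report: a GPU with compute capability 8.9 or higher (the RTX 4090 reports 8.9) combined with a torch build on CUDA older than 11.8 is treated as likely to hit the cuFFT error.

```python
from typing import Optional, Tuple


def parse_cuda_version(version: Optional[str]) -> Optional[Tuple[int, int]]:
    """Turn a string such as '11.7' into (11, 7); return None for CPU-only builds."""
    if not version:
        return None
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def likely_affected(cuda_version: Optional[str], capability: Tuple[int, int]) -> bool:
    """Heuristic (assumption, not an official rule): newer GPUs with CUDA < 11.8."""
    parsed = parse_cuda_version(cuda_version)
    if parsed is None:
        return False  # CPU-only torch build, cuFFT is not involved
    return capability >= (8, 9) and parsed < (11, 8)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        # torch.version.cuda is the CUDA version torch was built with, e.g. '11.7'
        cap = torch.cuda.get_device_capability(0)
        print("torch CUDA build:", torch.version.cuda)
        print("GPU compute capability:", cap)
        print("likely affected:", likely_affected(torch.version.cuda, cap))
    else:
        print("torch with a CUDA device is not available; nothing to check")
```

If the script reports that the setup is likely affected, reinstalling torch from the cu118 index as described above is the fix reported in this issue.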

Collaborator

nidleo commented Jun 2, 2024

Thank you @SamuelBroughton for reporting this issue and providing a fix.
We will consider updating the environment to use cu118. However, since this requires some testing for both training and inference, it may take some time before we can implement the change.
