Fix missing `.cpu()` call causing WeSpeaker embedding pipeline to crash #1518

juanmc2005 · 2023-10-28T11:32:25Z

Problem

When the waveform and masks are both on the GPU, the WeSpeaker pipeline crashes: the computed features are still a CUDA tensor, and a CUDA tensor cannot be converted to a NumPy array without first moving it to the CPU.

Also mentioned here: juanmc2005/diart#188
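
For readers unfamiliar with the failure mode, here is a minimal sketch of the pattern behind it, using plain PyTorch rather than the pipeline code itself. The helper name `to_numpy` and the `(4, 80)` feature shape are assumptions made for illustration; this is not the actual patch.

```python
import numpy as np
import torch


def to_numpy(tensor: torch.Tensor) -> np.ndarray:
    """Hypothetical helper: copy a tensor to host memory and return a NumPy array."""
    # .detach() drops autograd history; .cpu() returns the same tensor
    # unchanged when it already lives on the CPU.
    return tensor.detach().cpu().numpy()


# Use the GPU when one is available so the sketch also runs on CPU-only machines.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
features = torch.randn(4, 80, device=device)  # illustrative shape, not the real fbank size

# Calling features.numpy() directly on a CUDA tensor raises a TypeError;
# going through .cpu() first works on either device.
array = to_numpy(features)
print(type(array).__name__, array.shape)
```

Because `.cpu()` does nothing for tensors that are already on the CPU, adding the call should be safe for CPU-only users as well.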

Minimal Reproducible Example

```python
import torch
from pyannote.audio.pipelines.speaker_verification import PretrainedSpeakerEmbedding

# Load the WeSpeaker embedding wrapper and move it to the GPU
model = PretrainedSpeakerEmbedding("hbredin/wespeaker-voxceleb-resnet34-LM")
model.to(torch.device("cuda"))

# Both the waveform batch and the masks live on the GPU
waveform = torch.randn(4, 1, 80000).cuda()
weights = torch.rand(4, 300).cuda()

# Crashes here without the fix
emb = model(waveform, weights)

print(emb.shape)
```

juanmc2005 · 2023-11-05T13:09:46Z

This is also solved by #1529
@hbredin closing this one to avoid duplicates

Fix missing .cpu() crashing WeSpeaker embedding pipeline

18c702a

juanmc2005 mentioned this pull request Oct 28, 2023

Add compatibility with pyannote 3.0 embedding wrappers juanmc2005/diart#188

Merged

Add .cpu() call when weights is None

5251f58

hbredin mentioned this pull request Nov 5, 2023

fix: compute fbank on selected device #1529

Merged

juanmc2005 closed this Nov 5, 2023
