
How to train SEA model #4

Open
cyxomo opened this issue Aug 10, 2021 · 14 comments

Comments


cyxomo commented Aug 10, 2021

The pretrained model sea.ckpt fits a dataset with 82 speakers. However, I have a huge dataset with at least 300 speakers. How could I train a corresponding SAE model?

@auspicious3000 (Owner)

Do you mean SEA?

You can refer to the SEA paper for training details.


cyxomo commented Aug 10, 2021

I seem to have made a mistake. Actually, when preparing the data, only the Encoder part of the SEA model is used. But I'm not sure whether changing the speakers will make a difference.


cyxomo commented Aug 10, 2021

Does it matter if I take my own data and extract the features with the 82-speaker SEA model that you pre-trained?


cyxomo commented Aug 10, 2021

> Do you mean SEA?
>
> You can refer to the SEA paper for training details.

Yeah, sorry for the spelling mistake.

@cyxomo cyxomo changed the title How to train SAE model How to train SEA model Aug 10, 2021
@auspicious3000 (Owner)

The performance might degrade, but feel free to try.


cyxomo commented Aug 10, 2021

> The performance might degrade, but feel free to try.

So the right thing to do is to train an SEA model on my own data and then extract the features. Could the SEA training code be provided?

@auspicious3000 (Owner)

The majority of the code for SEA is here. You just need a data loader and an optimizer.
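To illustrate what "a data loader and an optimizer" could look like around the existing SEA model code, here is a minimal training-loop sketch. It assumes a model whose forward takes (MFCC features, speaker one-hot, mask) and returns two reconstructions, and a dataset yielding matching triples; the function name `train_sea` and all hyperparameters are illustrative, not the repo's actual API.

```python
# Hedged sketch of an SEA training loop (assumed interface, not the repo's).
import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader

def train_sea(model, dataset, epochs=10, lr=1e-4, batch_size=16, device='cpu'):
    """Train an SEA-style autoencoder with a plain Adam optimizer."""
    model = model.to(device).train()
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loader = DataLoader(dataset, batch_size=batch_size, shuffle=True)
    for _ in range(epochs):
        for cep, spk_onehot, mask in loader:
            cep = cep.to(device)
            # Two decoder outputs: direct and via the self-expressing path
            out_a, out_b = model(cep, spk_onehot.to(device), mask.to(device))
            loss = F.mse_loss(out_a, cep) + F.mse_loss(out_b, cep)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```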


cyxomo commented Aug 10, 2021

> The majority of the code for SEA is here. You just need a data loader and an optimizer.

OK, do you use a loss function like this?
[image: loss function]

@auspicious3000 (Owner)

Yes


vasyarv commented Sep 5, 2021

@auspicious3000 what is c_trg in model_sea.Generator.forward? It is an input to the Decoder's LSTM, and its dimension is the same as hparams.dim_spk, which is 82, but I still have no idea how to construct it ...

@auspicious3000 (Owner)

It is the one-hot speaker embedding.
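For concreteness, a one-hot speaker embedding of dimension hparams.dim_spk can be built as below. The speaker indices here are made-up examples; only the 82-speaker dimension comes from the thread.

```python
# Sketch: c_trg as a one-hot speaker embedding (batch, num_spk).
import torch
import torch.nn.functional as F

num_spk = 82                      # hparams.dim_spk in this repo
spk_id = torch.tensor([5, 17])    # example per-utterance speaker indices
c_trg = F.one_hot(spk_id, num_classes=num_spk).float()
# Each row has a single 1.0 at the speaker's index, zeros elsewhere.
```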


stalevna commented Nov 8, 2021

> Do you mean SEA?
>
> You can refer to the SEA paper for training details.

Hi! Could you point me to the SEA paper? I want to make sure I am reading the right one

@auspicious3000 (Owner)

Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery


wang1612 commented Nov 16, 2021

@auspicious3000
Could you check my code for the SEA training loss below:

```python
# cep_real0 is the full MFCC, not truncated by [:, 0:20]
mask_sp_real = ~sequence_mask(len_real, cep_real0.size(1))
mask = (~mask_sp_real).float()
self.P = self.P.train()
# mel_outputs_B is the decoder output for the self-expressing autoencoded Z
mel_outputs, mel_outputs_B = self.P(cep_real, spk_emb, mask)
loss_A = F.mse_loss(mel_outputs, cep_real0, reduction='mean')
loss_B = F.mse_loss(mel_outputs_B, cep_real0, reduction='mean')
p_loss = loss_A + loss_B
```
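The snippet above calls a sequence_mask helper that isn't shown. A common implementation of this idiom, offered here as an assumption rather than the repo's actual code, returns True for valid (non-padded) time steps:

```python
# Hedged sketch of a typical sequence_mask helper (assumed, not from the repo).
import torch

def sequence_mask(lengths, max_len=None):
    """Return a (batch, max_len) bool mask; True marks valid time steps."""
    if max_len is None:
        max_len = int(lengths.max())
    ids = torch.arange(max_len, device=lengths.device)
    return ids.unsqueeze(0) < lengths.unsqueeze(1)
```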
