How to use it with non-parallel data? #17

BenRafatian · 2021-06-30T16:02:57Z

I was wondering whether I could use two non-parallel voice datasets, where each speaker utters completely different sentences and there are no transcripts of the data.

Also, does this program do end-to-end conversion, or does it train a speech-to-text model and then recreate the speech in the converted voice from the text?

hikaruhotta · 2021-07-02T19:18:04Z

MaskCycleGAN-VC is for non-parallel voice conversion, so your dataset would work!

The model does end-to-end conversion, meaning that it converts mel-spectrograms to mel-spectrograms.
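To illustrate what "non-parallel" means in practice, here is a minimal sketch of unpaired sampling. It is not the repository's data loader: the function name `sample_unpaired_batch` and the file names are hypothetical, and the strings stand in for mel-spectrograms. The point is only that source and target examples are drawn independently, so the two speakers need neither matching sentences nor transcripts.

```python
import random


def sample_unpaired_batch(utts_a, utts_b, batch_size, rng):
    """Draw examples for each speaker independently (no sentence pairing)."""
    batch_a = [rng.choice(utts_a) for _ in range(batch_size)]
    batch_b = [rng.choice(utts_b) for _ in range(batch_size)]
    return batch_a, batch_b


# Hypothetical placeholders: in a real setup these would be mel-spectrograms
# computed from each speaker's recordings. The two sets may differ in size
# and content.
speaker_a = [f"A_utt{i:03d}" for i in range(5)]
speaker_b = [f"B_utt{i:03d}" for i in range(8)]

rng = random.Random(0)  # fixed seed so the sketch is reproducible
batch_a, batch_b = sample_unpaired_batch(speaker_a, speaker_b, 4, rng)
print(batch_a)
print(batch_b)
```

Because the two batches are sampled separately, position `i` in `batch_a` has no relationship to position `i` in `batch_b`. That is the setting a non-parallel method is designed for.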

BenRafatian · 2021-07-03T12:43:55Z

Thank you @hikaruhotta for your help.

BenRafatian closed this as completed Jul 3, 2021
