Strange humming sound during `SP` & `AP` #179

loct824 · 2024-03-17T03:49:55Z

Hi,

We trained an english model for DiffSinger, but we find that for the synthesized songs, in the middle part of the song where SP & AP occurs, the model gives strange voicing that sounds like the singer is humming a constant strange sound.

We give an example below which we use arrows to indicate where that strange humming sound happens.

Could you give us some advice on how the model can be improved/trained to eliminate this strange humming sound during breaks/silence?

'phonemes': [{'name': 'SP', 'duration': 1.3062181},
  {'name': 'AP', 'duration': 0.255292},
  {'name': 'sh', 'duration': 0.1184899},
  {'name': 'uh', 'duration': 0.1555967},
  {'name': 'dx', 'duration': 0.0234931},
  {'name': 'ax', 'duration': 0.075178},
  {'name': 'b', 'duration': 0.0830091},
  {'name': 'ih', 'duration': 0.1427231},
  {'name': 'n', 'duration': 0.06},
  {'name': 's', 'duration': 0.12},
  {'name': 't', 'duration': 0.05},
  {'name': 'r', 'duration': 0.05},
  {'name': 'ao', 'duration': 0.26},
  {'name': 'ng', 'duration': 0.17},
  {'name': 'y', 'duration': 0.05},
  {'name': 'ae', 'duration': 0.23},
  {'name': 'q', 'duration': 0.1454828},
  {'name': 'ay', 'duration': 0.1745172},
  {'name': 'l', 'duration': 0.2},
  {'name': 'ay', 'duration': 0.5},
  {'name': 'd', 'duration': 0.09},
  {'name': 'AP', 'duration': 0.22},
  {'name': 'n', 'duration': 0.0799999},
  {'name': 'ow', 'duration': 0.1300001},
  {'name': 'b', 'duration': 0.04},
  {'name': 'ah', 'duration': 0.1566115},
  {'name': 'dx', 'duration': 0.0233885},
  {'name': 'iy', 'duration': 0.22},
  {'name': 'g', 'duration': 0.16},
  {'name': 'eh', 'duration': 0.2},
  {'name': 't', 'duration': 0.0699999},
  {'name': 's', 'duration': 0.2000001},
  {'name': 'm', 'duration': 0.08},
  {'name': 'iy', 'duration': 0.5209856},
  {'name': 'l', 'duration': 0.0690144},
  {'name': 'ay', 'duration': 0.57},
  {'name': 'k', 'duration': 0.1694275},
  {'name': 'AP', 'duration': 0.2605725},
  {'name': 'y', 'duration': 0.13},
  {'name': 'uw', 'duration': 0.3566036},
  {'name': 'uw', 'duration': 0.7538354},
  {'name': 'SP', 'duration': 1.3531745}, <---------------------
  {'name': 'AP', 'duration': 0.2956739}, <---------------------
  {'name': 'k', 'duration': 0.0907126},
  {'name': 'uh', 'duration': 0.1397},
  {'name': 'dx', 'duration': 0.0203},
  {'name': 'ax', 'duration': 0.09},
  {'name': 'ng', 'duration': 0.06},
  {'name': 'k', 'duration': 0.06},

The text was updated successfully, but these errors were encountered:

yqzhishen · 2024-03-19T05:56:44Z

This seems like a possible labeling issue. If you didn't label the AP and SP areas accurately, the model may pronounce something on these two phonemes.

loct824 · 2024-03-22T21:51:19Z

Do you mean that it relates to the quality of the transcriptions.csv? whether each labelled phoneme correctly correspond to the part in the audio? Any guidance how we could improve other than manually refine the phoneme time positions labelling? thanks.

yqzhishen · 2024-03-23T14:17:25Z

If you enabled some variance parameters then controlling them can be a workaround. But on the training side I cannot provide more advice without further information.

hrukalive closed this as completed Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strange humming sound during `SP` & `AP` #179

Strange humming sound during `SP` & `AP` #179

loct824 commented Mar 17, 2024

yqzhishen commented Mar 19, 2024

loct824 commented Mar 22, 2024

yqzhishen commented Mar 23, 2024

Strange humming sound during SP & AP #179

Strange humming sound during SP & AP #179

Comments

loct824 commented Mar 17, 2024

yqzhishen commented Mar 19, 2024

loct824 commented Mar 22, 2024

yqzhishen commented Mar 23, 2024

Strange humming sound during `SP` & `AP` #179

Strange humming sound during `SP` & `AP` #179