
In TensorBoard, validation does not show audio of source and target speakers #23

Closed
Penghmed opened this issue Jan 3, 2023 · 3 comments

Comments

@Penghmed

Penghmed commented Jan 3, 2023

How can I change this to see voice conversion?
I only see "gt" and "generated", but where is the source speaker, and which one is the target?

Thank you.

@OlaWod
Owner

OlaWod commented Jan 3, 2023

I did not log voice conversion during validation.
If you want that, just modify the evaluate function (starting at line 221) in train.py.

@OlaWod
Owner

OlaWod commented Jan 3, 2023

a simple example (not tested yet):

def evaluate(hps, generator, eval_loader, writer_eval):
    generator.eval()
    with torch.no_grad():
      for batch_idx, items in enumerate(eval_loader):
        if hps.model.use_spk:
          c, spec, y, spk = items
          g = spk[:1].cuda(0)
        else:
          c, spec, y = items
          g = None
        y_source = y[-1:] # modified: last batch item as source audio
        spec, y = spec[:1].cuda(0), y[:1].cuda(0)
        c = c[-1:].cuda(0) # modified: content from the source utterance
        break
      mel = spec_to_mel_torch(
        spec, 
        hps.data.filter_length, 
        hps.data.n_mel_channels, 
        hps.data.sampling_rate,
        hps.data.mel_fmin, 
        hps.data.mel_fmax)
      y_hat = generator.module.infer(c, g=g, mel=mel)
      
      y_hat_mel = mel_spectrogram_torch(
        y_hat.squeeze(1).float(),
        hps.data.filter_length,
        hps.data.n_mel_channels,
        hps.data.sampling_rate,
        hps.data.hop_length,
        hps.data.win_length,
        hps.data.mel_fmin,
        hps.data.mel_fmax
      )
    image_dict = {
      "gen/mel": utils.plot_spectrogram_to_numpy(y_hat_mel[0].cpu().numpy()),
      "target/mel": utils.plot_spectrogram_to_numpy(mel[0].cpu().numpy()) # modified
    }
    audio_dict = {
      "gen/audio": y_hat[0],
      "source/audio": y_source[0], # modified
      "target/audio": y[0] # modified
    }
    utils.summarize(
      writer=writer_eval,
      global_step=global_step, 
      images=image_dict,
      audios=audio_dict,
      audio_sampling_rate=hps.data.sampling_rate
    )
    generator.train()
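The single-item slices in the snippet matter because they keep the batch dimension, so the tensors stay shape-compatible with infer. A quick pure-Python illustration, using lists as stand-ins for tensors (an assumption; the batch-dimension-preserving behavior of these slices is the same):

```python
# A stand-in for a validation batch of 3 utterances.
batch = ["utt0", "utt1", "utt2"]

target = batch[:1]   # first item, batch dim kept -> ["utt0"]
source = batch[-1:]  # last item, batch dim kept  -> ["utt2"]

# By contrast, [:-1] keeps ALL items except the last one, so it is
# not a single-item slice once the batch size exceeds 2:
not_single = batch[:-1]  # -> ["utt0", "utt1"]

print(target, source, not_single)
```

Plain indexing (batch[0]) would drop the batch dimension on a tensor, which is why slicing is used throughout evaluate.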

@Penghmed
Author

Penghmed commented Jan 4, 2023

Thank you, I will try it.

@Penghmed Penghmed closed this as completed Jan 4, 2023