Feb 10th Meeting #2

Open
6 of 8 tasks
realzza opened this issue Feb 11, 2023 · 0 comments
@realzza
Owner

realzza commented Feb 11, 2023

Todos

  • Check: Does VAD change speech data in data prep (P1)

    No. The VAD step only computes the VAD information and stores it in the dumpdir, in the file vad.scp. It marks the non-speech segments so that they can be excluded from training. The excluded segments may still affect our reconstruction loss, but excluding them can improve the quality of the synthesized audio. This is a tradeoff we need to be aware of.

  • Keep VITS with xvector and VAD training

  • Use the trained decoder (with aligned sample rate) to re-decode and see whether speaker information is perceptible (p2) #3

    • No, the decoded wav sample rate is still 22050 Hz. Trying the following steps:
      • check the training process
      • check how tts_inference.py uses the sample rate.
    • Inference jobs have not been eligible for submission since Feb 13th, so I couldn't decode to check whether the requirement is met.
    • Applied the retrained model. Speaker information is integrated! /ocean/projects/cis210027p/zzhou5/espnet/egs2/librispeech_100/tts_vits/exp/16k_xvector/tts_beta_lib100_vits_tts_all16k_char_xvector/decode_with_trained_16k_vocoder
  • If 3 does not work, consult Jiatong (p2)

  • Run inference w/o trained vocoder

  • Integrate VITS model in cyclic systems (p3)
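
For the sample-rate check under the re-decode item, a minimal sketch of how a decoded file's rate could be verified from its header. It assumes the decoded output is a PCM WAV file and that the target rate is 16000 Hz (inferred from the "16k" in the experiment names); the helper names and the file path in the usage note are hypothetical and are not part of ESPnet.

```python
import wave


def wav_sample_rate(path):
    """Return the sample rate (Hz) stored in a PCM WAV file's header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def check_sample_rate(path, expected_hz=16000):
    """Return (matches, actual_hz) for the WAV file at `path`.

    expected_hz defaults to 16000, which is an assumption based on the
    16k experiment names, not a value read from any config.
    """
    actual_hz = wav_sample_rate(path)
    return actual_hz == expected_hz, actual_hz
```

Calling `check_sample_rate("decoded.wav")` on a placeholder file would return `(False, 22050)` if the header still reports 22050 Hz. This reads only the header, so it would not detect audio that was written with a 16 kHz header but generated at a different rate.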

@realzza realzza added the todo label Feb 11, 2023
@realzza realzza added this to the VITS-tts milestone Feb 11, 2023
@realzza realzza self-assigned this Feb 11, 2023
realzza pushed a commit that referenced this issue Oct 23, 2023
realzza pushed a commit that referenced this issue Nov 1, 2023