request documentation #1

Open
chikiuso opened this issue Oct 15, 2020 · 4 comments

Comments

@chikiuso

Hi, may I ask for documentation on how to install and run it? Thanks.

@hrnoh
Owner

hrnoh commented Oct 15, 2020

@chikiuso We will upload the documentation for running our code ASAP:)

Thanks.

@chikiuso
Author

Hi @hrnoh, thanks for updating the documentation. I tried it with VCTK but it failed. Could you give more hints on how to get it working? Also, if I would like to use my own dataset, what steps should I go through? Thanks a lot.

@chikiuso
Author

chikiuso commented Dec 3, 2020

Hi @hrnoh, may I ask what number format is used in the nikl_spk.txt file? Is it hexadecimal or something else?

@hrnoh24

hrnoh24 commented Jan 22, 2021

Hi @chikiuso, first of all, sorry for the late reply. I recommend changing the downsampling factor (denoted freq in our code) or the data preprocessing. With the hyperparameters below, the model could be trained.

data preprocessing ref1, ref2

num_mels: 80
fmin: 90
fmax: 7600
fft_size: 1024
hop_size: 256
min_level_db: -100
ref_level_db: 16
signal_normalization = True
allow_clipping_in_normalization = True
symmetric_mels = True
max_abs_value = 4
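
For readers unsure how these values interact, here is a minimal sketch of the dB scaling and normalization step. It assumes the settings follow the convention common in open-source Tacotron-style preprocessing scripts; the linked references are not reproduced in this thread, so the function names and the exact formula are assumptions rather than this repository's code.

```python
import math

# Values copied from the settings listed above.
MIN_LEVEL_DB = -100
REF_LEVEL_DB = 16
MAX_ABS_VALUE = 4
SYMMETRIC_MELS = True
ALLOW_CLIPPING = True


def amp_to_db(amplitude):
    """Convert a linear mel amplitude to dB, floored at MIN_LEVEL_DB."""
    min_level = 10 ** (MIN_LEVEL_DB / 20)
    return 20 * math.log10(max(min_level, amplitude))


def normalize(db):
    """Map [MIN_LEVEL_DB, 0] dB onto [-MAX_ABS_VALUE, MAX_ABS_VALUE]
    (symmetric) or [0, MAX_ABS_VALUE] (asymmetric)."""
    scaled = (db - MIN_LEVEL_DB) / (-MIN_LEVEL_DB)
    if SYMMETRIC_MELS:
        value = 2 * MAX_ABS_VALUE * scaled - MAX_ABS_VALUE
        low = -MAX_ABS_VALUE
    else:
        value = MAX_ABS_VALUE * scaled
        low = 0
    if ALLOW_CLIPPING:
        # Values outside the target range are clamped to its edges.
        value = min(max(value, low), MAX_ABS_VALUE)
    return value


def mel_amplitude_to_normalized(amplitude):
    """Hypothetical helper: dB conversion, reference shift, then normalization."""
    return normalize(amp_to_db(amplitude) - REF_LEVEL_DB)
```

Under these assumptions, a level of -100 dB maps to -4, -50 dB maps to 0, and 0 dB maps to 4, so the model sees mel values in the range [-4, 4].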

model parameter

freq = 16, # I recommend also trying 4, 8, or 32 for this value
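
To get a feel for what changing freq does, the sketch below computes how many time steps remain after downsampling for each candidate value. It assumes that freq keeps one step per freq mel frames and that inputs are padded up to a multiple of freq; both are assumptions about the model, and the function names and the 100-frame utterance are hypothetical.

```python
def padded_frames(num_frames, freq):
    """Round a frame count up to the nearest multiple of freq."""
    return -(-num_frames // freq) * freq


def downsampled_steps(num_frames, freq):
    """Time steps left if one step is kept per freq frames (assumed behavior)."""
    return padded_frames(num_frames, freq) // freq


if __name__ == "__main__":
    num_frames = 100  # hypothetical utterance length in mel frames
    for freq in (4, 8, 16, 32):
        print(freq, padded_frames(num_frames, freq), downsampled_steps(num_frames, freq))
```

A smaller freq keeps more time steps, so the bottleneck is wider; a larger freq compresses the sequence more aggressively.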

Also, the numbers in the nikl_spk.txt file are in decimal format.
Thanks
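
As a small illustration of why the number base matters, the sketch below parses speaker IDs as base-10 integers. The one-ID-per-line layout and the sample values are hypothetical; the actual layout of nikl_spk.txt is not described in this thread.

```python
def parse_speaker_ids(lines):
    """Parse one speaker ID per line as a base-10 integer, skipping blank lines."""
    return [int(line.strip(), 10) for line in lines if line.strip()]


# Hypothetical contents, not taken from the real file.
sample = ["10", "25", "", "103"]

if __name__ == "__main__":
    print(parse_speaker_ids(sample))
```

Reading the same text as hexadecimal would give different IDs: int("10", 16) is 16, whereas int("10", 10) is 10.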
