Problem with proper data loading #5

pstryczke · 2022-11-21T18:29:10Z

Hi, I'd like to run your model by myself, however I cannot find proper way to load the dataset with .mp3 files you provided. Is there a chance to share the dataloader you've used or give some hints how to process the .mp3 files to valid dataset which could be used in your usage examples? I'll be very grateful!

galfaroth · 2022-11-21T18:31:20Z

+1 I faced exact same rabbit hole. Anyone successfully ran the project or it's only paper with some unstructured chunks that can't be ran?

MrZixi · 2022-11-28T11:57:06Z

Uh, the .mp3 files need to be processed and binarized. We are still clearing the codes up. But maybe refer to files as well as the data preparation process in that repo.

pstryczke · 2022-12-02T09:01:03Z

@MrZixi
There is a csv file in base_binarizer.py from which some parameters are gained and used in further files processing, however I cannot find any reference where and how this csv file is created. Any advices? Thanks :)

galfaroth · 2022-12-02T09:13:09Z

Good question, I was reading through the code and was lost there too. @MrZixi mind you know what these .csv files are?

MrZixi · 2022-12-02T09:39:31Z

The csv file is generated by preprocessing codes. See here, an example of preprocessed Ljspeech dataset. They are basically doing things like recording wav file paths, normalizing texts, extracting phoneme and aligning phoneme sequences with wav using MFA tools.

pstryczke · 2022-12-02T16:52:14Z

As I can see, the given csv example has different column names than the keys used in NeuralSVB dataloaders, which prevents us from properly preprocessing the data. Are you going to upload some working examples for NeuralSVB repository? Thank you

galfaroth · 2022-12-05T10:02:55Z

the job demands columns like 'f0', 'uv', 'me12ph', 'me', 'prof_f0' and the code @MrZixi you provided generates 'spk', 'txt', 'txt_raw', 'ph', 'wav_fn'. Can you please explain how to properly load the demo dataset and start training the model?

galfaroth · 2022-12-12T09:36:29Z

@MrZixi ?

MrZixi · 2022-12-13T03:29:06Z

The code I referred to is an example from one of our other repos. The point is that the columns which NeuralSVB needs are processed by those codes. For example, the 'prof_f0' and 'f0' represent the amateur f0 and professional f0 information from the paired pieces. The 'mel2ph' represents the alignment between phonemes and the mel-spectrogram provided by the MFA tool. We are still clearing the codes up and it may take some time as we are now been occupied by some other things.

pstryczke · 2023-01-09T16:55:38Z

@MrZixi
But it still doesn't help us, as we have no idea how such 'f0', 'prof_f0' or 'mel2ph' binaries should be represented or preprocessed. You can try to run training from files in this repo to find out what is missing (of course except audio samples, which you provide separately)

galfaroth · 2023-02-09T09:22:32Z

I think you should probably close this repo as its impossible to run anything and there is none who can help.

MrZixi · 2023-02-10T02:50:17Z

I think you should probably close this repo as its impossible to run anything and there is none who can help.

Try to learn some manners, will you? Be patient and ask about our time schedule of releasing codes and data to arrange yours or just un-star. (Maybe nonsenses as it looks like you never star this repo)

MoonInTheRiver closed this as not planned Won't fix, can't repro, duplicate, stale Feb 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem with proper data loading #5

Problem with proper data loading #5

pstryczke commented Nov 21, 2022

galfaroth commented Nov 21, 2022

MrZixi commented Nov 28, 2022

pstryczke commented Dec 2, 2022

galfaroth commented Dec 2, 2022

MrZixi commented Dec 2, 2022

pstryczke commented Dec 2, 2022

galfaroth commented Dec 5, 2022

galfaroth commented Dec 12, 2022

MrZixi commented Dec 13, 2022

pstryczke commented Jan 9, 2023

galfaroth commented Feb 9, 2023

MrZixi commented Feb 10, 2023

Problem with proper data loading #5

Problem with proper data loading #5

Comments

pstryczke commented Nov 21, 2022

galfaroth commented Nov 21, 2022

MrZixi commented Nov 28, 2022

pstryczke commented Dec 2, 2022

galfaroth commented Dec 2, 2022

MrZixi commented Dec 2, 2022

pstryczke commented Dec 2, 2022

galfaroth commented Dec 5, 2022

galfaroth commented Dec 12, 2022

MrZixi commented Dec 13, 2022

pstryczke commented Jan 9, 2023

galfaroth commented Feb 9, 2023

MrZixi commented Feb 10, 2023