
MultiSpeech

This is a PyTorch implementation of MultiSpeech: Multi-Speaker Text to Speech with Transformer

(Figure: model architecture)

Train on your data

To train the model on your own data, follow the steps below:

1. Data preprocessing

  • Prepare your data as a pipe-separated values (PSV) file with the columns below, without the header row:
speaker_id|audio_path|text|duration
0|path/to/file.wav|the text in that file|3.2

Speaker IDs must be integers starting from 0.

  • Make sure all audio files are mono; if they are not, convert them before training. A validation sketch follows this list.
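As a quick sanity check before training, a minimal sketch along these lines can validate the manifest. The function name, the manifest file name, the use of torchaudio, and the assumption that speaker IDs are contiguous are all illustrative choices here, not part of this repository:

import torchaudio  # assumed available; any audio library that reports channel count works

def validate_manifest(manifest_path):
    """Check the PSV manifest: 4 fields per line, integer speaker IDs, mono audio."""
    speaker_ids = set()
    with open(manifest_path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            fields = line.rstrip("\n").split("|")
            assert len(fields) == 4, f"line {line_no}: expected 4 pipe-separated fields"
            speaker_id, audio_path, text, duration = fields
            assert speaker_id.isdigit(), f"line {line_no}: speaker_id must be an integer"
            speaker_ids.add(int(speaker_id))
            float(duration)  # raises if the duration column is not numeric
            info = torchaudio.info(audio_path)
            assert info.num_channels == 1, f"{audio_path} is not mono"
    # Assumption: IDs should cover 0..N-1 with no gaps (typical for an embedding table)
    assert speaker_ids == set(range(len(speaker_ids))), \
        "speaker IDs must start at 0 and be contiguous"

validate_manifest("train_data.txt")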

2. Setup development environment

  • Create a virtual environment
python -m venv env
  • Activate the environment
source env/bin/activate
  • Install the required dependencies
pip install -r requirements.txt

3. Training

  • Update the config file if needed
  • Train the model (a helper sketch for producing the train/test manifests follows this step)
    python train.py --train_path train_data.txt --test_path test_data.txt --checkpoint_dir outdir --epoch 100 --batch_size 64
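The training script takes separate train and test manifests. If your data lives in a single PSV file, a simple random split like the sketch below produces the two files; the input file name, split fraction, and seed are illustrative assumptions:

import random

def split_manifest(manifest_path, train_path, test_path, test_fraction=0.05, seed=0):
    """Randomly split one PSV manifest into train and test manifests."""
    with open(manifest_path, encoding="utf-8") as f:
        lines = f.readlines()
    random.Random(seed).shuffle(lines)
    n_test = max(1, int(len(lines) * test_fraction))
    with open(test_path, "w", encoding="utf-8") as f:
        f.writelines(lines[:n_test])
    with open(train_path, "w", encoding="utf-8") as f:
        f.writelines(lines[n_test:])

split_manifest("all_data.txt", "train_data.txt", "test_data.txt")

Note that a purely random split may leave a rare speaker out of the training set; for multi-speaker TTS you generally want every speaker ID to appear in the training manifest, so check the split (or stratify by speaker) before training.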
