All codes assume running from root directory. Please update the sys path at the beginning of the codes before running.
[Evolutionary Neural Architecture Search For Vietnamese Speaker Recognition]
Nguyễn Trường Lâu - Student at University of Information Technology (UIT)
VoiceNet - convolutional neural network architecture found by nondominated sorting genetic algorithm ii
with code: 0-10 - 1-01-001 - 0-00. Two goals are optimized: accuracy and cost calculation
python == 3.6.9 #mandatory
glob2==0.7
h5py==2.10.0
imutils==0.5.3
Keras==2.4.3 #mandatory
librosa==0.6.3
matplotlib==3.2.2
numpy==1.19.5
opencv-contrib-python==4.1.2.30
opencv-python==4.1.2.30
pandas==1.1.5
Pillow==7.0.0
prefetch-generator==1.0.1
pydub==0.24.1
scikit-image==0.16.2
scipy==1.4.1
sklearn==0.0
spacy==2.2.4
tensorboard==2.4.0
tensorflow==2.4.0 #mandatory
pip install -r requirements.txt
model/voicenet.hdf5
To setup:
>>> pip install voicenet
To use:
>>> from voicenet import VoiceNet
>>> model = VoiceNet(input_shape=(40,80,1), classes=40)
>>> model.summary()
data--
--Speaker One
voice_1.wav
voice_2.wav
--Speaker Two
voice_1.wav
voice_2.wav
python normalization_data.py --cfg=cfg/config.cfg
python train.py -batch_size 32 -num_class 40 -epochs 100
If you find the code useful for your research, please consider citing our works
@article{voicenet,
title={Evolutionary Neural Architecture Search For Vietnamese Speaker Recognition},
author={Nguyễn Trường Lâu - Student at University of Information Technology (UIT)},
booktitle={NAS},
year={2020}
}