Introduction

This repository contains an implementation of a voice conversion method, based on the WORLD vocoder, that generates childlike speech. It maps adult speech characteristics to those of children; the modifications are based on an acoustic study of children's speech \cite{DBLP:journals/jasa/Lee1999}. The original motivation was data augmentation to improve ASR performance on child speakers.
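To make the idea of mapping adult characteristics to children's more concrete, here is a minimal, dependency-free sketch of two of the operations involved: raising F0 toward a target and stretching a spectral envelope along the frequency axis so that its peaks (formants) move upward. It is an illustration under stated assumptions, not the code in childrenize.py. The function names scale_f0 and warp_envelope are hypothetical, the linear warping rule is an assumption, and an F0 value of 0.0 is assumed to mark an unvoiced frame.

```python
def scale_f0(f0_track, target_mean_hz):
    """Scale voiced F0 values so that their mean equals target_mean_hz.

    Assumption: a value of 0.0 marks an unvoiced frame and is left unchanged.
    """
    voiced = [f for f in f0_track if f > 0.0]
    if not voiced:
        return list(f0_track)
    ratio = target_mean_hz / (sum(voiced) / len(voiced))
    return [f * ratio if f > 0.0 else 0.0 for f in f0_track]


def warp_envelope(envelope, alpha):
    """Stretch one spectral-envelope frame along the frequency axis by alpha.

    Output bin k takes the value found at input position k / alpha
    (linear interpolation), so a peak at bin p moves to about bin p * alpha.
    Hypothetical linear warping, chosen only for illustration.
    """
    n = len(envelope)
    warped = []
    for k in range(n):
        pos = min(k / alpha, n - 1)   # source position, clamped to the last bin
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        warped.append((1.0 - frac) * envelope[lo] + frac * envelope[hi])
    return warped


if __name__ == "__main__":
    # Two voiced frames with a mean of 120 Hz, scaled to a 300 Hz target.
    print(scale_f0([0.0, 100.0, 140.0], 300.0))
    # A single peak at bin 4, warped with alpha = 1.5.
    print(warp_envelope([0, 0, 0, 0, 1, 0, 0, 0, 0, 0], 1.5))
```

A warping factor above 1 shifts spectral peaks to higher frequencies, which is the direction expected when moving from an adult vocal tract to a shorter, child-sized one.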

Corresponding paper

Data augmentation for children ASR and child-adult speaker classification using voice conversion methods
Zhao S., M. Singh, A. Woubie, R. Karhila
24th INTERSPEECH Conference, Dublin, Ireland, 4593-4597

Installation

pip install -r requirements.txt

Usage

Parameters can be randomly generated according to children's speech statistics:

python childrenize.py <input audio> <output audio>

Parameters can also be specified explicitly, as follows:

python childrenize.py -f 300 <input audio> <output audio>

The command above sets a target F0 of 300 Hz while keeping the spectral warping factor and the vowel stretching factor randomized.
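For data augmentation, the same command usually needs to be run over many files. The sketch below wraps the two invocations shown above in a small Python helper. It uses only the -f option and the positional input and output arguments documented here. The helper names build_command and childrenize_dir, the dry_run switch, and the .childrenized.wav output naming (borrowed from the example files) are assumptions made for illustration.

```python
import subprocess
import sys
from pathlib import Path


def build_command(src, dst, target_f0=None, script="childrenize.py"):
    """Build the argument list for one childrenize.py call.

    With target_f0=None, all parameters are left to the script's
    randomization; otherwise the target F0 (Hz) is passed with -f.
    """
    cmd = [sys.executable, script]
    if target_f0 is not None:
        cmd += ["-f", str(target_f0)]
    cmd += [str(src), str(dst)]
    return cmd


def childrenize_dir(in_dir, out_dir, target_f0=None, dry_run=False):
    """Run childrenize.py on every .wav file in in_dir (hypothetical helper).

    Returns the list of commands; with dry_run=True nothing is executed.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    commands = []
    for wav in sorted(Path(in_dir).glob("*.wav")):
        dst = out / (wav.stem + ".childrenized.wav")
        cmd = build_command(wav, dst, target_f0)
        commands.append(cmd)
        if not dry_run:
            # check=True raises CalledProcessError if the script fails.
            subprocess.run(cmd, check=True)
    return commands
```

Calling childrenize_dir("utterance", "augmented") from the repository root would produce one randomized childrenized copy per input file; passing target_f0=300 fixes the target F0 as in the second command above.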

Example audio files

utterance/male.wav: original male speech
utterance/female.wav: original female speech
utterance/male.childrenized.wav: childrenized male speech with randomized parameters
utterance/female.childrenized.wav: childrenized female speech with randomized parameters
