Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

Wake-up word emotion recognition is a task to capture the speakers’ emotional state using short lexically-matched speech such as Ok Google or Hey Siri.

Dataset & Pretrained Model at Zenodo

Reference

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words [ArXiv]

@inproceedings{kim2022hi,
  title={Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words},
  author={Taesu Kim, SeungHeon Doh, Gyunpyo Lee, Hyung seok Jun, Juhan Nam, Hyeon-Jeong Suk},
  booktitle={Proceedings of the 14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)},
  year={2022}
}

Requirements

Install python and PyTorch:
- python==3.7
- pytorch-lightning==1.4.9 (important!)
- torch==1.7.1 (Please install it according to your CUDA version.)
Other requirements:
- pip install -r requirements.txt

conda create -n YOUR_ENV_NAME python=3.7
conda activate YOUR_ENV_NAME
pip install -r requirements.txt

Training

Download the data files from HERE
Preprocessing audio: resampling to 16000
```
 python preprocessing.py
```

Transfer Learning training options:

 python train.py --freeze_type none
 python train.py --freeze_type feature # best option
 python train.py --freeze_type context
 python train.py --freeze_type all

Reproduce results in paper

Fore more examples, check bash files under scripts folder.

you can check ML performance in notebook
Reproduce performance in notebook

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
exp/HIKIA		exp/HIKIA
loader		loader
models		models
notebook		notebook
scripts		scripts
.gitignore		.gitignore
README.md		README.md
eval.py		eval.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

Reference

Requirements

Training

Reproduce results in paper

Inference using your own data (WIP)

About

Releases

Packages

Languages

seungheondoh/hi_kia

Folders and files

Latest commit

History

Repository files navigation

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

Reference

Requirements

Training

Reproduce results in paper

Inference using your own data (WIP)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages