Yip-Jia-Qi/ACA-Net

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

PyTorch implementation of ACA-Net for speaker verification. This repository contains only the model, which can be adapted to most speaker verification training frameworks. Place TDNN.py and ACANet.py in the same folder, then run the unit test at the end of ACANet.py to confirm that everything works.

Abstract

In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling. ACA is able to distill large, variable-length sequences into small, fixed-sized latents by attending a small query to large key and value matrices. In ACA-Net, we build a Multi-Layer Aggregation (MLA) block using ACA to generate fixed-sized identity vectors from variable-length inputs. Through global attention, ACA-Net acts as an efficient global feature extractor that adapts to temporal variability unlike existing SV models that apply a fixed function for pooling over the temporal dimension which may obscure information about the signal's non-stationary temporal variability. Our experiments on the WSJ0-1talker show ACA-Net outperforms a strong baseline by 5% relative improvement in EER using only 1/5 of the parameters.
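
The core idea in the abstract, a small query attending to large key and value matrices, can be illustrated with a minimal sketch. This is not the code in ACANet.py: the class name `AsymmetricCrossAttentionSketch`, the learned latent query, and all sizes (`dim`, `num_latents`, `num_heads`) are assumptions chosen for illustration, and the attention itself is delegated to PyTorch's stock `nn.MultiheadAttention`.

```python
import torch
import torch.nn as nn


class AsymmetricCrossAttentionSketch(nn.Module):
    """Illustrative only: a small latent query attends to a long input sequence."""

    def __init__(self, dim=64, num_latents=8, num_heads=4):
        super().__init__()
        # Assumption: the small query is a learned parameter shared across inputs.
        self.latents = nn.Parameter(torch.randn(num_latents, dim) * 0.02)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x):
        # x: (batch, time, dim), where time may differ between calls.
        q = self.latents.unsqueeze(0).expand(x.size(0), -1, -1)
        # Query length is num_latents; keys and values span the full sequence.
        out, _ = self.attn(q, x, x)
        return out  # (batch, num_latents, dim), independent of time


torch.manual_seed(0)
aca = AsymmetricCrossAttentionSketch()
short = aca(torch.randn(2, 50, 64))   # 50 frames
long = aca(torch.randn(2, 400, 64))   # 400 frames
print(short.shape, long.shape)
```

Both calls return a tensor of the same shape because the output length follows the query, not the input. That is the property that lets this kind of attention stand in for temporal pooling over variable-length utterances.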

Citing ACA-Net

Please cite ACA-Net if you use it in your research or business.

@inproceedings{yip23_interspeech,
  author={Jia Qi Yip and Duc-Tuan Truong and Dianwen Ng and Chong Zhang and Yukun Ma and Trung Hieu Nguyen and Chongjia Ni and Shengkui Zhao and Eng Siong Chng and Bin Ma},
  title={{ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention}},
  year=2023,
  booktitle={Proc. INTERSPEECH 2023},
  pages={1938--1942},
  doi={10.21437/Interspeech.2023-1725}
}
