FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems

This repository contains the implementation of our paper : FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems

This work has been accepted at the Findings of NAACL'22

Link to the paper: https://aclanthology.org/2022.findings-naacl.93/

If you use this repository, please cite the following paper:

Divya Sharma and Arun Balaji Buduru. 2022. FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 1247–1258, Seattle, United States. Association for Computational Linguistics.

@inproceedings{sharma-buduru-2022-fatnet,

title = "{FA}t{N}et: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems",

author = "Sharma, Divya  and
  Buduru, Arun Balaji",
  
booktitle = "Findings of the Association for Computational Linguistics: NAACL 2022",

month = jul,

year = "2022",

address = "Seattle, United States",

publisher = "Association for Computational Linguistics",

url = "https://aclanthology.org/2022.findings-naacl.93",

pages = "1247--1258",

abstract = "Linguistic bias in Deep Neural Network (DNN) based Natural Language Processing (NLP) systems is a critical problem that needs attention. The problem further intensifies in the case of security systems, such as speaker verification, where fairness is essential. Speaker verification systems are intelligent systems that determine if two speech recordings belong to the same speaker. Such human-oriented security systems should be usable by diverse people speaking varied languages. Thus, a speaker verification system trained on speech in one language should generalize when tested for other languages. However, DNN-based models are often language-dependent. Previous works explore domain adaptation to fine-tune the pre-trained model for out-of-domain languages. Fine-tuning the model individually for each existing language is expensive. Hence, it limits the usability of the system. This paper proposes the cost-effective idea of integrating a lightweight embedding with existing speaker verification systems to mitigate linguistic bias without adaptation. This work is motivated by the theoretical hypothesis that attentive-frames could help generate language-agnostic embeddings. For scientific validation of this hypothesis, we propose two frame-attentive networks and investigate the effect of their integration with baselines for twelve languages. Empirical results suggest that frame-attentive embedding can cost-effectively reduce linguistic bias and enhance the usability of baselines.",

}

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
README.md		README.md
TDNN.py		TDNN.py
convert2wav.py		convert2wav.py
dataloader.py		dataloader.py
feature_extractor.py		feature_extractor.py
models.py		models.py
preprocess.py		preprocess.py
s0_fatnet_v1_test.py		s0_fatnet_v1_test.py
s0_fatnet_v2_test.py		s0_fatnet_v2_test.py
s1_fatnet_v1_test.py		s1_fatnet_v1_test.py
s1_fatnet_v2_test.py		s1_fatnet_v2_test.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TDNN.py

TDNN.py

convert2wav.py

convert2wav.py

dataloader.py

dataloader.py

feature_extractor.py

feature_extractor.py

models.py

models.py

preprocess.py

preprocess.py

s0_fatnet_v1_test.py

s0_fatnet_v1_test.py

s0_fatnet_v2_test.py

s0_fatnet_v2_test.py

s1_fatnet_v1_test.py

s1_fatnet_v1_test.py

s1_fatnet_v2_test.py

s1_fatnet_v2_test.py

train.py

train.py

utils.py

utils.py

Repository files navigation

FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems

About

Releases

Packages

Languages

vdivyas/FAtNet

Folders and files

Latest commit

History

Repository files navigation

FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems

About

Resources

Stars

Watchers

Forks

Languages