Implementation-of-JoSEC-COLING-22

Implementation of our COLING 2022 paper, Debiasing Word Embeddings with Nonlinear Geometry [1].

Our code is adapted from Debiasing Multiclass Word Embeddings (NAACL 2019).

The repository has two main components.

Identifying bias subspaces, performing hard-debiasing and MAC score calculations (./Debiasing/debias.py)
Downstream evaluations (./Downstream/BiasEvalPipelineRunner.ipynb)
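
For readers unfamiliar with the MAC score, the sketch below illustrates one way to compute it. It assumes MAC is the mean average cosine distance between target words and attribute sets, as in the NAACL 2019 work this code is adapted from. The function names `cosine_distance` and `mac_score` are hypothetical and are not taken from debias.py.

```python
import numpy as np


def cosine_distance(u, v):
    """Cosine distance: 1 minus the cosine similarity of u and v."""
    return 1.0 - float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


def mac_score(targets, attribute_sets):
    """Mean average cosine distance (assumed definition of MAC).

    targets: iterable of target word vectors.
    attribute_sets: iterable of attribute sets, each an iterable of vectors.
    """
    scores = []
    for t in targets:
        for attrs in attribute_sets:
            # Average distance from one target to every word in one attribute set.
            scores.append(np.mean([cosine_distance(t, a) for a in attrs]))
    # Average over all (target, attribute set) pairs.
    return float(np.mean(scores))
```

Under this definition, a score closer to 1 means the targets are closer to orthogonal to the attribute words.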

Data

We used pretrained baseline Word2Vec embeddings, which are available here: w2v_0
We produced Word2Vec embeddings that have been debiased using hard debiasing for gender, race, religion, and their intersection, all based on w2v_0.

Debiasing

debias.py accepts the following command-line arguments:

-embeddingPath : The path to the word2vec embeddings (Defaults to w2v_0)
vocabPath : The path to the social stereotype vocabulary
subs : The models which produce bias subspaces ('josec' is our proposed model)
eval : The evaluation set to calculate MAC scores
-hard : If this flag is set, hard debiasing is performed
-soft : If this flag is set, soft debiasing is performed
-w : If this flag is set, the output of the analogy tasks, the debiased Word2Vec embeddings, and the MAC statistics are written to disk in a folder named "./output"
-v : If this flag is set, the debias script runs in verbose mode
-k : An integer giving the number of principal components used to define the bias subspace (Defaults to 2)
-g : If this flag is set, the distributions of biased and debiased words are plotted in 2-dimensional space
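
To show how these arguments fit together, here is a minimal argparse sketch. It is an illustration only and not the parser used in debias.py. The positional order (vocabPath, subs, eval) is inferred from the example command in this README, and the default value "w2v_0" is a placeholder string standing in for the actual embedding path. The name `build_parser` is hypothetical.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the arguments listed in this README."""
    parser = argparse.ArgumentParser(description="Sketch of the debias.py interface")
    # Optional path to the embeddings; "w2v_0" is a placeholder default.
    parser.add_argument("-embeddingPath", default="w2v_0",
                        help="path to the word2vec embeddings")
    # Positional arguments, in the order assumed from the example command.
    parser.add_argument("vocabPath", help="social stereotype vocabulary")
    parser.add_argument("subs", help="model that produces bias subspaces, e.g. josec")
    parser.add_argument("eval", help="evaluation set used for MAC scores")
    # Boolean flags.
    parser.add_argument("-hard", action="store_true", help="perform hard debiasing")
    parser.add_argument("-soft", action="store_true", help="perform soft debiasing")
    parser.add_argument("-w", action="store_true", help="write outputs to ./output")
    parser.add_argument("-v", action="store_true", help="verbose mode")
    parser.add_argument("-k", type=int, default=2,
                        help="number of principal components for the bias subspace")
    parser.add_argument("-g", action="store_true", help="plot word distributions in 2D")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```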

An example command is included below.

This command performs intersectional hard debiasing based on the attributes in the input vocab file. JoSEC is used to identify the intersectional subspace, and the first 2 PCA components are used to compute the individual bias subspaces.

python debias.py inter josec inter -hard -v -k 2
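
For intuition about the PCA-based part of this pipeline, the sketch below shows the standard linear recipe: estimate a bias subspace from the top k principal components of centered defining sets, then remove that subspace from word vectors (the "neutralize" step of hard debiasing). This is a simplified illustration under stated assumptions, not the JoSEC method and not the code in debias.py. The function names, the toy data, and the re-normalization to unit length are assumptions.

```python
import numpy as np


def bias_subspace(defining_sets, k=2):
    """Top-k principal directions of the centered defining sets.

    defining_sets: list of arrays, each of shape (n_words, dim).
    Returns an array of shape (k, dim) with orthonormal rows.
    """
    # Center each defining set on its own mean, then stack all sets.
    centered = np.vstack([s - s.mean(axis=0, keepdims=True) for s in defining_sets])
    # Rows of vt are the principal directions, ordered by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k]


def hard_debias(vectors, subspace):
    """Remove the component lying in the bias subspace, then re-normalize."""
    projection = vectors @ subspace.T @ subspace
    residual = vectors - projection
    norms = np.linalg.norm(residual, axis=1, keepdims=True)
    # Guard against division by zero for vectors lying entirely in the subspace.
    return residual / np.clip(norms, 1e-12, None)


# Toy usage with random vectors standing in for real embeddings.
rng = np.random.default_rng(0)
toy_sets = [rng.normal(size=(3, 8)) for _ in range(4)]
B = bias_subspace(toy_sets, k=2)
debiased = hard_debias(rng.normal(size=(5, 8)), B)
print(np.abs(debiased @ B.T).max())  # close to zero up to floating-point error
```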

Downstream Evaluations

We evaluated the performance of the debiased word embeddings on three tasks: POS tagging, POS chunking, and NER. To reproduce our results, run the notebook ./Downstream/BiasEvalPipelineRunner.ipynb
We trained a simple LSTM-based toxicity detection model and measured FNED/FPED scores on three demographic groups: gender, race, and religion. The datasets can be downloaded from Jigsaw Unintended Bias in Toxicity Classification. To reproduce our results, train the model with ./Downstream/ToxicityDetectionLSTM.ipynb and then run ./Downstream/ToxicityDetectionEval.ipynb
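
As a rough illustration of the fairness metrics above, the sketch below computes FPED and FNED under the assumption that they denote the False Positive and False Negative Equality Differences: the sum over groups of the absolute gap between the overall error rate and the per-group error rate. It is not the code in ToxicityDetectionEval.ipynb. The function names and the convention of returning 0.0 when a rate is undefined are assumptions.

```python
def error_rates(y_true, y_pred):
    """Return (false positive rate, false negative rate) for binary labels."""
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    negatives = sum(1 for t in y_true if t == 0)
    positives = sum(1 for t in y_true if t == 1)
    # Assumed convention: an undefined rate (no examples of a class) counts as 0.0.
    fpr = fp / negatives if negatives else 0.0
    fnr = fn / positives if positives else 0.0
    return fpr, fnr


def equality_differences(y_true, y_pred, groups):
    """Return (FPED, FNED).

    groups: dict mapping a group name to a list of booleans, one per example,
    marking whether the example belongs to that group.
    """
    overall_fpr, overall_fnr = error_rates(y_true, y_pred)
    fped = 0.0
    fned = 0.0
    for mask in groups.values():
        sub_true = [t for t, m in zip(y_true, mask) if m]
        sub_pred = [p for p, m in zip(y_pred, mask) if m]
        fpr, fnr = error_rates(sub_true, sub_pred)
        fped += abs(overall_fpr - fpr)
        fned += abs(overall_fnr - fnr)
    return fped, fned
```

Lower values indicate that error rates are more similar across groups.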

Requirements

The following python packages are required (Python 3).

numpy 1.20.3
pandas 1.3.1
scipy 1.6.2
gensim 3.8.3
sklearn 0.24.2
pytorch 1.9.0
matplotlib 3.4.2
jupyter 1.0.0

Reference

[1] Lu Cheng, Nayoung Kim and Huan Liu. Debiasing Word Embeddings with Nonlinear Geometry. Proceedings of the 29th International Conference on Computational Linguistics (COLING), 2022.
