Skip to content
Project for DS8008 (Natural Language Processing) : Debiasing word embeddings - implementation of
Jupyter Notebook
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.
GloVe Post_debiasing.png
GloVe Pre_debiasing.png Update Apr 23, 2019



(1) Download from and unzip glove.6B.50d.txt into the project folder

(2) Run the code in the jupyter notebook project.ipynb

(3) Project report can be found here


In this project, we aim to perform hard gender debiasing on pre-trained GloVe embeddings. For this project, we have chosen the 50-dimensional version of GloVe, which is based on Wik- ipedia 2014 and Gigaword5 and has 400,000 words.

The method used consists of neutralizing and equalizing gender word pairs in such a way that any non-gendered/neutral word is at equal distance to gender word pairs such as she-he.After plotting the extreme she-he occupations, we find that all occupations are at equal distance from the she and he axis. We also find that gender specific words have moved closer to their respective gender axis (corresponding she or he axis). Conclusions. The application of the suggested debiasing algorithm demonstrates promising results in terms of debiasing occupational stereotypes. GloVe Pre_debiasing GloVe Post_debiasing

You can’t perform that action at this time.