DS8008-project

Instructions:

(1) Download glove.6B.zip from https://nlp.stanford.edu/projects/glove/ and unzip glove.6B.50d.txt into the project folder

(2) Run the code in the Jupyter notebook project.ipynb

(3) The project report is in Debiasing_word_embeddings_project_report.pdf
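
Once step (1) is done, the embeddings can be read into memory before running the notebook. The following is a minimal sketch, assuming the standard GloVe text format (one word per line followed by its space-separated vector components). The function name `load_glove` is hypothetical and may not match the code in project.ipynb.

```python
import numpy as np


def load_glove(path):
    """Read a GloVe text file into a {word: vector} dictionary.

    Assumes each line is: word v1 v2 ... vN, separated by single spaces.
    `load_glove` is an illustrative name, not taken from the notebook.
    """
    embeddings = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split(" ")
            # First field is the word, the rest are the vector components.
            embeddings[parts[0]] = np.asarray(parts[1:], dtype=np.float32)
    return embeddings
```

For example, `vectors = load_glove("glove.6B.50d.txt")` would return a dictionary in which `vectors["she"]` is a NumPy array of length 50, provided the file is the 50-dimensional one named in step (1).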

Introduction

In this project, we aim to perform hard gender debiasing on pre-trained GloVe embeddings. We use the 50-dimensional version of GloVe, which was trained on Wikipedia 2014 and Gigaword 5 and has a vocabulary of 400,000 words.

The method neutralizes non-gendered words and equalizes gender word pairs, so that any non-gendered (neutral) word is equidistant from the two words of a gender pair such as she-he. After plotting the extreme she-he occupations, we find that all occupations are at equal distance from the she and he axes. We also find that gender-specific words have moved closer to their respective gender axis (the corresponding she or he axis). In conclusion, applying this debiasing algorithm shows promising results in reducing occupational stereotypes.
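
The neutralize and equalize steps can be sketched as follows. This is an illustrative sketch only, following the hard-debiasing formulation commonly attributed to Bolukbasi et al. (2016) with a one-dimensional gender direction. It assumes all vectors are normalized to unit length and that the direction is taken from the single pair she-he. The function names are hypothetical, and the notebook's implementation may differ.

```python
import numpy as np


def gender_direction(emb, pair=("she", "he")):
    """Unit vector along the difference of one gender pair (an assumption:
    a single pair defines the direction)."""
    g = emb[pair[0]] - emb[pair[1]]
    return g / np.linalg.norm(g)


def neutralize(v, g):
    """Remove the component of v that lies along the unit vector g."""
    return v - np.dot(v, g) * g


def equalize(a, b, g):
    """Make a gender pair (a, b) symmetric about the gender direction g.

    Assumes a and b are unit vectors with different projections onto g;
    identical projections would cause a division by zero below.
    """
    mu = (a + b) / 2.0
    mu_b = np.dot(mu, g) * g          # part of the midpoint along g
    nu = mu - mu_b                    # part of the midpoint orthogonal to g
    scale = np.sqrt(max(1.0 - np.dot(nu, nu), 0.0))
    result = []
    for w in (a, b):
        diff = np.dot(w, g) * g - mu_b
        result.append(nu + scale * diff / np.linalg.norm(diff))
    return result[0], result[1]
```

After these steps, a neutralized word has no component along the gender direction, and the two equalized words differ only along that direction, so every neutralized word is at the same distance from both.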
