BERT Bias

This work is based on sent-bias.

This repository contains the code and data for the paper "On Measuring Social Biases in Sentence Encoders" by Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman and Rachel Rudinger.

Main changes:

Focus on BERT:

all_models = [
      "bert-base-uncased",
      "bert-large-uncased",
      "bert-base-multilingual-uncased",
      "distilbert-base-uncased"
  ]

Adopt to latest Huggingface Transformers API + library versions (numpy, pandas, etc.)
Add support for BERT-large

Setup

Create a virtual environment and install the requirements:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Usage

Run the following command to evaluate the bias of a model on a dataset:

python3 main.py --model bert-base-uncased --dataset name

where name is one of the filenames (without .jsonl) in the data directory, and model is one of the models in the all_models list in main.py.

Results

The results are saved in the results directory. The results are saved in a csv file with the following columns:

model: the name of the model
test: the name of the test`
p-value: the p-value of the test
effect size: the effect size of the test

License

MIT License (see LICENSE file).

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
data		data
logs		logs
results		results
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

logs

logs

results

results

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

main.py

main.py

Repository files navigation

BERT Bias

Setup

Usage

Results

License

About

Releases

Packages

Languages

License

MisterXY89/bertBias

Folders and files

Latest commit

History

Repository files navigation

BERT Bias

Setup

Usage

Results

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages