bias-bert

Set Up

clone repository and cd into bias-bert
install requirements.txt in your environment

Get and Prepare Data

python res_data/IMDB_data_preparation_script.py | tee data_prep.txt
python cd res_data/twitter_data_preparation_script.py | tee data_prep.txt

Train

Train the models with train_pytorch.py. In the script, three variables are specified: (1) the task (i.e. "IMDB" or "Twitter"), (2) the defined model_id of the pretrained model (find a list of all options below), and (3) the data specification(s) (spec) that are used to train the model(s). Each specification determines a different subset of test and training data and results in one model. Further training variables are defined in train().

Specify the variables directly in the script before calling train() or call train() with the corresponding function variables, e.g., train(task='Twitter', model_id='bertlarge', spec='mix_pro', lr_in=2e-5, batch_s=16, run="ex_Tw_LR", name_addition='LR2')

Trained models were evaluated with evaluate_pytorch.py and evaluate.ipynb (accuracy, f1 score, ...).

Possible Variables

specs are "N_pro", "N_weat", "N_all", "mix_pro", "mix_weat", "mix_all", "original";
model_id can be "bertbase", "bertlarge", "distbase", "distlarge", "robertabase", "robertalarge", "albertbase", "albertlarge",
which correspond to the pretrained Hugging Face Models bert-base-uncased, bert-large-uncased, distilbert-base-uncased, distilbert-large-uncased, roberta-base, roberta-large, albert-base-v2, albert-large-v2.

Rate Experimental Samples

Rate gender samples with the trained model by calling rate() in rate.py, e.g.,
rate('IMDB', 'bertbase', 'original', 'weat')

The ratings are saved in pandas data frames as pickle into res_results/. This data is needed to calculate the biases.

Analyse Results: Calculate and Plot Biases

Tables and Plots were created with res_plots/biases.ipynb and res_plots/tables.ipynb.

Reference

This work has been published in:
Jentzsch, S. F., & Turan, C. (2022). Gender Bias in BERT-Measuring and Analysing Biases through Sentiment Rating in a Realistic Downstream Classification Task. GeBNLP 2022, 184.

Resources

IMDB data
Stanford data

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
res_data		res_data
res_models		res_models
res_plots		res_plots
res_results		res_results
.gitignore		.gitignore
README.md		README.md
evaluate.ipynb		evaluate.ipynb
evaluate_pytorch.py		evaluate_pytorch.py
evaluation_df		evaluation_df
pytorch.yml		pytorch.yml
rate.py		rate.py
train_functions.py		train_functions.py
train_pytorch.py		train_pytorch.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bias-bert

Set Up

Get and Prepare Data

Train

Possible Variables

Rate Experimental Samples

Analyse Results: Calculate and Plot Biases

Reference

Resources

About

Releases

Packages

Languages

sciphie/bias-bert

Folders and files

Latest commit

History

Repository files navigation

bias-bert

Set Up

Get and Prepare Data

Train

Possible Variables

Rate Experimental Samples

Analyse Results: Calculate and Plot Biases

Reference

Resources

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages