Repository for the paper "Are Models Biased on Text without Gender-related Language?", accepted at ICLR 2024! In this paper, we challenge a common assumption in prior work on gender bias evaluation of large language models (LMs): that models reinforce stereotypes by picking up on gendered correlations in the training data. We instead address the question: do language models still exhibit gender bias in non-stereotypical settings?
To do so, we introduce UnStereoEval (USE), a novel framework tailored for investigating gender bias in stereotype-free scenarios. USE defines a sentence-level score based on pretraining data statistics to determine whether a sentence contains minimal word-gender associations. To systematically assess the fairness of popular language models in stereotype-free scenarios, we use USE to automatically generate benchmarks free of gender-related language. By leveraging USE's sentence-level score, we also repurpose prior gender bias benchmarks (WinoBias and Winogender) for non-stereotypical evaluation.
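For intuition, below is a minimal sketch of how such a sentence-level filter could work, assuming word-gender associations are measured with PMI against gendered anchor words using co-occurrence counts from a pretraining corpus. The toy statistics, threshold, and function names are illustrative, not the paper's exact implementation.

```python
import math

# Toy corpus statistics; real scores would come from the model's
# actual pretraining data.
TOTAL = 1_000_000
COUNTS = {"he": 9000, "she": 7000, "nurse": 300, "walked": 800, "home": 1200}
CO_COUNTS = {("nurse", "she"): 120, ("nurse", "he"): 20,
             ("walked", "he"): 40, ("walked", "she"): 35,
             ("home", "he"): 60, ("home", "she"): 55}

def pmi(word: str, anchor: str) -> float:
    """Pointwise mutual information between a word and a gender anchor."""
    p_joint = CO_COUNTS.get((word, anchor), 0) / TOTAL
    p_word = COUNTS.get(word, 0) / TOTAL
    p_anchor = COUNTS.get(anchor, 0) / TOTAL
    if min(p_joint, p_word, p_anchor) == 0:
        return 0.0  # unseen pair: treat as no measurable association
    return math.log(p_joint / (p_word * p_anchor))

def gender_polarity(word: str) -> float:
    """PMI gap toward male vs. female anchors; values near 0 are neutral."""
    return pmi(word, "he") - pmi(word, "she")

def is_stereotype_free(sentence: str, tau: float = 0.65) -> bool:
    """Accept a sentence only if no word shows a strong gender association."""
    words = [w.strip(".,!?").lower() for w in sentence.split()]
    return all(abs(gender_polarity(w)) < tau for w in words)

print(is_stereotype_free("They walked home"))       # True: near-neutral words
print(is_stereotype_free("The nurse walked home"))  # False: "nurse" skews female
```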
The unconstrained versions of the datasets can be found on 🤗 Datasets.
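For example, the benchmarks can be loaded with the `datasets` library. The repository id below is a placeholder; substitute the actual dataset name from the 🤗 Hub link above.

```python
from datasets import load_dataset

# "<org>/unstereoeval-unconstrained" is a placeholder id, not the real one.
dataset = load_dataset("<org>/unstereoeval-unconstrained")
print(dataset)
```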
Coming soon!
If you find this work interesting and useful for your research, please consider citing us :)
```bibtex
@inproceedings{belem2024-unstereoeval,
    title={Are Models Biased on Text without Gender-related Language?},
    author={Catarina G Bel{\'e}m and Preethi Seshadri and Yasaman Razeghi and Sameer Singh},
    booktitle={The Twelfth International Conference on Learning Representations},
    month={May},
    year={2024},
    url={https://openreview.net/forum?id=w1JanwReU6}
}
```