Skip to content

Benchmark dataset for anti-queer bias in large language models (LLMs)

License

Notifications You must be signed in to change notification settings

katyfelkner/winoqueer-v0

Repository files navigation

winoqueer-v0

Benchmark dataset for anti-queer bias in large language models (LLMs)

Our paper, Towards Winoqueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models, was published in the QueerInAI workshop at NAACL 2022!

Repo contents:

Finetuning Data

Finetuning Data are currently down because of licensing concerns - sorry for the outage. Expect the correct data to be posted on or before 09/09.

Finetuning Scripts

Scripts use to preprocess data (segment and normalize) and finetune models. Tweets are normalized using TweetNormalizer from BERTweet.

Model Checkpoints

Model Checkpoints are included for four models (BERT_base, BERT_large, SpanBERT_base, SpanBERT_large) under three finetuning conditions (none, LGBTQ+ news, LGBTQ+ twitter).

Benchmark Data

winoqueer_benchmark.csv is the benchmark data used in our experiments in the paper. Use this to replicate our results!

Our data follows the CrowS-Pairs format, and you should use their evaluation script to run our metric.

Note

Some files in this repo are large. You will probably need to use Git LFS.

About

Benchmark dataset for anti-queer bias in large language models (LLMs)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages