
BiasMonkey

As large language models (LLMs) become more capable, more people have become interested in using LLMs as proxies for humans in real-world tasks where subjective labels are desired, such as surveys and opinion polling. One widely cited barrier to adopting LLMs for these tasks is their sensitivity to prompt wording; interestingly, humans also display sensitivity to changes in instructions, in the form of response biases. But are the sensitivities and biases displayed by LLMs the same as or different from those displayed by humans?

BiasMonkey is a framework that answers this question: it allows you to evaluate whether LLMs exhibit human-like response biases in survey questionnaires. You can find more details in our paper, linked below, or browse the dataset and results from our experiments. We'd be happy if you try out BiasMonkey on your own LLMs or survey questions, and please feel free to reach out via the issues page if you run into any problems.

Paper:
Do LLMs exhibit human-like response biases? A case study in survey design
Lindia Tjuatja*, Valerie Chen*, Sherry Tongshuang Wu, Ameet Talwalkar, Graham Neubig

Usage

First, install the requirements:

pip install -r requirements.txt

Then you can run the notebooks in the analysis directory:

  • full_analysis.ipynb: Generates results for all models across response biases and non-bias perturbations.
  • correlation_human_behavior.ipynb: Computes human and model response distributions for all relevant questions and the Wasserstein distance between the two distributions (see the sketch after this list).
  • uncertainty_analysis.ipynb: Generates uncertainty measures for all models across response biases and non-bias perturbations.
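
For illustration, here is a minimal sketch of the kind of distribution comparison performed when computing the Wasserstein distance between human and model responses. The variable names and example values below are hypothetical, not taken from the repository:

import numpy as np
from scipy.stats import wasserstein_distance

# Hypothetical response distributions over a 5-point ordinal scale
# (e.g., "Strongly disagree" = 1 ... "Strongly agree" = 5).
# The example values are illustrative, not from the dataset.
scale = np.arange(1, 6)
human_dist = np.array([0.10, 0.20, 0.30, 0.25, 0.15])  # human survey responses
model_dist = np.array([0.05, 0.15, 0.40, 0.30, 0.10])  # LLM responses

# Wasserstein (earth mover's) distance between the two distributions,
# treating the ordinal scale positions as the support.
distance = wasserstein_distance(scale, scale,
                                u_weights=human_dist, v_weights=model_dist)
print(f"Wasserstein distance: {distance:.4f}")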

Dataset

You can browse the dataset used in the paper:

  • The original and modified questions used in our study can be found here.
  • Original Pew questions were acquired from the OpinionsQA dataset (Santurkar et al. 2023).

You can also view the LLM responses:

  • Raw responses from LLMs are in results/<model>/*.pickle.
  • Formatted responses used by the analysis scripts are in results/<model>/csv/. The script that generates these files from the raw responses is format_results.py (see the example below for loading both formats).
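
As a quick illustration, the snippet below shows one way to load both the raw and formatted responses. The model directory and file names are hypothetical placeholders; substitute the actual files present under results/:

import pickle
import pandas as pd

# Hypothetical paths: replace the model directory and file names with the
# actual contents of results/.
raw_path = "results/<model>/example_run.pickle"
csv_path = "results/<model>/csv/example_run.csv"

# Raw LLM responses are stored as pickle files.
with open(raw_path, "rb") as f:
    raw_responses = pickle.load(f)

# Formatted responses (produced by format_results.py) are CSVs that the
# analysis notebooks consume directly.
formatted = pd.read_csv(csv_path)
print(formatted.head())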
