aidp

The AI Democracy Projects are a collaboration between Proof News and the Science, Technology, and Social Values Lab at the Institute for Advanced Study.

This repository contains data from our pilot of domain-specific safety testing, conducted in January 2024. The testing evaluated five leading AI models’ responses to election-related prompts for bias, accuracy, completeness, and harmfulness, and engaged state and local election officials, AI experts from research and civil society organizations, academics, and journalists. The models tested were Anthropic’s Claude, Google’s Gemini, OpenAI’s GPT-4, Meta’s Llama 2, and Mistral’s Mixtral. For GPT-4-0613 and Claude-2 (Anthropic version 2023-06-01), we used the original provider APIs directly. For the open models, Llama-2-70b-chat-hf and Mixtral-8x7B-Instruct-v0.1, we used Deep Infra, a service that hosts and runs a variety of machine learning models. For Gemini Pro Preview (last updated Dec. 13, 2023), we used a hosting service called OpenRouter.

We divided the roughly 40 experts into teams. Each team of two to six people voted on whether to rate a model's answer to a given prompt as inaccurate, harmful, incomplete, and/or biased.

See our report here.

See our methodology here.

Data Dictionary

Column	Description
group_id	unique id given to every team of raters
user_prompt	text of prompt run through each model
panel_size	number of voting members on rating team associated with group_id
model	AI model name
answer	model's response to user_prompt
inaccurate	number of members of testing team (group_id) who voted to rate the model's answer as inaccurate
harmful	number of members of testing team (group_id) who voted to rate the model's answer as harmful
incomplete	number of members of testing team (group_id) who voted to rate the model's answer as incomplete
biased	number of members of testing team (group_id) who voted to rate the model's answer as biased
comments	comments left by members of the rating team (this category was optional, and the use of comments is inconsistent / not meant to provide insight into every rating)
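
As a rough illustration of how the columns above fit together, the sketch below reads rows in this layout and lists the rating categories that more than half of a panel voted for. The sample rows are made up for illustration, and the strict-majority threshold and the helper names (`majority_flags`, `load_rows`) are assumptions of this example, not something the dataset or our methodology prescribes.

```python
import csv
import io

# Hypothetical two-row sample in the data dictionary's column layout.
# The values are invented for illustration and are not from the dataset.
SAMPLE_CSV = """group_id,user_prompt,panel_size,model,answer,inaccurate,harmful,incomplete,biased,comments
1,Example prompt,4,model-a,Example answer,3,1,2,0,
1,Example prompt,4,model-b,Example answer,1,0,2,0,
"""

RATING_COLUMNS = ("inaccurate", "harmful", "incomplete", "biased")


def majority_flags(row):
    """Return the rating categories that more than half of the panel voted for.

    The strict-majority rule is an assumption made for this sketch.
    """
    panel_size = int(row["panel_size"])
    return [c for c in RATING_COLUMNS if int(row[c]) * 2 > panel_size]


def load_rows(handle):
    """Read rows from an open CSV file handle into a list of dicts."""
    return list(csv.DictReader(handle))


rows = load_rows(io.StringIO(SAMPLE_CSV))
flags_by_model = {row["model"]: majority_flags(row) for row in rows}
print(flags_by_model)
```

To try this on the real data, one could pass a handle from `open(path, newline="", encoding="utf-8")` to `load_rows`, where `path` points at the results CSV in a local checkout. Note that a tie (for example, 2 of 4 votes) does not count as a majority under this rule.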

