Skip to content

The AI Democracy Projects are a collaboration between Proof News and the Science, Technology, and Social Values Lab at the Institute for Advanced Study.

ProofNews/aidp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 

Repository files navigation

aidp

The AI Democracy Projects are a collaboration between Proof News and the Science, Technology, and Social Values Lab at the Institute for Advanced Study.

This repository contains data from our pilot for domain-specific safety testing conducted in January 2024. Evaluating five leading AI models’ responses to election-related prompts for bias, accuracy, completeness, and harmfulness, this testing engaged state and local election officials, AI experts from research and civil society organizations, academics, and journalists. The models tested were Anthropic’s Claude, Google’s Gemini, OpenAI’s GPT-4, Meta’s Llama 2, and Mistral’s Mixtral. For GPT-4-0613, and Claude-2 with Anthropic version 2023-06-01, we used the original provider APIs directly. For the open models — Llama-2-70b-chat-hf and Mixtral-8x7B-Instruct-v0.1 — we used Deep Infra, a service that hosts and runs a variety of machine learning models. For Gemini Pro Preview (last update Dec. 13, 2023) we used a hosting service called OpenRouter.

We divided the roughly 40 experts into teams. Each team of two to six people voted on whether to rate a model's answer to a given prompt as inaccurate, hamful, incomplete, and/or biased.

See our report here.

See our methodology here.

Data Dictionary

Column Description
group_id unique id given to every team of raters
user_prompt text of prompt run through each model
panel_size number of voting members on rating team associated with group_id
model AI model name
answer model's response to user_prompt
inaccurate number of members of testing team (group_id) who voted to identify user_prompt as inaccurate
harmful number of members of testing team (group_id) who voted to identify user_prompt as harmful
incomplete number of members of testing team (group_id) who voted to identify user_prompt as incomplete
biased number of members of testing team (group_id) who voted to identify user_prompt as biased
comments comments left by members of the rating team (this category was optional, and the use of comments is inconsistent / not meant to provide insight into every rating)

About

The AI Democracy Projects are a collaboration between Proof News and the Science, Technology, and Social Values Lab at the Institute for Advanced Study.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published