Skip to content

Codebase for political bias mitigation in ChatGPT using Anthropic's ConstitutionalAI principles

Notifications You must be signed in to change notification settings

3x-dev/ChatGPT-Political-Bias-Mitigation-ConstitutionalAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Unveiling Bias in ChatGPT-3.5: An Analysis of Constitutional AI Principles for Politically Biased Responses

Presented at SCCUR 2023.

Link to paper: (soon)

Abstract

OpenAI's set of GPT models has already been applied to a variety of applications across many industries. In a previous study, we quantified how OpenAI's GPT-3.0 model responses exhibit bias across various political subjects. Our results revealed a statistically significant left-leaning political bias in GPT-3.0’s responses for 9 out of the 11 analyzed political topics. In this research, we employed Anthropic's Constitutional AI principles to mitigate GPT-3.5’s political bias. We conducted a series of tests by applying custom constitutional principles in an attempt to mitigate political bias. We hypothesized that applying Anthropic’s Constitutional AI principles would result in a statistically significant reduction in politically biased responses generated by ChatGPT. Our observations indicated a significant reduction in bias for the “Abortion” and “Racism + Police” topics when using a custom principle with a carefully crafted prompt template. For the other topics, surprisingly, our study did not uncover significant bias reduction in GPT-3.5’s responses. This implies that while constitutional principles can be effective in mitigating biases in certain areas, their application across a broader range of topics requires further refinement and research to achieve consistent results.

About

Codebase for political bias mitigation in ChatGPT using Anthropic's ConstitutionalAI principles

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published