Unveiling Bias in ChatGPT-3.5: An Analysis of Constitutional AI Principles for Politically Biased Responses

Presented at SCCUR 2023.

Link to paper: (soon)

Abstract

OpenAI's set of GPT models has already been applied to a variety of applications across many industries. In a previous study, we quantified how OpenAI's GPT-3.0 model responses exhibit bias across various political subjects. Our results revealed a statistically significant left-leaning political bias in GPT-3.0’s responses for 9 out of the 11 analyzed political topics. In this research, we employed Anthropic's Constitutional AI principles to mitigate GPT-3.5’s political bias. We conducted a series of tests by applying custom constitutional principles in an attempt to mitigate political bias. We hypothesized that applying Anthropic’s Constitutional AI principles would result in a statistically significant reduction in politically biased responses generated by ChatGPT. Our observations indicated a significant reduction in bias for the “Abortion” and “Racism + Police” topics when using a custom principle with a carefully crafted prompt template. For the other topics, surprisingly, our study did not uncover significant bias reduction in GPT-3.5’s responses. This implies that while constitutional principles can be effective in mitigating biases in certain areas, their application across a broader range of topics requires further refinement and research to achieve consistent results.

Name	Name	Last commit message	Last commit date
Latest commit 3x-dev Update README.md Sep 20, 2024 390d2a3 · Sep 20, 2024 History 7 Commits
Custom_Built	Custom_Built	Code update	Dec 12, 2023
Selected_Anthropic_Rules	Selected_Anthropic_Rules	Code update	Dec 12, 2023
old	old	Old code	Dec 12, 2023
README.md	README.md	Update README.md	Sep 20, 2024
all_anthropic_rules.txt	all_anthropic_rules.txt	Code update	Dec 12, 2023
combine.py	combine.py	Code update	Dec 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unveiling Bias in ChatGPT-3.5: An Analysis of Constitutional AI Principles for Politically Biased Responses

Abstract

About

Releases

Packages

Languages

3x-dev/ChatGPT-Political-Bias-Mitigation-ConstitutionalAI

Folders and files

Latest commit

History

Repository files navigation

Unveiling Bias in ChatGPT-3.5: An Analysis of Constitutional AI Principles for Politically Biased Responses

Abstract

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages