Skip to content

CharmaineBarker/ChatGPT-as-a-Text-Simplification-Tool-to-Remove-Bias

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

ChatGPT as a Text Simplification Tool to Remove Bias

The work for a paper submitted to Yorkshire Innovation in Science and Engineering Conference 2023 generated a dataset of Disneyland reviews which is released here.

The sample of 300 reviews was taken from a Kaggle dataset: Disneyland Reviews

Each review was then manually inputted into ChatGPT using the prompt: Simplify "(Review)". The output was then added to a new column in the dataset. The dataset contains the following features:

  • Reviewer_Location- The sensitive attribute which was country of origin of the reviewer. This was filtered to only have 150 people from the United Kingdom and 150 from the United States.
  • American- This is the same as the Reviewer_Location column, but it has been changed into a binary where 1 is United States and 0 is United Kingdom.
  • Review_Text- The actual review data which was simplified.
  • Prompt- This is the text which was used as the input into ChatGPT.
  • Simplified- This is the output from ChatGPT.

A number of tests were completed on this data such as sentiment analysis to check the reviews kept the same meaning and classification of the reviews into one of the two protected characteristics (UK or US). Further Work is also outlined in the paper.

About

The dataset created from the simplification of reviews using ChatGPT

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published