Detecting-Generated-Abstract

Google Drive Link: https://drive.google.com/drive/folders/1KCYFfVvA9dAkhGZYSedBUifSaOM_hu64?usp=sharing

Dr. Jiang Feng.

The Chinese University Of Hong Kong, Shenzhen

jeffreyjiang@cuhk.edu.cn

Introduction

Welcome to this project! You're about to detect text generated by advanced large-scale language models (such as ChatGPT). In this project, you'll explore the fascinating world of natural language processing as you work to identify and analyze text generated by a state-of-the-art language model.

Building a ChatGPT detector is an essential task because as language models like ChatGPT become more advanced, it becomes increasingly difficult to distinguish between text generated by humans and text generated by AI models. This is especially important when the text is used to make crucial decisions in financial markets, healthcare, or legal proceedings.

By developing techniques to detect text generated by ChatGPT, you'll be contributing to the ongoing effort to ensure that language models like ChatGPT are used responsibly and ethically. Your work on this project has important implications for language technology's future and can help prevent the misuse of AI-generated text.

We're excited to see the results of your project and look forward to the new insights you'll discover along the way. Good luck!

Task Definition

Given a dataset of text samples, the goal is to build a detector that can classify each sample as either human-written or AI-generated. The detector will be trained using a pre-trained language model, such as BERT or Roberta.

The whole process will involve the following steps:

Data preparation: The text samples in the dataset will be preprocessed, including tokenization and normalization, to prepare them for input to the pre-trained language model.

Fine-tuning the pre-trained language model: The pre-trained language model will be fine-tuned on text classification, using a labeled dataset of human-written and AI-generated text. The fine-tuning process will involve adjusting the parameters of the pre-trained language model to improve its ability to classify text as human-written or AI-generated.

Evaluation: The fine-tuned language model will be evaluated on a holdout dataset to measure its accuracy, precision, recall, and F1 score in detecting AI-generated text.

Testing: Finally, the fine-tuned language model will be used to classify new text samples as human-written or AI-generated.

By formulating the detection task as a text classification problem and fine-tuning a pre-trained language model on this task, we can build a detector that is capable of accurately detecting AI-generated text.

Related work

The Science of LLM-generated Text Detection gives a comprehensive survey, and you will learn the research line for detecting AI-generated text.

Here are some popular detectors and related papers:

GPT-2 output detector and related paper.
ai-text-classifier and related blog.
GPT-ZERO and related blog.
DetectGPT and related paper.
ChatGPT detector and related paper.

DataSet

Our dataset is sourced from HC3-English.

Since there was no publicly available dataset split mentioned in the paper, we have divided the dataset into a typical 80% training set and 20% test set, with 10% of the training set reserved for validation set.

The preprocessing step is shown in preprocess.py.

You just need to download the all.jsonl into ./dataset from HC3-English.

Baseline

Please note: The sample code below assumes that your machine has a GPU that can be used.

In this project, we reproduce the detector following ChatGPT detector, which is an open-source detector for ChatGPT.

We provided the modified code in this project for training and testing detector model.

You can follow the guideline to reproduce it.

Guideline

The detector is a Roberta for classification model with labels (0: human, 1:ChatGPT).

If you want to train it, follow these steps:

install the environment

pip install -r requirements.txt

One thing you have to pay attention to is you should install the proper version of the torch according to your machine.

prepare the data

Download the all.jsonl into ./dataset

Split the data into train set, validate set and test set:

python preprocess.py

train

python train.py

Please read the code in this file carefully if you want to know how to train and test the model in detail.

get the detector

The best_model.pt is the trained detector.

You can test the custom sample in text_test.txt (only three examples in it):

python test.py

Potential task

If you successfully finish the training, you will get a detector with >99% accuracy in the test set, which is ready to be applied in practice.

However, I suggest you take a further step considering the following issues because there are still many challenges that need to be addressed urgently.

Firstly, there is the issue of interpretability. Although the constructed detector can achieve very high accuracy, it is unclear what features of the AI-generated text it captures. We hope to explain better the reasons for the judgments through methods such as visualizing attention.

Secondly, there is the issue of generalizability. According to our preliminary experiments, even though the detector performs well on the HC3 dataset, its performance drops to around 57% accuracy when we apply it to a dataset generated by a language model trained on a paper polishing task. This indicates that the detector's generalizability is not sufficient. Therefore, we hope to enhance the detector's performance on detections from different datasets and models.

Thirdly, there is the issue of the detector's robustness. The current model can detect text generated entirely by humans and text wholly generated by ChatGPT. However, can the detector still accurately detect it if we manually modify the text generated by ChatGPT? Additionally, we want to investigate whether the detector can indicate the degree of modification.

Furthermore, other critical issues related to the detection of text generated by AI and humans can be explored and addressed if you are interested.

0330 Update

After you reproduced the baseline, I provided the validation set of the polished abstract dataset (abstract) for you to test your model's generalization.

I also suggest you read papers in recent advanced of AI-generated text detection if you are interested.

Some interesting opinions are proposed.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
__pycache__		__pycache__
abstract		abstract
bench1		bench1
cohere-1-100		cohere-1-100
cohere1		cohere1
dataset		dataset
fr-1		fr-1
frd		frd
images		images
report		report
.gitignore		.gitignore
1_to_5372.txt		1_to_5372.txt
5372_to_6185.txt		5372_to_6185.txt
Flan.ipynb		Flan.ipynb
README.md		README.md
answers_base.txt		answers_base.txt
baseline_output.txt		baseline_output.txt
bloom.py		bloom.py
cohere-1-100.txt		cohere-1-100.txt
cohere_.py		cohere_.py
csc3160 (1).ipynb		csc3160 (1).ipynb
csc3160.ipynb		csc3160.ipynb
dolly-1-675.txt		dolly-1-675.txt
dolly-675-775.txt		dolly-675-775.txt
dolly-775-875.txt		dolly-775-875.txt
dolly.py		dolly.py
dolly_1-775.txt		dolly_1-775.txt
dolly_from_1_to_875.txt		dolly_from_1_to_875.txt
extract_expanded.py		extract_expanded.py
extract_questions.py		extract_questions.py
feature.ipynb		feature.ipynb
fr-1.zip		fr-1.zip
huggingchat_manual.txt		huggingchat_manual.txt
image.png		image.png
logs_1683848916371.txt		logs_1683848916371.txt
logs_1683854202086.txt		logs_1683854202086.txt
logs_1683854341743.txt		logs_1683854341743.txt
milestone2.ipynb		milestone2.ipynb
nohup.out		nohup.out
output_5372_to_6185.txt		output_5372_to_6185.txt
output_br_dolly.txt		output_br_dolly.txt
output_fr_5372_to_6185.txt		output_fr_5372_to_6185.txt
output_fr_dolly.txt		output_fr_dolly.txt
package.json		package.json
palm.js		palm.js
paraphrase.py		paraphrase.py
preprocess.py		preprocess.py
preprocess_with_args.py		preprocess_with_args.py
prompt.js		prompt.js
prompt.py		prompt.py
proposal.ipynb		proposal.ipynb
questions_base.txt		questions_base.txt
requirements.txt		requirements.txt
temp.txt		temp.txt
test.py		test.py
test_flight.js		test_flight.js
test_with_args.py		test_with_args.py
text_features.csv		text_features.csv
text_test.txt		text_test.txt
train.py		train.py
wront_output_fine_tune_from_baseline.txt		wront_output_fine_tune_from_baseline.txt
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Detecting-Generated-Abstract

Introduction

Task Definition

Related work

DataSet

Baseline

Guideline

Potential task

0330 Update

About

Releases

Packages

Contributors 4

Languages

l1xiangyi/chatgpt-detector-roberta

Folders and files

Latest commit

History

Repository files navigation

Detecting-Generated-Abstract

Introduction

Task Definition

Related work

DataSet

Baseline

Guideline

Potential task

0330 Update

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages