Scripts for the paper:
Simone Rebora and Gabriele Vezzani, "Models of Literary Evaluation and Web 2.0. An Annotation Experiment with Goodreads Reviews", CHR2024 (PDF)
All ".ipynb" scripts are designed to work with Google Colab. You will need to connect your Drive to the Notebooks and have all datasets saved in the folder "MyDrive/CHR2024".
All ".py" scripts should be run locally (python3 my_script.py
). You will need to install the required packages (pip install -r requirements.txt
) and setup an OpenAI API key.
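A common convention, assumed here rather than verified against the scripts, is to read the key from the `OPENAI_API_KEY` environment variable:

```python
import os
from openai import OpenAI

# Assumption: the key is exported before running the script,
# e.g. `export OPENAI_API_KEY="sk-..."` in your shell.
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
```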
Because of copyright limitations, we cannot share the datasets here. If you are interested in accessing them, please contact us with a description of your intended use.
The scripts correspond to different sections of the paper. Below is a brief description of each script and the section of the paper where it was used.
- Compute_Agreement.ipynb (Paper section 3.2). Script used to calculate inter-annotator agreement (a minimal agreement sketch follows this list).
- Transformers_kfold.ipynb (Paper section 4.1). Script used to fine-tune three Transformer models on the annotated dataset (see the k-fold sketch after this list).
- Transformers_learning_curve.ipynb (Paper section 4.1). Script used to evaluate the performance of the best-performing model with differing amounts of training material.
- Transformers_save_model.ipynb (Paper section 4.1). Script used to save the best-performing model when trained on a selected dataset; the model has been published on HuggingFace (see the publishing sketch below).
- GPT_annotation.py (Paper section 4.2). Script used to annotate the selected dataset with GPT-4 through the OpenAI API (see the annotation sketch below).
- GPT_annotation_fewshot.py (Paper section 4.2). Script used to annotate the selected dataset with GPT-4 through the OpenAI API using a few-shot prompting strategy.
- Evaluate_GPT_prompt_engineering.ipynb (Paper section 4.2). Script used to evaluate the performance of different GPT-4 prompting strategies on the selected dataset (see the evaluation sketch below).
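As a rough illustration of the computation in Compute_Agreement.ipynb: a minimal agreement sketch, assuming two annotators and categorical labels (the notebook's actual metric and data format may differ; `labels_a` and `labels_b` are hypothetical names):

```python
from sklearn.metrics import cohen_kappa_score

# Hypothetical annotations: one categorical label per review, two annotators.
labels_a = ["positive", "negative", "positive", "neutral"]
labels_b = ["positive", "negative", "neutral", "neutral"]

kappa = cohen_kappa_score(labels_a, labels_b)
print(f"Cohen's kappa: {kappa:.3f}")
```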
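Transformers_kfold.ipynb follows the usual cross-validated fine-tuning pattern; the sketch below shows its general shape with the Hugging Face Trainer, assuming a DataFrame with `text` and `label` columns (model name, file name, and hyperparameters are illustrative, not the notebook's actual settings). Transformers_learning_curve.ipynb varies the training-set size in a similar loop.

```python
import numpy as np
import pandas as pd
from datasets import Dataset
from sklearn.model_selection import StratifiedKFold
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

df = pd.read_csv("annotated_reviews.csv")  # hypothetical file with text/label columns
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # illustrative model

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)

scores = []
kfold = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for train_idx, test_idx in kfold.split(df, df["label"]):
    train_ds = Dataset.from_pandas(df.iloc[train_idx]).map(tokenize, batched=True)
    test_ds = Dataset.from_pandas(df.iloc[test_idx]).map(tokenize, batched=True)
    model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="out", num_train_epochs=3, report_to="none"),
        train_dataset=train_ds,
        eval_dataset=test_ds,
    )
    trainer.train()
    # Accuracy on the held-out fold; the paper may report other metrics.
    preds = np.argmax(trainer.predict(test_ds).predictions, axis=-1)
    scores.append((preds == df.iloc[test_idx]["label"].to_numpy()).mean())

print("mean accuracy across folds:", np.mean(scores))
```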
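For Transformers_save_model.ipynb, publishing a fine-tuned model typically ends with a push to the Hugging Face Hub; a minimal sketch (the checkpoint path and repository id below are placeholders, not the authors' actual model id):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Hypothetical path to the best checkpoint produced during training.
model = AutoModelForSequenceClassification.from_pretrained("out/best_checkpoint")
tokenizer = AutoTokenizer.from_pretrained("out/best_checkpoint")

# Requires prior authentication, e.g. `huggingface-cli login`.
model.push_to_hub("your-username/goodreads-evaluation-model")  # placeholder repo id
tokenizer.push_to_hub("your-username/goodreads-evaluation-model")
```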
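GPT_annotation.py and GPT_annotation_fewshot.py call the OpenAI API; the sketch below shows the general shape of such a call, with a hypothetical prompt and label set (the actual prompts, labels, and few-shot examples used in the paper are not reproduced here):

```python
import os
from openai import OpenAI

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

# Hypothetical few-shot examples; the zero-shot variant simply omits them.
few_shot = [
    {"role": "user", "content": "Review: 'A gripping plot, beautifully written.'"},
    {"role": "assistant", "content": "evaluative"},
]

def annotate(review_text: str) -> str:
    messages = (
        # Hypothetical instruction and label set, for illustration only.
        [{"role": "system", "content": "Label each Goodreads review as 'evaluative' or 'non-evaluative'."}]
        + few_shot
        + [{"role": "user", "content": f"Review: {review_text!r}"}]
    )
    response = client.chat.completions.create(model="gpt-4", messages=messages, temperature=0)
    return response.choices[0].message.content.strip()

print(annotate("I could not put this book down."))
```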
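Finally, Evaluate_GPT_prompt_engineering.ipynb compares strategies against the gold annotations; a minimal evaluation sketch, assuming each strategy's predictions and the gold labels are aligned lists (all names and values are illustrative):

```python
from sklearn.metrics import f1_score

gold = ["evaluative", "non-evaluative", "evaluative"]  # hypothetical gold labels
predictions = {
    "zero-shot": ["evaluative", "evaluative", "evaluative"],
    "few-shot": ["evaluative", "non-evaluative", "evaluative"],
}

for strategy, preds in predictions.items():
    print(strategy, f1_score(gold, preds, average="macro"))
```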