This repository contains code, data, and documentation for research on the impact of prompt engineering on large language models in the field of argument mining.
batch_api/– Data for the OpenAI Batch APIinput/– Input dataoutput/– Output data
data/– Datasetsoriginal/– Argument-Annotated-Essays dataset (Version 2) by Stab and Gurevych (2017a), downloaded heretransformed/– Processed data
graphs/– Visualizations and related notebooksprompts/– Used promptsbuilding-blocks/– Prompt building blocksfinal-prompts/– Final prompt variants
report/– Data-based reportsrc/– Python modules and helper functions1.EDA.ipynb– Exploratory data analysis2.data-transformation.ipynb– Data processing3.llm.ipynb– Prompt creation & LLM queries4.evaluation.ipynb– Evaluation of LLM resultsrequirements.txt– Dependencies
It is recommended to use a virtual environment (e.g., Anaconda). Install the dependencies as follows:
conda create -n argument-mining python=3.12 -y
conda activate argument-mining
pip install -r requirements.txtTo use the GPT-4o mini model via the OpenAI Batch API, an OpenAI API key is required. Store this in a .env file as OPENAI_API_KEY. More information about the API and key can be found here.
For questions or feedback:
- Email: datadrvn.ai@gmail.com