Human Study Legal Verdict Classification Analysis

Overview

This repository contains the Python scripts used to analyse the results of a human study involving the classification of case verdicts pertaining to Article 6 (Right to a fair trial) from the ECHR (European Convention on Human Rights) based on descriptions of the case facts. Participants were tasked with making a prediction of the correct verdict after reading the 'circumstances of the case' and again after reading the 'relevant legal framework', and these predictions are compared against the actual decisions taken by the ECtHR (European Court of Human Rights). The python scripts provide detailed statistical analyses, focusing on performance and confidence metrics. Below are descriptions of each script along with usage instructions.

Citation of Resource

If you use the datasets provided in this repository, or if you want to gain a deeper understanding of the context and methods behind the creation of these datasets and their analysis, please refer to the following paper due for publication at JURIX 2023:

Mumford J, Atkinson K, Bench-Capon T. (2023). Human Performance on the AI Legal Case Verdict Classification Task. In Proceedings of the Thirty-Sixth International Conference on Legal Knowledge and Information Systems. (due for publication)

Scripts Description

s7_performance_based_analysis_v1_6.py

This script performs an analysis based on the performance metrics derived from the study data. While the script's functionality and details of the analyses performed are proprietary, it generally handles:

Data preprocessing including null value handling and type conversions.
Performing various statistical analyses to extract insights from the data.
Visual representation of the analyses in the form of scatter plots.

Usage

To use the script, execute it in a Python environment where necessary libraries are installed. The script can be configured to perform different analyses based on user input.

s8_confidence_based_analysis_v1_1.py

This script focuses on analyzing the confidence metrics gathered during the study. The functionalities it offers include:

Reading and preprocessing survey response data.
Mapping categorical responses to numerical values to facilitate analysis.
Statistical analyses to understand different confidence aspects such as early and final confidence, influence of models, and domain knowledge on confidence levels.
Pairwise comparisons using Tukey HSD test among others.

Usage

Execute the script in a Python environment where the necessary libraries (mentioned at the beginning of the script) are installed. The script can be configured to perform different analyses based on user input.

Data Files

The necessary participant classification data files for running the scripts are found in the six subdirectories nested in the 'Analysis/Processed' path. The names of these subdirectories indicate the participant group that produced the datasets contained within where directory names containing the string:

'Alpha' indicate the group was not provided the domain knowledge model (ADM);
'Omega' indicate the group was provided with the domain knowledge model;
'CS' pertain to computer science student outputs (denoted as the 'Weak' domain knowledge group in the associated peer-reviewed paper);
'Law' pertain to general law students without ECHR module study (denoted as the 'Moderate' group in the associated peer-reviewed paper);
'Domain' pertain to law students with ECHR module study (denoted as the 'Strong' group in the associated peer-reviewed paper)

The model groups' final quiz responses are found in model_2nd_quiz_responses.xlsx, and the debrief survey responses for all participants are found in survey_responses.xlsx. Please ensure to maintain the structure of the repository for the scripts to function correctly. The file Article6_ADM.pdf is not necessary for executing any of the scripts, but contains the ADM in the format provided to those participant groups that were permitted access to the domain model.

Installation

Ensure you have a Python environment set up with the following libraries installed:

matplotlib
numpy
pandas
scipy
statsmodels

You can install the required packages using the following command:

pip install matplotlib numpy pandas scipy statsmodels

Acknowledgements

A big thank you to all participants of the study, whose valuable input made this analysis possible. The study was supported by Research England under the Policy Support Funding stream.

Contact

For any questions or support, please contact Dr Jack Mumford at jack [dot] mumford [at] liverpool [dot] ac [dot] uk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Study Legal Verdict Classification Analysis

Overview

Citation of Resource

Scripts Description

s7_performance_based_analysis_v1_6.py

Usage

s8_confidence_based_analysis_v1_1.py

Usage

Data Files

Installation

Acknowledgements

Contact

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
Analysis/Processed		Analysis/Processed
Article6_ADM.pdf		Article6_ADM.pdf
LICENSE		LICENSE
README.md		README.md
model_2nd_quiz_responses.xlsx		model_2nd_quiz_responses.xlsx
s7_performance_based_analysis_v1_6.py		s7_performance_based_analysis_v1_6.py
s8_confidence_based_analysis_v1_1.py		s8_confidence_based_analysis_v1_1.py
survey_responses.xlsx		survey_responses.xlsx

License

jamumford/Human_Legal_Verdict_Prediction

Folders and files

Latest commit

History

Repository files navigation

Human Study Legal Verdict Classification Analysis

Overview

Citation of Resource

Scripts Description

s7_performance_based_analysis_v1_6.py

Usage

s8_confidence_based_analysis_v1_1.py

Usage

Data Files

Installation

Acknowledgements

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages