TüReuth Legal at SemEval 2023 Task 6 Code

This repository contains the code to replicate experiments for our participation at SemEval Task 6: LegalAI. Note that we only participate in subtask (A), predicting Rhetorical Roles.

Usage

Create Folder data
Place 3 json files called train.json, dev.json, and test.json in folder data. These files are provided by the shared task organisers, but have to be renamed accordingly
Run python main.py. This will train and save all models and output test predictions in file test_predictions.pickle.
Run python make_test_predictions.py. This will create a file called RR_TEST_DATA_FS.json which contains test set predictions in the right format for submission to the shared task.

Note that training MLPs and fine-tuning LMs requires GPU and internet access to download models from huggingface model hub.

Requirements

torch (with GPU support)
transformers
datasets
nltk
numpy
scipy
pandas
tqdm

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
ablation.py		ablation.py
blocked_word_mapping.py		blocked_word_mapping.py
data_preparation.py		data_preparation.py
evaluate_classifiers.py		evaluate_classifiers.py
evaluation.py		evaluation.py
main.py		main.py
make_test_predictions.py		make_test_predictions.py
train_sentence_classifiers.py		train_sentence_classifiers.py
transition_matrix.py		transition_matrix.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TüReuth Legal at SemEval 2023 Task 6 Code

Usage

Requirements

About

Releases

Packages

Languages

LGirrbach/Tuereuth-Legal-at-SemEval-Task-6

Folders and files

Latest commit

History

Repository files navigation

TüReuth Legal at SemEval 2023 Task 6 Code

Usage

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages