Discovering the Rationale of Decisions

This repository contains a set of experiments aimed at evaluating the rationales of machine learning systems in different legal settings.

Abstract

In AI and law, systems that are designed for decision support should be explainable when pursuing justice. In order for these systems to be fair and responsible, they should make correct decisions and make them using a sound and transparent rationale. In this paper, we introduce a knowledge-driven method for model-agnostic rationale evaluation using dedicated test cases, similar to unit-testing in professional software development. We apply this new quantitative human-in-the-loop method in a set of machine learning experiments aimed at extracting known knowledge structures from artificial datasets from a real-life legal setting. We show that our method allows us to analyze the rationale of black box machine learning systems by assessing which rationale elements are learned or not. Furthermore, we show that the rationale can be adjusted using tailor-made training data based on the results of the rationale evaluation.

Experiments

All three experiments can be found in the following Jupyter notebooks:

wb_replication.ipynb: Welfare benefit experiment (Bench-Capon replication)
wb_simplified.ipynb: Simplified welfare benefit experiment
tort_law.ipynb: Experiment on tort law

Results

accuracies: The accuracies of our models across the three domains
plots: The plots for the Age-Gender and the Patient-Distance conditions
comparing_lime_shap: Contains a comparison between LIME and SHAP explanations. For each instance in 'test_dataset.csv' of the Welfare Benefit domain, we provide an explanation using SHAP and LIME, using the neural networks with 1, 2 or 3 hidden layers, trained on either the tailored or regular training dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
datasets		datasets
results		results
README.md		README.md
generate_data.ipynb		generate_data.ipynb
neural_networks.py		neural_networks.py
simplified_dataset.py		simplified_dataset.py
tort_dataset.py		tort_dataset.py
tort_law.ipynb		tort_law.ipynb
wb_dataset.py		wb_dataset.py
wb_dataset_helpers.py		wb_dataset_helpers.py
wb_replication.ipynb		wb_replication.ipynb
wb_simplified.ipynb		wb_simplified.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

datasets

datasets

results

results

README.md

README.md

generate_data.ipynb

generate_data.ipynb

neural_networks.py

neural_networks.py

simplified_dataset.py

simplified_dataset.py

tort_dataset.py

tort_dataset.py

tort_law.ipynb

tort_law.ipynb

wb_dataset.py

wb_dataset.py

wb_dataset_helpers.py

wb_dataset_helpers.py

wb_replication.ipynb

wb_replication.ipynb

wb_simplified.ipynb

wb_simplified.ipynb

Repository files navigation

Discovering the Rationale of Decisions

Abstract

Experiments

Results

Papers

About

Releases

Packages

Languages

CorSteging/DiscoveringTheRationaleOfDecisions

Folders and files

Latest commit

History

Repository files navigation

Discovering the Rationale of Decisions

Abstract

Experiments

Results

Papers

About

Resources

Stars

Watchers

Forks

Languages