Skip to content

CorSteging/DiscoveringTheRationaleOfDecisions

Repository files navigation

Discovering the Rationale of Decisions

This repository contains a set of experiments aimed at evaluating the rationales of machine learning systems in different legal settings.

Abstract

In AI and law, systems that are designed for decision support should be explainable when pursuing justice. In order for these systems to be fair and responsible, they should make correct decisions and make them using a sound and transparent rationale. In this paper, we introduce a knowledge-driven method for model-agnostic rationale evaluation using dedicated test cases, similar to unit-testing in professional software development. We apply this new quantitative human-in-the-loop method in a set of machine learning experiments aimed at extracting known knowledge structures from artificial datasets from a real-life legal setting. We show that our method allows us to analyze the rationale of black box machine learning systems by assessing which rationale elements are learned or not. Furthermore, we show that the rationale can be adjusted using tailor-made training data based on the results of the rationale evaluation.

Experiments

All three experiments can be found in the following Jupyter notebooks:

  • wb_replication.ipynb: Welfare benefit experiment (Bench-Capon replication)
  • wb_simplified.ipynb: Simplified welfare benefit experiment
  • tort_law.ipynb: Experiment on tort law

Results

  • accuracies: The accuracies of our models across the three domains
  • plots: The plots for the Age-Gender and the Patient-Distance conditions
  • comparing_lime_shap: Contains a comparison between LIME and SHAP explanations. For each instance in 'test_dataset.csv' of the Welfare Benefit domain, we provide an explanation using SHAP and LIME, using the neural networks with 1, 2 or 3 hidden layers, trained on either the tailored or regular training dataset.

Papers

About

Discovering the Rationale of Decisions

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages