This repository contains the data and code for the paper Contrastive Error Attribution for Finetuned Language Models
The repository has a folder for each of the experiments in the paper -- XSum canaries, NYT hallucinations, and E2E semantic errors.
For each of these experiments, download the artifacts using the link provided in the README for each experiment, and follow the steps in the notebook to run the code.