FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding

Authors:

FairFlow Paper: EMNLP 2024, Preprint

Overview

We propose FairFlow, a novel debiasing method that mitigates dataset biases (spurious correlations / shortcuts) of Language Models (LMs). It formulates debiasing with Undecided Learning -- when facing biased / corrupted examples, a robust LM should remain undecided about the label. FairFlow creates biased samples through a series of explicit and implict perturbations and encourages LMs to be predict uniform distribution across classes for biased samples while maintaining the correct label for intact inputs. FairFlow is effective across a wide range of biases (OOD, stress), tasks (NLI, Paraphrase Identification, Relation Classification), architectures (Bert, Roberta, GPT2, DeBerta).

Key idea of Undecided Learning and FairFlow

We observe that when biased LMs make wrong predictions on biased examples, they tend to be over-confident. Therefore, we propose Undecided Learning, which encourages LMs to be undecided about the labels of biased inputs.

Overview of FairFlow. Built upon this formulation, FairFlow debiases LMs with two steps. First, the input samples are corrupted to be biased samples without enough information for a model to determine the label. We design a series of perturbation operations, including explicit and implicit ones. Explicit perturbations directly work on the input text, targetting explicit biases that are easily perceivable by human, such as dropping constituents, shuffling. While implicit perturbations does not directly work on the input text, such as deactivating models, zeroing out representations, targetting implicit biases that are not perceivable by human, unknown, or undefined. Then, the LM is encouraged to predict uniform distribution across classes for biased / corrupted inputs, while predict the correct labels for intact inputs, through the proposed Debiasing Contrastive Loss.

Code

I'm currently working on cleaning the code.

Citation

If you find FairFlow useful for your research, please consider citing this paper:

@inproceedings{cheng-amiri-2024-fairflow,
    title = "{F}air{F}low: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding",
    author = "Cheng, Jiali  and
      Amiri, Hadi",
    editor = "Al-Onaizan, Yaser  and
      Bansal, Mohit  and
      Chen, Yun-Nung",
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.emnlp-main.1225",
    pages = "21960--21975",
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding

Authors:

FairFlow Paper: EMNLP 2024, Preprint

Overview

Key idea of Undecided Learning and FairFlow

Code

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding

Authors:

FairFlow Paper: EMNLP 2024, Preprint

Overview

Key idea of Undecided Learning and FairFlow

Code

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages