Fair-Over-Sampling

Description

This repository provides source code and data to implement the Fair Over-Sampling (FOS) method for bias mitigation, as described in the paper, "Towards Bridging Algorithmic Fairness and Imbalanced Learning."

Dependencies

The source code is built on IBM's AIF360 toolkit, which can be found at AIF360. We recommend that you follow the instructions at AIF360 to either pip install the software or clone it in a separate conda environment. AIF360 requires Tensorflow to implement certain of its features (we used TF ver. 2.6.0). In addition, we used the following python libraries:

Python v. 3.7.0
Numpy v. 1.19.5
Pandas v. 1.3.3
Scikit Learn v. 0.24.2

Data

We have included data to run FOS on the German Credit, Adult Census, and Compas Two-Year Recidivism datasets. The data can be found in the data folder located in this repository. The data should be downloaded and placed in the ../data/ folder. The orginal datasets can be found at:

How to Run Fair Over-Sampling

We have included a version of Fair Over-Sampling (FOS) that is intended to be used with standard classifiers (e.g., scikit learn's Support Vector Machines or Logistic Regression). The main file for running FOS is FOS_main.py.
The basic steps to run FOS are:

Select the AIF360 dataset that you would like to run (e.g., Adult Census, German Credit, or Compas) by commenting or uncommenting the respective lines in FOS_main.py.
Select a classifier (e.g., SVM or LG).
Input a link to the respective data folder that is saved on your local machine.
Run the file (FOS_main.py).

Related python files are:

Fair_OS.py contains the FOS algorithm.
common_utils.py generates useful metrics, including fair utility.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
FOS_main.py		FOS_main.py
Fair_OS.py		Fair_OS.py
LICENSE		LICENSE
README.md		README.md
common_utils.py		common_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

FOS_main.py

FOS_main.py

Fair_OS.py

Fair_OS.py

LICENSE

LICENSE

README.md

README.md

common_utils.py

common_utils.py

Repository files navigation

Fair-Over-Sampling

Description

Dependencies

Data

How to Run Fair Over-Sampling

About

Releases

Packages

Languages

License

dd1github/Fair-Over-Sampling

Folders and files

Latest commit

History

Repository files navigation

Fair-Over-Sampling

Description

Dependencies

Data

How to Run Fair Over-Sampling

About

Resources

License

Stars

Watchers

Forks

Languages