scikit-learn-compatible API #134

hoffmansc · 2019-12-20T15:52:48Z

Progress towards #58

Old API:

Minor changes to old API mostly to aid reproducibility.
Fixed bug in CalibratedEqOddsPostprocessing -- GFNR is weighted by base_rate not 1-base_rate
New Sphinx docs layout
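
To illustrate the CalibratedEqOddsPostprocessing fix above, here is a minimal sketch of a weighted cost in which the generalized false negative rate (GFNR) is weighted by the base rate and the generalized false positive rate (GFPR) by one minus the base rate. This is not the library's code: the helper names, the `fp_rate`/`fn_rate` weight parameters, and the omission of any normalization are assumptions made for illustration.

```python
import numpy as np


def generalized_fpr(y_true, probas):
    """Mean predicted score over the true negatives."""
    y_true = np.asarray(y_true)
    probas = np.asarray(probas, dtype=float)
    return float(probas[y_true == 0].mean())


def generalized_fnr(y_true, probas):
    """Mean of (1 - predicted score) over the true positives."""
    y_true = np.asarray(y_true)
    probas = np.asarray(probas, dtype=float)
    return float((1.0 - probas[y_true == 1]).mean())


def weighted_cost(y_true, probas, fp_rate=1.0, fn_rate=1.0):
    """Hypothetical cost: GFPR weighted by (1 - base_rate), GFNR by base_rate."""
    base_rate = float(np.asarray(y_true).mean())  # fraction of positives
    return (fp_rate * generalized_fpr(y_true, probas) * (1.0 - base_rate)
            + fn_rate * generalized_fnr(y_true, probas) * base_rate)


# Toy example with one positive and three negatives (base rate 0.25).
y_true = [1, 0, 0, 0]
probas = [0.8, 0.2, 0.4, 0.3]
cost = weighted_cost(y_true, probas)
```

With an imbalanced base rate as in the toy data, swapping the two weights changes the cost, which is why the weighting bug mattered.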

New API:

4 datasets (Adult, German, Bank, COMPAS) in DataFrame format with protected attributes in the index
- Automatically downloads from openml.org -- closes #53 (Use OpenML and sklearn.datasets.fetch_openml for datasets)
6 group fairness metrics as functions (statistical_parity_difference, disparate_impact_ratio, equal_opportunity_difference, average_odds_difference, average_odds_error, between_group_generalized_entropy_error)
2 individual fairness metrics as functions (generalized_entropy_index and its variants, consistency_score)
5 additional metrics as functions (specificity_score, base_rate, selection_rate, generalized_fpr, generalized_fnr)
3 algorithms (Reweighing, AdversarialDebiasing, CalibratedEqualizedOdds)
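
As a rough illustration of the ideas in the list above, the sketch below builds a toy DataFrame that carries a protected attribute in its index and computes two group fairness metrics as plain functions. It does not use the functions added in this PR: the `_sketch` helpers, their parameter names, and the toy data are all assumptions, and only the standard definitions (unprivileged rate minus, or divided by, privileged rate) are relied on.

```python
import numpy as np
import pandas as pd

# Toy data; the column names and values are illustrative only.
df = pd.DataFrame({
    "sex": ["Male", "Male", "Female", "Female", "Male", "Female"],
    "hours": [40, 50, 38, 45, 60, 20],
    "label": [1, 1, 0, 1, 0, 0],
})

# Keep 'sex' as a feature column and also carry it in the index, so the
# group membership travels with both X and y.
X = df.set_index("sex", drop=False)
y = X.pop("label")


def _group_rates(y, prot_attr, priv_group):
    """Return (unprivileged, privileged) favorable-outcome rates."""
    groups = y.index.get_level_values(prot_attr)
    priv = np.asarray(groups == priv_group)
    values = y.to_numpy()
    return values[~priv].mean(), values[priv].mean()


def statistical_parity_difference_sketch(y, prot_attr, priv_group):
    """Unprivileged selection rate minus privileged selection rate."""
    unpriv_rate, priv_rate = _group_rates(y, prot_attr, priv_group)
    return float(unpriv_rate - priv_rate)


def disparate_impact_ratio_sketch(y, prot_attr, priv_group):
    """Unprivileged selection rate divided by privileged selection rate."""
    unpriv_rate, priv_rate = _group_rates(y, prot_attr, priv_group)
    return float(unpriv_rate / priv_rate)


spd = statistical_parity_difference_sketch(y, "sex", "Male")
di = disparate_impact_ratio_sketch(y, "sex", "Male")
```

Storing the protected attribute in the index means a metric only needs `y` and the name of the index level, rather than a separate groups array.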

animeshsingh · 2020-02-07T01:21:44Z

Thanks @hoffmansc. Maybe you can schedule a 30-minute review with @Tomcli and me?

hoffmansc · 2020-02-07T18:13:47Z

Sure. Sent an invite.

aif360/algorithms/postprocessing/calibrated_eq_odds_postprocessing.py

aif360/algorithms/inprocessing/adversarial_debiasing.py

aif360/datasets/adult_dataset.py

aif360/sklearn/datasets/openml_datasets.py

aif360/sklearn/utils.py

aif360/sklearn/metrics/metrics.py

aif360/sklearn/postprocessing/calibrated_equalized_odds.py

aif360/sklearn/postprocessing/__init__.py

docs/Makefile

examples/sklearn/demo_new_features.ipynb

tests/sklearn/test_adversarial_debiasing.py

animeshsingh · 2020-02-18T02:28:04Z

cc @adrinjalali

Adrin, this PR originates from the original request in #58.

It would be great to get your feedback and review on this, as well as your thoughts on how we can target this toward the scikit-learn community.

animeshsingh · 2020-02-18T03:14:50Z

@hoffmansc @nrkarthikeyan it would be best to get this merged if the remaining issues are non-blockers; we can then come back with refinements.

* added one-hot encoding example and random_states to demo notebook
* added 'prefit' option to PostProcessingMeta
* multiple fixes to docstring wordings
* added additional links/disclaimers in docstrings
* renamed CalibratedEqualizedOdds args to X and y

Samuel Hoffman and others added 30 commits December 19, 2019 12:49

Initial sklearn-compatible datasets and metrics

8cfa9de

added initial dataset tests

1f4ae57

fixed to_list for older pandas versions

2aef3fc

added metrics tests

2b1799a

added README and docs

9da5abd

simpler dataset loading and 'groups' for metrics

025ecc1

* dataset loading is more similar to sklearn.datasets
* label binarization is now done outside standardize_dataset
* metrics use 'groups' and 'priv_group' to signify priv/unpriv split

fixes to categoricals

8e96177

fixes for tests, updated README

8abb897

added travis badge to README

15a8eb2

updated todo with external blockers

3f594a4

added reweighing workaround to example

7754b32

added Reweighing algorithm

17b0c95

clean up comments

cc9246f

fixed package version in docs

8c58f65

adding hyperlinks to SLEPs

1e7899c

added binary_age opt to german; fixed NAs in bank

c1c1e40

modified onehot_transformer to return DataFrame

93a7cdf

tweaks to reweighing to conform with sklearn

8e52268

updated README

0183449

fixed docstring formatting

89b4a79

changed metrics to use prot_attr

d57b6df

added __all__ to __init__s

d8958bb

updated notebook with reweighing example

0bd3837

initial adversarial debiasing port

4107dd7

multiclass/multigroup support for adv debiasing

df85e42

fix build errors

d2d0ddc

Add ensure_binary option to check_groups

7a2414a

numeric_only converts index and label as well

aac9954

changed Reweighing to return X, sample_weight

dc317cf

removed Reweighing.sample_weight_ attribute

made sample_weight optional in check_inputs

0f184c3

hoffmansc added 10 commits December 19, 2019 12:56

docstrings and add alpha=sqrt(global_step) option

0cbc3f4

docstrings and input is now predict_proba output

8be6449

also added score function (compute weighted cost)

moved tests to main test folder

994bdf0

more docs and formatting changes

372e111

postprocessor takes DataFrame if use_proba

8d10893

added additional tests to check this

readme changes overwritten in the merge

e0ff2b6

train, test were swapped for adult

a2cd77e

remove branch mentions

ee7f23c

remove "attributes" line if none present

c8154ec

moved example to main folder

7ef94e7

hoffmansc requested review from animeshsingh and nrkarthikeyan January 10, 2020 19:53

hoffmansc added 5 commits January 31, 2020 16:40

use_proba -> needs_proba

c5af647

fixed/renamed/reordered/added some attributes

042bb12

* fixed German 'age' from being dropped
* renamed two_year_recid labels to 'Survived' and 'Recidivated' to match ProPublica article
* reordered COMPAS categories to 'Male' < 'Female'
* added 'foreign_worker' protected attribute for German

fixed sample_weight=None bug and classes_ typo

ff9e70c

improved specificity_score and added fpr/fnr error

57b2ab5

made foreign_worker and education (bank) ordered

8fdd6dc

hoffmansc force-pushed the sklearn-compat branch from 5ded679 to 8fdd6dc February 6, 2020 17:57

nrkarthikeyan reviewed Feb 13, 2020

aif360/algorithms/postprocessing/calibrated_eq_odds_postprocessing.py

nrkarthikeyan reviewed Feb 13, 2020

aif360/algorithms/inprocessing/adversarial_debiasing.py

nrkarthikeyan reviewed Feb 14, 2020

hoffmansc added 2 commits February 19, 2020 17:09

added comments to tests

789e96b

nrkarthikeyan self-requested a review February 19, 2020 23:32

nrkarthikeyan approved these changes Feb 19, 2020

hoffmansc merged commit 1002610 into master Feb 19, 2020
