-
Notifications
You must be signed in to change notification settings - Fork 839
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Executing the Credit Risk notebook does not generate a de-biased dataset #51
Comments
I [almost] concur. I get 0.11 vs 0.21. Is the input data changing? What's the source of these different results? It may be a good idea to have some of these as tests or doctests in the docstrings (as examples). |
@nrkarthikeyan Can you advise on this? Just sent an email cc'ing the AIOS team. Need this fixed so we can show integration. |
I know the German credit risk dataset is small, but that doesn't explain the odd behavior of AIF360's notebook. E.g. I looped through a hundred different splits of that dataset looking for one that would de-bias properly. In the process I found that for EVERY split of test/train, AIF360 actually generates a more biased dataset than the initial one. Something else is wrong here. Here's my notebook showing this bug: tutorial_credit_scoring_merged.ipynb.zip |
The warning shown in the notebook might also be significant. The preprocessing sets the privileged and unprivileged groups, (gender and race I guess), and then in the notebook when the user tries to set the privileged/unprivileged group, it's ignored as a result. That would quite change the results. |
Hi all, the optimized pre-processing (used in the original credit scoring tutorial) has a lot of randomness built into it, which will create issues with small datasets. I suggest that we use re-weighing pre-processing to circumvent this issue. Please see the attached notebook: The key thing to keep in mind is that reweighing pre-processing works by changing the instance level weights (this is available in dataset.instance_weights). So, the classifier trained on the debiased data should be capable of handling instance level weights. Let me know what you folks think. |
Alternatively, we could run it on the Adult dataset which seems to be much more stable and effective. |
Any update on fixing this? We've a code pattern on developer.ibm.com that uses this notebook, and it shows that using AIF360 makes fairness worse. |
We have modified the tutorial and this issue is fixed. |
Port changes from Trusted-AI/AIF360#51 Switch to Reweighing pre-processing alogrithm. Closes: #25
Port changes from Trusted-AI/AIF360#51 Switch to Reweighing pre-processing alogrithm. Closes: #25
Executing the Credit Risk Notebook does not generate a de-biased dataset. The results below are from a brand now GIT pull from the AIF360 repo. As shown at the end, the new "debiased" model now over twice as biased as the original model:
The text was updated successfully, but these errors were encountered: