A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
-
Updated
Jul 5, 2024 - Python
A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
Learning to Split for Automatic Bias Detection
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
Tools for diagnostics and assessment of (machine learning) models
"Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color" (ICCV 2023)
Official code of "Discover and Mitigate Unknown Biases with Debiasing Alternate Networks" (ECCV 2022)
Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"
Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference.
Official code of "Discover the Unknown Biased Attribute of an Image Classifier" (ICCV 2021)
This repository contains a console-interface name-ethnicity classifier
"Learning Stable Classifiers by Transferring Unstable Features" ICML 2022
Scan your AI/ML models for problems before you put them into production.
A program to automate testing open source LLMs for their political compass scores
Our submission to the SemEval2019 shared task on Hyperpartisan News Detection.
Author Bias Computation and Scientometric Plotting
Using open source NLP apis and sentiment analysis to create a bias detection web extension.
Evaluation of the quality of LLM geo knowledge
Scripts used for development of my Master's Thesis "Analyzing Twitter data to discover gender biases in Spanish politics" at Universitat Politècnica de Catalunya and Barcelona Institute of International Studies
Add a description, image, and links to the bias-detection topic page so that developers can more easily learn about it.
To associate your repository with the bias-detection topic, visit your repo's landing page and select "manage topics."