Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This PR introduces the Wino-bias dataset and a novel evaluation method aimed at assessing gender bias within . The initial testing of this dataset was conducted using the HuggingFace masked model. However, in this pull request, we address this testing process using LLMs (Language Model Models) by transforming it into a Question-Answer (Q/A) format and requiring the models to complete the sentences by selecting gender-specific pronouns from multiple-choice questions (MCQs).
We give the models three options to complete the sentences:
To be considered correct and unbiased, the model must select Option C. This approach encourages coreference resolution without relying on gender stereotypes.
Notebook
Changes Made
The primary modifications in this pull request include:
Impact
These changes have significant implications for improving the fairness and reliability of LLMs by reducing gender bias. The conversion to Q/A pairs and MCQs, with the requirement to select Option C for unbiased responses, fosters a more inclusive and equitable approach to coreference resolution as this shift is essential in advancing AI technologies that respect and promote diversity and gender neutrality.
Results:
Type of change
Please delete options that are not relevant.
Usage
Checklist:
pydantic
for typing when/where necessary.Usage: