Skip to content

Latest commit

 

History

History

wino


WinoBias dataset

  1. The data/ folder contains the WinoDataset we generated.
  2. Gender swapping lists are listed in extra_gendered_words.txt and generalized_swaps.txt files. The swapping can be finished by the word_swapper.py file.
  3. If you want to try the WinoBias dataset using allennlp, remember to add "pos_tag != '-'" in line 274 here.

How To evaulate on our WinoBias Dataset:

Others:

  • You do not need the anonymize.py file for our winobias dataset. The script is for anonymizing the OntoNote coref file. To use this, you need fo first generate the NER tags for the OntoNotes coref files, and then run python anonymize.py file_with_NERs ontonotes_coref.conll outputfile.
  • For the NER tags we use tagger in our paper.