Skip to content
Code for "Bleaching Text: Abstract Features for Cross-lingual Gender Prediction" (van der Goot et al., 2018)
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data_humans
humanData
runs
scripts
src
.gitignore
README.md
embeds-scripts.tar.gz
todo

README.md

bleaching-text

Required python packages:

emoji (we used version 0.4.5)
sklearn (we used version 0.19.1)
numpy (we used version 1.14.2)

To reproduce results from the paper (note that they are also already included in the runs folder):

./scripts/runAll.sh
parallel -j 16 < run.sh 

Note: the data is available upon request - email us.

To generate the tables from the paper:

./scripts/getResults.sh

If you use parts of this repository we appreciate if you cite us:

@InProceedings{vangoot:ea:2018,
  author    = {van der Goot, Rob and Ljube\v{s}i\'{c}, Nikola  and Matroos, Ian and Nissim, Malvina and Plank, Barbara},
  title     = {Bleaching Text: Abstract Features for Cross-lingual Gender Prediction},
  booktitle = {Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics},
  month     = {August},
  year      = {2018},
  address   = {Melbourne},
  publisher = {Association for Computational Linguistics},
}

You can’t perform that action at this time.