amacinho/Gender-Expectations-Predictions
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This document contains the corpus-based predictions of gender expectations of more than 20.000 actions that are represented in ConceptNet. Currently, there are two predictions: Twitter-based (twitter.txt) and Web-based (ukwac.txt). For details consult the references at the end. Both files have the same format: Each line consists of 4 tab-seperated fields, action, male count, female count, normalized score. Male and female counts correspond to the number of male and female utterances of the action either in a Twitter-based or Web-based corpus. The normalized score is a z-normalization based on a binomial assumption of male and female utterances with p equal to ratio of overall male utterances to the all utterances. References: A. Herdağdelen, M. Baroni (2011). Stereotypical gender actions can be extracted from Web text. Journal of the American Society for Information Science and Technology, Volume 62, Issue 9, pages 1741–1749. A. Herdağdelen (2010). Gender differences in tweets: A corpus- based analysis of Twitter. Turunç Workshop on Complex Systems, 30 August-01 September 2010, Marmaris, Turkey.
About
Corpus-based predictions of gender expectations of all actions contained in ConceptNet
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published