Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
exploring the relationship between gender and scientific research
Python
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
data
gender totally works, generates txt files so you don't have to scrape the Wi…
scripts counting gender neutral names
.gitignore added simple script for plos authors
README
plos_api_key added plos solr module

README

 (                                                                                 
 )\ )                                       (      (                 (             
(()/(   (    (             (       )        )\ )   )\ )     (        )\ )  (  (    
 /(_))( )\  ))\ (     (   ))\   ( /(  (    (()/(  (()/(    ))\ (    (()/( ))\ )(   
(_))  )((_)/((_))\ )  )\ /((_)  )(_)) )\ )  ((_))  /(_))_ /((_))\ )  ((_))((_|()\  
/ __|((_|_|_)) _(_/( ((_|_))   ((_)_ _(_/(  _| |  (_)) __(_)) _(_/(  _| (_))  ((_) 
\__ Y _|| / -_) ' \)) _|/ -_)  / _` | ' \)) _` |    | (_ / -_) ' \)) _` / -_)| '_| 
|___|__||_\___|_||_|\__|\___|  \__,_|_||_|\__,_|     \___\___|_||_|\__,_\___||_| 

Science and Gender

The goal of this application is to predict gender, with a reasonable margin of error, based only on author names found in articles published by PLoS accessible through http://api.plos.org/

##### Resources #####

+ Facebook Name and Gender Research data:
    - http://sites.google.com/site/facebooknamelist/namelist
+ List of common gender-neutral names:
    - http://evan.nixsyspaus.org/names/
    - data: http://evan.nixsyspaus.org/names/ordered-names.txt
+ "Baby Name Guesser" gives HTML web page back for name query with gender probability value and name popularity
    - http://www.gpeters.com/names/baby-names.php
+ Wolfram Alpha returns name information for known names
    - e.g. http://www.wolframalpha.com/input/?i=Mary
    - For single name (best to minimize errors) API query: http://www.wolframalpha.com/input/?i=name%2C+ELIZABETH
    - The Wolfram Alpha API only allows individual applications 2,000 queries per month >.< meaning, useless for us.
+ US Census Data (DUH)
    - Names by rank with female/male breakdown: http://www.census.gov/genealogy/names/names_files.html
    - Male first data: http://www.census.gov/genealogy/names/dist.male.first
    - Female first data: http://www.census.gov/genealogy/names/dist.female.first
+ Also using Wikipedia lists of names, those organized by category
    - http://en.wikipedia.org/wiki/Category:Given_names_by_gender
    - Also, just realized there is a list of gender-neutral names: http://en.wikipedia.org/wiki/Category:Unisex_given_names
    - Didn't know about the Wikipedia API: http://en.wikipedia.org/w/api.php until late, but successfully scraped the name data! You can find it in our data/ directory.

Something went wrong with that request. Please try again.