Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

exploring the relationship between gender and scientific research

branch: master

Fetching latest commit…

Octocat-spinner-32-eaf2f5

Cannot retrieve the latest commit at this time

Octocat-spinner-32 data
Octocat-spinner-32 gender
Octocat-spinner-32 scripts
Octocat-spinner-32 .gitignore
Octocat-spinner-32 README
Octocat-spinner-32 plos_api_key
README
 (                                                                                 
 )\ )                                       (      (                 (             
(()/(   (    (             (       )        )\ )   )\ )     (        )\ )  (  (    
 /(_))( )\  ))\ (     (   ))\   ( /(  (    (()/(  (()/(    ))\ (    (()/( ))\ )(   
(_))  )((_)/((_))\ )  )\ /((_)  )(_)) )\ )  ((_))  /(_))_ /((_))\ )  ((_))((_|()\  
/ __|((_|_|_)) _(_/( ((_|_))   ((_)_ _(_/(  _| |  (_)) __(_)) _(_/(  _| (_))  ((_) 
\__ Y _|| / -_) ' \)) _|/ -_)  / _` | ' \)) _` |    | (_ / -_) ' \)) _` / -_)| '_| 
|___|__||_\___|_||_|\__|\___|  \__,_|_||_|\__,_|     \___\___|_||_|\__,_\___||_| 

Science and Gender

The goal of this application is to predict gender, with a reasonable margin of error, based only on author names found in articles published by PLoS accessible through http://api.plos.org/

##### Resources #####

+ Facebook Name and Gender Research data:
    - http://sites.google.com/site/facebooknamelist/namelist
+ List of common gender-neutral names:
    - http://evan.nixsyspaus.org/names/
    - data: http://evan.nixsyspaus.org/names/ordered-names.txt
+ "Baby Name Guesser" gives HTML web page back for name query with gender probability value and name popularity
    - http://www.gpeters.com/names/baby-names.php
+ Wolfram Alpha returns name information for known names
    - e.g. http://www.wolframalpha.com/input/?i=Mary
    - For single name (best to minimize errors) API query: http://www.wolframalpha.com/input/?i=name%2C+ELIZABETH
    - The Wolfram Alpha API only allows individual applications 2,000 queries per month >.< meaning, useless for us.
+ US Census Data (DUH)
    - Names by rank with female/male breakdown: http://www.census.gov/genealogy/names/names_files.html
    - Male first data: http://www.census.gov/genealogy/names/dist.male.first
    - Female first data: http://www.census.gov/genealogy/names/dist.female.first
+ Also using Wikipedia lists of names, those organized by category
    - http://en.wikipedia.org/wiki/Category:Given_names_by_gender
    - Also, just realized there is a list of gender-neutral names: http://en.wikipedia.org/wiki/Category:Unisex_given_names
    - Didn't know about the Wikipedia API: http://en.wikipedia.org/w/api.php until late, but successfully scraped the name data! You can find it in our data/ directory.

Something went wrong with that request. Please try again.