Predicts gender based on name

genderPredictor

GenderPredictor is a wrapper around NLTK's Naive Bayes classifier that predicts a person's gender from their name.
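To illustrate the idea, here is a minimal sketch of Naive Bayes classification on name features. It is not the code in genderPredictor.py: it uses a small pure-Python stand-in instead of NLTK's classifier so that it runs without dependencies. The feature set (last letter, last two letters), the smoothing, the function names, and the toy training names are all assumptions made up for this example.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data for illustration only, not the SSA name files.
TOY_NAMES = [
    ("Anna", "F"), ("Maria", "F"), ("Julia", "F"), ("Emma", "F"),
    ("John", "M"), ("Peter", "M"), ("David", "M"), ("Mark", "M"),
]

def name_features(name):
    """Assumed feature set: the last letter and the last two letters."""
    name = name.lower()
    return {"last_letter": name[-1], "last_two": name[-2:]}

def train(labeled_names):
    """Count labels and (feature, value) pairs seen with each label."""
    label_counts = Counter()
    feature_counts = defaultdict(Counter)
    for name, label in labeled_names:
        label_counts[label] += 1
        for item in name_features(name).items():
            feature_counts[label][item] += 1
    return label_counts, feature_counts

def classify(model, name):
    """Return the label with the highest log prior plus log likelihoods."""
    label_counts, feature_counts = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in label_counts.items():
        score = math.log(count / total)
        for item in name_features(name).items():
            # Simple additive smoothing so unseen values never give log(0).
            score += math.log((feature_counts[label][item] + 1) / (count + 2))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(TOY_NAMES)
print(classify(model, "Sofia"))  # F
print(classify(model, "Erik"))   # M
```

With real data, the same feature dictionaries could be passed to NLTK's `NaiveBayesClassifier.train` as `(features, label)` pairs in place of the hand-rolled `train` and `classify` above.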

This problem is common when dealing with incomplete contact information for users.

It currently appears to be about 82% accurate on American names, but this is only the framework. The name files come from the US Social Security Administration and are likely in the public domain. The processed files are distributed under the same terms as the original data (which is likely public domain).
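The README does not say how the accuracy figure was measured. One common approach is held-out accuracy: train on part of the labeled names and score predictions on the rest. The sketch below assumes that approach. The helper names, the 20% test fraction, the last-letter stand-in classifier, and the toy names are all hypothetical.

```python
import random
from collections import Counter, defaultdict

def holdout_accuracy(labeled_names, train_fn, classify_fn,
                     test_fraction=0.2, seed=0):
    """Shuffle, hold out a fraction for testing, train on the rest."""
    data = list(labeled_names)
    random.Random(seed).shuffle(data)
    n_test = max(1, round(len(data) * test_fraction))
    test_set, train_set = data[:n_test], data[n_test:]
    model = train_fn(train_set)
    correct = sum(1 for name, label in test_set
                  if classify_fn(model, name) == label)
    return correct / len(test_set)

def train_last_letter(labeled_names):
    """Stand-in model: majority label for each final letter."""
    votes = defaultdict(Counter)
    for name, label in labeled_names:
        votes[name[-1].lower()][label] += 1
    return {letter: c.most_common(1)[0][0] for letter, c in votes.items()}

def classify_last_letter(model, name):
    # Arbitrary fallback label for final letters never seen in training.
    return model.get(name[-1].lower(), "F")

# Hypothetical toy data, chosen so the last letter decides the label.
TOY_DATA = [
    ("Anna", "F"), ("Maria", "F"), ("Julia", "F"), ("Emma", "F"), ("Sofia", "F"),
    ("John", "M"), ("Ethan", "M"), ("Ryan", "M"), ("Owen", "M"), ("Jason", "M"),
]

accuracy = holdout_accuracy(TOY_DATA, train_last_letter, classify_last_letter)
print(f"held-out accuracy: {accuracy:.2f}")
```

The toy data is trivially separable, so the score here says nothing about the 82% figure. On the real name files the result would depend on the features and on how the data is split.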

The code is under the Apache 2.0 license.

Comments and suggestions are welcome at stephen.holiday@gmail.com.

Stephen Holiday stephenholiday.com
