Skip to content

OpenUpSA/nameclass

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nameclass

A simple API to classify race and gender by name.

This tool attempts to infer a person's race and gender from their name using a classification algorithm. Attempting to identify a person's race from their name may seem ethically questional but is important for analysis of racial transformation over time. This classifier was trained using data from South Africa and so may not easily be used for other countries. Race classes are borrowed from official races used in South Africa for classification of demographics.

The classifier tends to work well with African names. Chinese has good precision but terrible recall. Results with Indian names are fair. Coloured names are often confused with both White and Indian names resulting in poor performance for the class.

5-fold crossvalidation results are shown below:

Train size Test size Race Precision Recall F1
30001500White0.7680.74590.7567
30001500Indian0.88850.82340.8547
30001500African0.91360.98390.9474
30001500Chinese0.96290.30300.4609
30001500Coloured0.64380.72800.6833

About

A simple API to classify race and gender by name.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages