ethnicolor: Impute Race and Ethnicity Based on Name
The data are from:
- The 2000 Census
- The 2010 Census
- The Florida Voter Registration File from February 2017. Request access to the data here or request your own copy from the Florida Secretary of State.
- Wikipedia Data
Included Models for Imputing Race
census_lnwill simply append data from the 2000 and 2010 census. For some mild processing that underlies it, see here.
pred_census_lnexposes a model based off Florida Voter Registration Last Name Data. For details, see here
pred_fl_reg_lnexposes a model built using Florida Voter Registration data. Utilizes only the surname. For details, see here
pred_fl_reg_lnexposes a model built using Florida Voter Registration data. Utilizes the first name, middle name (if there), last name, and suffix (if there). For details, see here
To get the current development version from GitHub:
# install.packages("devtools") devtools::install_github("soodoku/ethnicolor")
For Python package that provides roughly the same functionality, see ethnicolr.
Scripts are released under the MIT License.
Contributor Code of Conduct
The project welcomes contributions from everyone! In fact, it depends on it. To maintain this welcoming atmosphere, and to collaborate in a fun and productive way, we expect contributors to the project to abide by the Contributor Code of Conduct.