Skip to content

hadley/data-baby-names

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

US Baby names 1880-2009

Data

baby-names.csv contains the top 1000 girl and boy baby names from 1880 to 2009. This data was aggregated from the data made available from the social security administration. If you want to recreate it yourself, run the files 1-download.r, 2-parse.rb and 3-clean.r in order. You will need both R and ruby.

Percent of names in top 1000

Percent of baby names in top 1000

Since the 1960's the percentage of babies with names in the top 1000 has been shrinking, to it's current level of 80% of boys and 67% of girls.

Last letters

Stimulated by the discussion on Andrew Gelman's blog (prompted by an old post of the baby name wizard blog) here are plots showing the distribution of last letter of names, 1880-2008.

Distribution of last letter of baby names

About

Distribution of US baby names, 1880-2008

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published