| name | age |
|---|---|
| .gitignore | Fri May 15 07:46:47 -0700 2009 |
| 1-download.r | Fri May 15 07:46:47 -0700 2009 |
| 2-parse.rb | Fri May 15 07:46:47 -0700 2009 |
| 3-clean.r | Fri May 15 07:48:29 -0700 2009 |
| 4-explore.r | Fri May 15 09:02:34 -0700 2009 |
| 5-old-testament.r | Sun Jun 07 21:11:55 -0700 2009 |
| 6-sex-exploration.r | Tue Oct 06 15:13:15 -0700 2009 |
| 7-top5.r | Mon Jun 08 07:20:28 -0700 2009 |
| 8-variable.r | Wed Oct 21 09:34:11 -0700 2009 |
| baby-names-by-state.csv | Mon Jun 08 07:20:42 -0700 2009 |
| baby-names.csv | Fri May 15 07:48:29 -0700 2009 |
| births.csv | Wed Oct 21 09:34:11 -0700 2009 |
| by-state/ | Mon Jun 08 07:20:42 -0700 2009 |
| images/ | Fri May 15 09:02:34 -0700 2009 |
| old-testament.txt | Sun Jun 07 21:11:55 -0700 2009 |
| readme.markdown | Sun May 17 07:59:07 -0700 2009 |
US Baby names 1880-2009
Data
baby-names.csv contains the top 1000 girl and boy baby names from 1880 to 2009. The data was aggregated from files made available by the Social Security Administration. If you want to recreate it yourself, run 1-download.r, 2-parse.rb, and 3-clean.r in order. You will need both R and Ruby.
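The three steps can be chained from one small script. This is a minimal sketch, not part of the repository: the helper names `interpreter_for`, `pipeline_commands`, and `run_pipeline` are hypothetical, and it assumes `Rscript` and `ruby` are on your PATH and that you run it from the repository root.

```ruby
# Hypothetical helper: pick an interpreter from the file extension.
# Assumes .r files run under Rscript and .rb files under ruby.
def interpreter_for(script)
  case File.extname(script)
  when ".r"  then "Rscript"
  when ".rb" then "ruby"
  else raise ArgumentError, "no interpreter known for #{script}"
  end
end

# Build the commands in the order the readme gives.
def pipeline_commands(scripts = %w[1-download.r 2-parse.rb 3-clean.r])
  scripts.map { |script| [interpreter_for(script), script] }
end

# Run each step and stop at the first one that fails.
# Kernel#system returns true only when the command exits with status 0.
def run_pipeline(commands = pipeline_commands)
  commands.each do |cmd|
    system(*cmd) or raise "step failed: #{cmd.join(' ')}"
  end
end

# To rebuild baby-names.csv, call:
#   run_pipeline
```

Stopping at the first failure matters here because each script presumably consumes the output of the one before it.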
Percent of names in top 1000

Since the 1960s, the percentage of babies with names in the top 1000 has been shrinking, to its current level of 80% of boys and 67% of girls.
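One way to compute such a percentage is to sum the per-name shares within each year and sex. The sketch below is an illustration, not the code behind the plot: it assumes the CSV has columns named `year`, `name`, `percent` (a fraction of babies of that sex), and `sex`, and the sample rows are made up for the example rather than taken from baby-names.csv. The function name `top_share` is hypothetical.

```ruby
require "csv"

# Made-up rows for illustration; column names are an assumption.
SAMPLE = <<~ROWS
  year,name,percent,sex
  1880,John,0.08,boy
  1880,William,0.07,boy
  1880,Mary,0.07,girl
  1881,John,0.06,boy
ROWS

# Sum the share of babies covered by the listed names,
# keyed by [year, sex].
def top_share(csv_text)
  totals = Hash.new(0.0)
  CSV.parse(csv_text, headers: true) do |row|
    totals[[row["year"].to_i, row["sex"]]] += row["percent"].to_f
  end
  totals
end

top_share(SAMPLE).each do |(year, sex), share|
  puts format("%d %-4s %.1f%%", year, sex, share * 100)
end
```

To try it on the real file, pass `File.read("baby-names.csv")` instead of `SAMPLE`, after checking that the column names match.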
Last letters
Stimulated by the discussion on Andrew Gelman's blog (itself prompted by an old post on the Baby Name Wizard blog), here are plots showing the distribution of the last letter of names, 1880-2008.
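The numbers behind such a plot can be tabulated by grouping on the final letter of each name. This is a rough sketch under the same assumptions as before (columns `year`, `name`, `percent`, `sex`), with made-up sample rows. The function name `last_letter_shares` is hypothetical, and the plotting itself is left out.

```ruby
require "csv"

# Made-up rows for illustration; column names are an assumption.
LETTER_SAMPLE = <<~ROWS
  year,name,percent,sex
  1880,John,0.08,boy
  1880,William,0.07,boy
  1880,Nathan,0.01,boy
  1880,Mary,0.07,girl
ROWS

# Sum the share of babies by the last letter of their name,
# keyed by [year, sex, letter].
def last_letter_shares(csv_text)
  shares = Hash.new(0.0)
  CSV.parse(csv_text, headers: true) do |row|
    letter = row["name"][-1].downcase
    shares[[row["year"].to_i, row["sex"], letter]] += row["percent"].to_f
  end
  shares
end

last_letter_shares(LETTER_SAMPLE).each do |(year, sex, letter), share|
  puts format("%d %-4s %s %.3f", year, sex, letter, share)
end
```

Each year and sex then yields one value per letter, which is the shape a per-letter distribution plot needs.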








