# Getting started with R

### **By [Christine Zhang](https://twitter.com/christinezhang) (Knight-Mozilla / Los Angeles Times) & [Ryan Menezes](https://twitter.com/ryanvmenezes) (Los Angeles Times)** 

*IRE conference -- New Orleans, LA*  
 
June 18, 2016  

This workshop is a basic introduction to R, a free, open-source language and software environment for data analysis and statistics.

R is a powerful tool that can help you quickly and effectively answer questions using data.

Take our host city, New Orleans, for example:

* Hurricane Katrina was a devastating natural disaster that substantially affected the population of New Orleans.

* The hurricane took place in August 2005, which coincidentally falls between the 2000 and 2010 U.S. Censuses.

* This lets us compare the populations of different areas of New Orleans before and after the hurricane.

**Here are some questions we will set out to answer:**

* How much did the population of New Orleans change between 2000 and 2010?
* In which census tracts and parishes (Louisiana lingo for counties) did it change the most?
* How did the occupancy of homes change in that time period?

**We will use the Demographic Profiles for both years, downloaded from the [U.S. Census Bureau](http://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml).**

**In this Getting Started session, we will:**

* Load in the datasets
* Perform some cleaning of the data
* Select the variables of interest
* Look at Orleans Parish in particular, which shares its boundaries with the city of New Orleans
* Calculate the percent change in population between 2000 and 2010
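
The last step in the list above is plain arithmetic: the change in population divided by the starting population, times 100. Here is a minimal sketch of that calculation. The function name `pct_change` and the two example areas are hypothetical, and the numbers are made up for illustration; they are not Census figures.

```r
# Percent change from an old value to a new value.
pct_change <- function(old, new) {
  (new - old) / old * 100
}

# Made-up populations for two hypothetical areas (not real Census data).
pop_2000 <- c(area_a = 1000, area_b = 400)
pop_2010 <- c(area_a = 750, area_b = 500)

# R works on whole vectors at once, so both areas are computed in one call.
change <- pct_change(pop_2000, pop_2010)
print(change)
```

A negative result means the population shrank and a positive result means it grew. Dividing by the 2000 value (not the 2010 value) is what makes 2000 the baseline.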

Load in the data (we'll call it "census2000") and inspect the first few rows.

In [1]:
census2000 <- read.csv('2000_census_demographic_profile.csv')

In [2]:
head(census2000)

Unnamed: 0,GEO.id,GEO.id2,GEO.display.label,HC01_VC01,HC02_VC01,HC01_VC03,HC02_VC03,HC01_VC04,HC02_VC04,HC01_VC05,HC02_VC05,HC01_VC06,HC02_VC06,HC01_VC07,HC02_VC07,HC01_VC08,HC02_VC08,HC01_VC09,HC02_VC09,HC01_VC10,HC02_VC10,HC01_VC11,HC02_VC11,HC01_VC12,HC02_VC12,HC01_VC13,HC02_VC13,HC01_VC14,HC02_VC14,HC01_VC15,HC02_VC15,HC01_VC16,HC02_VC16,HC01_VC17,HC02_VC17,HC01_VC18,HC02_VC18,HC01_VC19,HC02_VC19,HC01_VC20,HC02_VC20,HC01_VC21,HC02_VC21,HC01_VC22,HC02_VC22,HC01_VC23,HC02_VC23,HC01_VC24,HC02_VC24,HC01_VC25,HC02_VC25,HC01_VC26,HC02_VC26,HC01_VC28,HC02_VC28,HC01_VC29,HC02_VC29,HC01_VC30,HC02_VC30,HC01_VC31,HC02_VC31,HC01_VC32,HC02_VC32,HC01_VC33,HC02_VC33,HC01_VC34,HC02_VC34,HC01_VC35,HC02_VC35,HC01_VC36,HC02_VC36,HC01_VC37,HC02_VC37,HC01_VC38,HC02_VC38,HC01_VC39,HC02_VC39,HC01_VC40,HC02_VC40,HC01_VC41,HC02_VC41,HC01_VC42,HC02_VC42,HC01_VC43,HC02_VC43,HC01_VC44,HC02_VC44,HC01_VC45,HC02_VC45,HC01_VC46,HC02_VC46,HC01_VC48,HC02_VC48,HC01_VC49,HC02_VC49,HC01_VC50,HC02_VC50,HC01_VC51,HC02_VC51,HC01_VC52,HC02_VC52,HC01_VC53,HC02_VC53,HC01_VC55,HC02_VC55,HC01_VC56,HC02_VC56,HC01_VC57,HC02_VC57,HC01_VC58,HC02_VC58,HC01_VC59,HC02_VC59,HC01_VC60,HC02_VC60,HC01_VC61,HC02_VC61,HC01_VC62,HC02_VC62,HC01_VC64,HC02_VC64,HC01_VC65,HC02_VC65,HC01_VC66,HC02_VC66,HC01_VC67,HC02_VC67,HC01_VC68,HC02_VC68,HC01_VC69,HC02_VC69,HC01_VC70,HC02_VC70,HC01_VC71,HC02_VC71,HC01_VC72,HC02_VC72,HC01_VC73,HC02_VC73,HC01_VC74,HC02_VC74,HC01_VC75,HC02_VC75,HC01_VC76,HC02_VC76,HC01_VC78,HC02_VC78,HC01_VC79,HC02_VC79,HC01_VC80,HC02_VC80,HC01_VC81,HC02_VC81,HC01_VC82,HC02_VC82,HC01_VC83,HC02_VC83,HC01_VC84,HC02_VC84,HC01_VC85,HC02_VC85,HC01_VC86,HC02_VC86,HC01_VC87,HC02_VC87,HC01_VC88,HC02_VC88,HC01_VC89,HC02_VC89,HC01_VC90,HC02_VC90,HC01_VC91,HC02_VC91,HC01_VC93,HC02_VC93,HC01_VC94,HC02_VC94,HC01_VC95,HC02_VC95,HC01_VC96,HC02_VC96,HC01_VC97,HC02_VC97,HC01_VC98,HC02_VC98,HC01_VC100,HC02_VC100,HC01_VC101,HC02_VC101,HC01_VC102,HC02_VC102,HC01_VC103,HC02_VC103,HC01_VC104,HC02_VC104
1,Id,Id2,Geography,Number; Total population,Percent; Total population,Number; Total population - SEX AND AGE - Male,Percent; Total population - SEX AND AGE - Male,Number; Total population - SEX AND AGE - Female,Percent; Total population - SEX AND AGE - Female,Number; Total population - SEX AND AGE - Under 5 years,Percent; Total population - SEX AND AGE - Under 5 years,Number; Total population - SEX AND AGE - 5 to 9 years,Percent; Total population - SEX AND AGE - 5 to 9 years,Number; Total population - SEX AND AGE - 10 to 14 years,Percent; Total population - SEX AND AGE - 10 to 14 years,Number; Total population - SEX AND AGE - 15 to 19 years,Percent; Total population - SEX AND AGE - 15 to 19 years,Number; Total population - SEX AND AGE - 20 to 24 years,Percent; Total population - SEX AND AGE - 20 to 24 years,Number; Total population - SEX AND AGE - 25 to 34 years,Percent; Total population - SEX AND AGE - 25 to 34 years,Number; Total population - SEX AND AGE - 35 to 44 years,Percent; Total population - SEX AND AGE - 35 to 44 years,Number; Total population - SEX AND AGE - 45 to 54 years,Percent; Total population - SEX AND AGE - 45 to 54 years,Number; Total population - SEX AND AGE - 55 to 59 years,Percent; Total population - SEX AND AGE - 55 to 59 years,Number; Total population - SEX AND AGE - 60 to 64 years,Percent; Total population - SEX AND AGE - 60 to 64 years,Number; Total population - SEX AND AGE - 65 to 74 years,Percent; Total population - SEX AND AGE - 65 to 74 years,Number; Total population - SEX AND AGE - 75 to 84 years,Percent; Total population - SEX AND AGE - 75 to 84 years,Number; Total population - SEX AND AGE - 85 years and over,Percent; Total population - SEX AND AGE - 85 years and over,Number; Total population - SEX AND AGE - Median age (years),Percent; Total population - SEX AND AGE - Median age (years),Number; Total population - SEX AND AGE - 18 years and over,Percent; Total population - SEX AND AGE - 18 years and over,Number; Total population - SEX 
AND AGE - 18 years and over - Male,Percent; Total population - SEX AND AGE - 18 years and over - Male,Number; Total population - SEX AND AGE - 18 years and over - Female,Percent; Total population - SEX AND AGE - 18 years and over - Female,Number; Total population - SEX AND AGE - 21 years and over,Percent; Total population - SEX AND AGE - 21 years and over,Number; Total population - SEX AND AGE - 62 years and over,Percent; Total population - SEX AND AGE - 62 years and over,Number; Total population - SEX AND AGE - 65 years and over,Percent; Total population - SEX AND AGE - 65 years and over,Number; Total population - SEX AND AGE - 65 years and over - Male,Percent; Total population - SEX AND AGE - 65 years and over - Male,Number; Total population - SEX AND AGE - 65 years and over - Female,Percent; Total population - SEX AND AGE - 65 years and over - Female,Number; Total population - RACE - One race,Percent; Total population - RACE - One race,Number; Total population - RACE - One race - White,Percent; Total population - RACE - One race - White,Number; Total population - RACE - One race - Black or African American,Percent; Total population - RACE - One race - Black or African American,Number; Total population - RACE - One race - American Indian and Alaska Native,Percent; Total population - RACE - One race - American Indian and Alaska Native,Number; Total population - RACE - One race - Asian,Percent; Total population - RACE - One race - Asian,Number; Total population - RACE - One race - Asian - Asian Indian,Percent; Total population - RACE - One race - Asian - Asian Indian,Number; Total population - RACE - One race - Asian - Chinese,Percent; Total population - RACE - One race - Asian - Chinese,Number; Total population - RACE - One race - Asian - Filipino,Percent; Total population - RACE - One race - Asian - Filipino,Number; Total population - RACE - One race - Asian - Japanese,Percent; Total population - RACE - One race - Asian - Japanese,Number; Total population - RACE 
- One race - Asian - Korean,Percent; Total population - RACE - One race - Asian - Korean,Number; Total population - RACE - One race - Asian - Vietnamese,Percent; Total population - RACE - One race - Asian - Vietnamese,Number; Total population - RACE - One race - Asian - Other Asian [1],Percent; Total population - RACE - One race - Asian - Other Asian [1],Number; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander,Percent; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander,Number; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Native Hawaiian,Percent; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Native Hawaiian,Number; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Guamanian or Chamorro,Percent; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Guamanian or Chamorro,Number; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Samoan,Percent; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Samoan,Number; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Other Pacific Islander [2],Percent; Total population - RACE - One race - Native Hawaiian and Other Pacific Islander - Other Pacific Islander [2],Number; Total population - RACE - One race - Some other race,Percent; Total population - RACE - One race - Some other race,Number; Total population - RACE - Two or more races,Percent; Total population - RACE - Two or more races,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - White,Percent; Total population - RACE - Race alone or in combination with one or more other races [3] - White,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - Black or African American,Percent; Total population - RACE - 
Race alone or in combination with one or more other races [3] - Black or African American,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - American Indian and Alaska Native,Percent; Total population - RACE - Race alone or in combination with one or more other races [3] - American Indian and Alaska Native,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - Asian,Percent; Total population - RACE - Race alone or in combination with one or more other races [3] - Asian,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - Native Hawaiian and Other Pacific Islander,Percent; Total population - RACE - Race alone or in combination with one or more other races [3] - Native Hawaiian and Other Pacific Islander,Number; Total population - RACE - Race alone or in combination with one or more other races [3] - Some other race,Percent; Total population - RACE - Race alone or in combination with one or more other races [3] - Some other race,Number; HISPANIC OR LATINO AND RACE - Total population,Percent; HISPANIC OR LATINO AND RACE - Total population,Number; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race),Percent; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race),Number; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Mexican,Percent; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Mexican,Number; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Puerto Rican,Percent; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Puerto Rican,Number; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Cuban,Percent; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Cuban,Number; HISPANIC OR 
LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Other Hispanic or Latino,Percent; HISPANIC OR LATINO AND RACE - Total population - Hispanic or Latino (of any race) - Other Hispanic or Latino,Number; HISPANIC OR LATINO AND RACE - Total population - Not Hispanic or Latino,Percent; HISPANIC OR LATINO AND RACE - Total population - Not Hispanic or Latino,Number; HISPANIC OR LATINO AND RACE - Total population - Not Hispanic or Latino - White alone,Percent; HISPANIC OR LATINO AND RACE - Total population - Not Hispanic or Latino - White alone,Number; RELATIONSHIP - Total population,Percent; RELATIONSHIP - Total population,Number; RELATIONSHIP - Total population - In households,Percent; RELATIONSHIP - Total population - In households,Number; RELATIONSHIP - Total population - In households - Householder,Percent; RELATIONSHIP - Total population - In households - Householder,Number; RELATIONSHIP - Total population - In households - Spouse,Percent; RELATIONSHIP - Total population - In households - Spouse,Number; RELATIONSHIP - Total population - In households - Child,Percent; RELATIONSHIP - Total population - In households - Child,Number; RELATIONSHIP - Total population - In households - Child - Own child under 18 years,Percent; RELATIONSHIP - Total population - In households - Child - Own child under 18 years,Number; RELATIONSHIP - Total population - In households - Other relatives,Percent; RELATIONSHIP - Total population - In households - Other relatives,Number; RELATIONSHIP - Total population - In households - Other relatives - Under 18 years,Percent; RELATIONSHIP - Total population - In households - Other relatives - Under 18 years,Number; RELATIONSHIP - Total population - In households - Nonrelatives,Percent; RELATIONSHIP - Total population - In households - Nonrelatives,Number; RELATIONSHIP - Total population - In households - Nonrelatives - Unmarried partner,Percent; RELATIONSHIP - Total population - In households - Nonrelatives - Unmarried 
partner,Number; RELATIONSHIP - Total population - In group quarters,Percent; RELATIONSHIP - Total population - In group quarters,Number; RELATIONSHIP - Total population - In group quarters - Institutionalized population,Percent; RELATIONSHIP - Total population - In group quarters - Institutionalized population,Number; RELATIONSHIP - Total population - In group quarters - Noninstitutionalized population,Percent; RELATIONSHIP - Total population - In group quarters - Noninstitutionalized population,Number; HOUSEHOLDS BY TYPE - Total households,Percent; HOUSEHOLDS BY TYPE - Total households,Number; HOUSEHOLDS BY TYPE - Total households - Family households (families),Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families),Number; HOUSEHOLDS BY TYPE - Total households - Family households (families) - With own children under 18 years,Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families) - With own children under 18 years,Number; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Married-couple family,Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Married-couple family,Number; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Married-couple family - With own children under 18 years,Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Married-couple family - With own children under 18 years,"Number; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Female householder, no husband present","Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Female householder, no husband present","Number; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Female householder, no husband present - With own children under 18 years","Percent; HOUSEHOLDS BY TYPE - Total households - Family households (families) - Female householder, no husband present - With own children under 18 
years",Number; HOUSEHOLDS BY TYPE - Total households - Nonfamily households,Percent; HOUSEHOLDS BY TYPE - Total households - Nonfamily households,Number; HOUSEHOLDS BY TYPE - Total households - Nonfamily households - Householder living alone,Percent; HOUSEHOLDS BY TYPE - Total households - Nonfamily households - Householder living alone,Number; HOUSEHOLDS BY TYPE - Total households - Nonfamily households - Householder living alone - Householder 65 years and over,Percent; HOUSEHOLDS BY TYPE - Total households - Nonfamily households - Householder living alone - Householder 65 years and over,Number; HOUSEHOLDS BY TYPE - Total households - Households with individuals under 18 years,Percent; HOUSEHOLDS BY TYPE - Total households - Households with individuals under 18 years,Number; HOUSEHOLDS BY TYPE - Total households - Households with individuals 65 years and over,Percent; HOUSEHOLDS BY TYPE - Total households - Households with individuals 65 years and over,Number; HOUSEHOLDS BY TYPE - Total households - Average household size,Percent; HOUSEHOLDS BY TYPE - Total households - Average household size,Number; HOUSEHOLDS BY TYPE - Total households - Average family size,Percent; HOUSEHOLDS BY TYPE - Total households - Average family size,Number; HOUSING OCCUPANCY - Total housing units,Percent; HOUSING OCCUPANCY - Total housing units,Number; HOUSING OCCUPANCY - Total housing units - Occupied housing units,Percent; HOUSING OCCUPANCY - Total housing units - Occupied housing units,Number; HOUSING OCCUPANCY - Total housing units - Vacant housing units,Percent; HOUSING OCCUPANCY - Total housing units - Vacant housing units,"Number; HOUSING OCCUPANCY - Total housing units - Vacant housing units - For seasonal, recreational, or occasional use","Percent; HOUSING OCCUPANCY - Total housing units - Vacant housing units - For seasonal, recreational, or occasional use",Number; HOUSING OCCUPANCY - Total housing units - Homeowner vacancy rate (percent),Percent; HOUSING OCCUPANCY - Total 
housing units - Homeowner vacancy rate (percent),Number; HOUSING OCCUPANCY - Total housing units - Rental vacancy rate (percent),Percent; HOUSING OCCUPANCY - Total housing units - Rental vacancy rate (percent),Number; HOUSING TENURE - Occupied housing units,Percent; HOUSING TENURE - Occupied housing units,Number; HOUSING TENURE - Occupied housing units - Owner-occupied housing units,Percent; HOUSING TENURE - Occupied housing units - Owner-occupied housing units,Number; HOUSING TENURE - Occupied housing units - Renter-occupied housing units,Percent; HOUSING TENURE - Occupied housing units - Renter-occupied housing units,Number; HOUSING TENURE - Occupied housing units - Average household size of owner-occupied unit,Percent; HOUSING TENURE - Occupied housing units - Average household size of owner-occupied unit,Number; HOUSING TENURE - Occupied housing units - Average household size of renter-occupied unit,Percent; HOUSING TENURE - Occupied housing units - Average household size of renter-occupied unit
2,1400000US22001960100,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,100,2920,47,3268,53,462,8,502,8,541,9,572,9,375,6,728,12,913,15,699,11,301,5,252,4,433,7,287,5,123,2,34,(X),4304,70,1957,32,2347,38,4031,65,996,16,843,14,295,5,548,9,6174,100,4455,72,1675,27,12,0,7,0,0,0,0,0,2,0,0,0,1,0,0,0,4,0,0,0,0,0,0,0,0,0,0,0,25,0,14,0,4468,72,1677,27,19,0,8,0,0,0,30,1,6188,100,87,1,51,1,1,0,0,0,35,1,6101,99,4398,71,6188,100,6030,97,2236,36,1119,18,2199,36,1700,28,274,4,143,2,202,3,109,2,158,3,151,2,7,0,2236,100,1595,71,868,39,1119,50,573,26,363,16,237,11,641,29,585,26,303,14,962,43,569,25,3,(X),3,(X),2410,100,2236,93,174,7,15,1,1,(X),8,(X),2236,100,1526,68,710,32,3,(X),3,(X)
3,1400000US22001960200,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,100,2562,51,2494,49,346,7,416,8,476,9,463,9,298,6,579,12,861,17,709,14,250,5,203,4,263,5,150,3,42,1,34,(X),3527,70,1758,35,1769,35,3289,65,570,11,455,9,217,4,238,5,5035,100,4799,95,216,4,6,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,13,0,21,0,4816,95,226,5,13,0,6,0,4,0,16,0,5056,100,35,1,18,0,0,0,0,0,17,0,5021,99,4775,94,5056,100,5056,100,1764,35,1216,24,1791,35,1413,28,173,3,86,2,112,2,61,1,0,0,0,0,0,0,1764,100,1408,80,722,41,1216,69,617,35,134,8,81,5,356,20,310,18,128,7,781,44,339,19,3,(X),3,(X),1909,100,1764,92,145,8,31,2,1,(X),7,(X),1764,100,1461,83,303,17,3,(X),3,(X)
4,1400000US22001960300,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,100,1593,51,1556,49,209,7,251,8,305,10,260,8,204,7,368,12,520,17,409,13,148,5,130,4,209,7,104,3,32,1,35,(X),2233,71,1103,35,1130,36,2081,66,435,14,345,11,150,5,195,6,3140,100,3058,97,67,2,8,0,2,0,0,0,1,0,0,0,0,0,0,0,0,0,1,0,1,0,1,0,0,0,0,0,0,0,4,0,9,0,3066,97,69,2,13,0,2,0,1,0,7,0,3149,100,15,1,4,0,0,0,0,0,11,0,3134,100,3049,97,3149,100,3148,100,1145,36,750,24,1091,35,854,27,73,2,38,1,89,3,48,2,1,0,0,0,1,0,1145,100,883,77,445,39,750,66,369,32,93,8,52,5,262,23,228,20,91,8,475,42,247,22,3,(X),3,(X),1246,100,1145,92,101,8,19,2,1,(X),7,(X),1145,100,1041,91,104,9,3,(X),3,(X)
5,1400000US22001960400,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,100,2754,49,2863,51,429,8,406,7,520,9,476,9,353,6,691,12,914,16,684,12,254,5,222,4,410,7,193,3,65,1,34,(X),3944,70,1911,34,2033,36,3716,66,800,14,668,12,302,5,366,7,5583,99,5347,95,207,4,18,0,6,0,1,0,1,0,4,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5,0,34,1,5381,96,211,4,43,1,8,0,0,0,9,0,5617,100,43,1,24,0,0,0,0,0,19,0,5574,99,5307,95,5617,100,5592,100,1991,35,1291,23,1994,36,1554,28,196,4,101,2,120,2,70,1,25,0,10,0,15,0,1991,100,1555,78,804,40,1291,65,641,32,168,8,99,5,436,22,388,20,189,10,861,43,482,24,3,(X),3,(X),2176,100,1991,92,185,9,23,1,1,(X),6,(X),1991,100,1630,82,361,18,3,(X),3,(X)
6,1400000US22001960500,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,100,2461,50,2466,50,400,8,438,9,439,9,418,9,319,7,704,14,777,16,644,13,227,5,154,3,234,5,134,3,39,1,32,(X),3405,69,1675,34,1730,35,3162,64,499,10,407,8,167,3,240,5,4901,100,4498,91,378,8,15,0,4,0,0,0,0,0,1,0,0,0,0,0,3,0,0,0,1,0,0,0,0,0,1,0,0,0,5,0,26,1,4524,92,385,8,35,1,4,0,1,0,9,0,4927,100,61,1,37,1,1,0,0,0,23,1,4866,99,4448,90,4927,100,4921,100,1692,34,1068,22,1809,37,1418,29,174,4,71,1,178,4,100,2,6,0,0,0,6,0,1692,100,1326,78,762,45,1068,63,611,36,182,11,103,6,366,22,300,18,136,8,808,48,316,19,3,(X),3,(X),1796,100,1692,94,104,6,22,1,1,(X),5,(X),1692,100,1419,84,273,16,3,(X),3,(X)
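
With this many columns, `head()` prints a lot at once. Below is a minimal sketch of a few other base R functions that can help when inspecting a wide table. To keep it self-contained it uses a small stand-in data frame; the name `wide` is hypothetical, and it holds only four columns and two rows copied from the output above.

```r
# A tiny stand-in for the wide Census table (values copied from the
# first two tracts in the output above; the real table is far wider).
wide <- data.frame(
  GEO.id2   = c("22001960100", "22001960200"),
  HC01_VC01 = c(6188, 5056),
  HC01_VC03 = c(2920, 2562),
  HC01_VC04 = c(3268, 2494)
)

# Number of rows and columns.
dim(wide)

# Just the first few column names.
names(wide)[1:3]

# The first rows of only the first two columns.
first_two <- head(wide[, 1:2])
print(first_two)
```

The same calls should work on `census2000` itself, for example `dim(census2000)` or `head(census2000[, 1:5])`.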


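Notice that the file came with two header rows: the first holds coded column names (such as HC01_VC01) and the second holds descriptive labels. R used the first row as the header and read the second one in as data. We don't need the first one, so we skip it when loading the file.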

In [3]:
census2000 <- read.csv('2000_census_demographic_profile.csv', skip = 1)
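
To see what `skip = 1` does without needing the full file, here is a minimal sketch that reads a two-column excerpt passed in through the `text` argument of `read.csv()`. The variable names `csv_text`, `with_both_headers` and `skipped` are hypothetical; the values are the first two tracts from the table above.

```r
# A two-column excerpt with the same two header rows as the Census file.
csv_text <- paste(
  c("GEO.id,HC01_VC01",
    "Id,Number; Total population",
    "1400000US22001960100,6188",
    "1400000US22001960200,5056"),
  collapse = "\n"
)

# Without skip: the coded names become the header and the descriptive
# labels are read as a row of data, so the population column is not numeric.
with_both_headers <- read.csv(text = csv_text)
nrow(with_both_headers)

# With skip = 1: the first line is ignored and the descriptive labels
# become the header, so the population column is read as numbers.
skipped <- read.csv(text = csv_text, skip = 1)
names(skipped)   # "Id" "Number..Total.population"
```

By default `read.csv()` also converts the labels into syntactically valid R names, which is why the semicolon and spaces in "Number; Total population" turn into dots. Passing `check.names = FALSE` would keep the labels as written, at the cost of needing backticks to refer to them.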

In [4]:
head(census2000)

Unnamed: 0,Id,Id2,Geography,Number..Total.population,Percent..Total.population,Number..Total.population...SEX.AND.AGE...Male,Percent..Total.population...SEX.AND.AGE...Male,Number..Total.population...SEX.AND.AGE...Female,Percent..Total.population...SEX.AND.AGE...Female,Number..Total.population...SEX.AND.AGE...Under.5.years,Percent..Total.population...SEX.AND.AGE...Under.5.years,Number..Total.population...SEX.AND.AGE...5.to.9.years,Percent..Total.population...SEX.AND.AGE...5.to.9.years,Number..Total.population...SEX.AND.AGE...10.to.14.years,Percent..Total.population...SEX.AND.AGE...10.to.14.years,Number..Total.population...SEX.AND.AGE...15.to.19.years,Percent..Total.population...SEX.AND.AGE...15.to.19.years,Number..Total.population...SEX.AND.AGE...20.to.24.years,Percent..Total.population...SEX.AND.AGE...20.to.24.years,Number..Total.population...SEX.AND.AGE...25.to.34.years,Percent..Total.population...SEX.AND.AGE...25.to.34.years,Number..Total.population...SEX.AND.AGE...35.to.44.years,Percent..Total.population...SEX.AND.AGE...35.to.44.years,Number..Total.population...SEX.AND.AGE...45.to.54.years,Percent..Total.population...SEX.AND.AGE...45.to.54.years,Number..Total.population...SEX.AND.AGE...55.to.59.years,Percent..Total.population...SEX.AND.AGE...55.to.59.years,Number..Total.population...SEX.AND.AGE...60.to.64.years,Percent..Total.population...SEX.AND.AGE...60.to.64.years,Number..Total.population...SEX.AND.AGE...65.to.74.years,Percent..Total.population...SEX.AND.AGE...65.to.74.years,Number..Total.population...SEX.AND.AGE...75.to.84.years,Percent..Total.population...SEX.AND.AGE...75.to.84.years,Number..Total.population...SEX.AND.AGE...85.years.and.over,Percent..Total.population...SEX.AND.AGE...85.years.and.over,Number..Total.population...SEX.AND.AGE...Median.age..years.,Percent..Total.population...SEX.AND.AGE...Median.age..years.,Number..Total.population...SEX.AND.AGE...18.years.and.over,Percent..Total.population...SEX.AND.AGE...18.years.and.over,Number..Total.populati
on...SEX.AND.AGE...18.years.and.over...Male,Percent..Total.population...SEX.AND.AGE...18.years.and.over...Male,Number..Total.population...SEX.AND.AGE...18.years.and.over...Female,Percent..Total.population...SEX.AND.AGE...18.years.and.over...Female,Number..Total.population...SEX.AND.AGE...21.years.and.over,Percent..Total.population...SEX.AND.AGE...21.years.and.over,Number..Total.population...SEX.AND.AGE...62.years.and.over,Percent..Total.population...SEX.AND.AGE...62.years.and.over,Number..Total.population...SEX.AND.AGE...65.years.and.over,Percent..Total.population...SEX.AND.AGE...65.years.and.over,Number..Total.population...SEX.AND.AGE...65.years.and.over...Male,Percent..Total.population...SEX.AND.AGE...65.years.and.over...Male,Number..Total.population...SEX.AND.AGE...65.years.and.over...Female,Percent..Total.population...SEX.AND.AGE...65.years.and.over...Female,Number..Total.population...RACE...One.race,Percent..Total.population...RACE...One.race,Number..Total.population...RACE...One.race...White,Percent..Total.population...RACE...One.race...White,Number..Total.population...RACE...One.race...Black.or.African.American,Percent..Total.population...RACE...One.race...Black.or.African.American,Number..Total.population...RACE...One.race...American.Indian.and.Alaska.Native,Percent..Total.population...RACE...One.race...American.Indian.and.Alaska.Native,Number..Total.population...RACE...One.race...Asian,Percent..Total.population...RACE...One.race...Asian,Number..Total.population...RACE...One.race...Asian...Asian.Indian,Percent..Total.population...RACE...One.race...Asian...Asian.Indian,Number..Total.population...RACE...One.race...Asian...Chinese,Percent..Total.population...RACE...One.race...Asian...Chinese,Number..Total.population...RACE...One.race...Asian...Filipino,Percent..Total.population...RACE...One.race...Asian...Filipino,Number..Total.population...RACE...One.race...Asian...Japanese,Percent..Total.population...RACE...One.race...Asian...Japanese,Number..Total.population
...RACE...One.race...Asian...Korean,Percent..Total.population...RACE...One.race...Asian...Korean,Number..Total.population...RACE...One.race...Asian...Vietnamese,Percent..Total.population...RACE...One.race...Asian...Vietnamese,Number..Total.population...RACE...One.race...Asian...Other.Asian..1.,Percent..Total.population...RACE...One.race...Asian...Other.Asian..1.,Number..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander,Percent..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander,Number..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Native.Hawaiian,Percent..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Native.Hawaiian,Number..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Guamanian.or.Chamorro,Percent..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Guamanian.or.Chamorro,Number..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Samoan,Percent..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Samoan,Number..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Other.Pacific.Islander..2.,Percent..Total.population...RACE...One.race...Native.Hawaiian.and.Other.Pacific.Islander...Other.Pacific.Islander..2.,Number..Total.population...RACE...One.race...Some.other.race,Percent..Total.population...RACE...One.race...Some.other.race,Number..Total.population...RACE...Two.or.more.races,Percent..Total.population...RACE...Two.or.more.races,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....White,Percent..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....White,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Black.or.African.American,Percent..Total.population...R
ACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Black.or.African.American,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....American.Indian.and.Alaska.Native,Percent..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....American.Indian.and.Alaska.Native,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Asian,Percent..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Asian,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Native.Hawaiian.and.Other.Pacific.Islander,Percent..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Native.Hawaiian.and.Other.Pacific.Islander,Number..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Some.other.race,Percent..Total.population...RACE...Race.alone.or.in.combination.with.one.or.more.other.races..3....Some.other.race,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race.,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race.,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Mexican,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Mexican,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Puerto.Rican,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Puerto.Rican,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Cuban,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Cuban,Number..HISPANIC.
OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Other.Hispanic.or.Latino,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Hispanic.or.Latino..of.any.race....Other.Hispanic.or.Latino,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Not.Hispanic.or.Latino,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Not.Hispanic.or.Latino,Number..HISPANIC.OR.LATINO.AND.RACE...Total.population...Not.Hispanic.or.Latino...White.alone,Percent..HISPANIC.OR.LATINO.AND.RACE...Total.population...Not.Hispanic.or.Latino...White.alone,Number..RELATIONSHIP...Total.population,Percent..RELATIONSHIP...Total.population,Number..RELATIONSHIP...Total.population...In.households,Percent..RELATIONSHIP...Total.population...In.households,Number..RELATIONSHIP...Total.population...In.households...Householder,Percent..RELATIONSHIP...Total.population...In.households...Householder,Number..RELATIONSHIP...Total.population...In.households...Spouse,Percent..RELATIONSHIP...Total.population...In.households...Spouse,Number..RELATIONSHIP...Total.population...In.households...Child,Percent..RELATIONSHIP...Total.population...In.households...Child,Number..RELATIONSHIP...Total.population...In.households...Child...Own.child.under.18.years,Percent..RELATIONSHIP...Total.population...In.households...Child...Own.child.under.18.years,Number..RELATIONSHIP...Total.population...In.households...Other.relatives,Percent..RELATIONSHIP...Total.population...In.households...Other.relatives,Number..RELATIONSHIP...Total.population...In.households...Other.relatives...Under.18.years,Percent..RELATIONSHIP...Total.population...In.households...Other.relatives...Under.18.years,Number..RELATIONSHIP...Total.population...In.households...Nonrelatives,Percent..RELATIONSHIP...Total.population...In.households...Nonrelatives,Number..RELATIONSHIP...Total.population...In.households...Nonrelatives...Unmarried.partner,Percent..RELATIONSHIP...Total.population...In.households...Nonrelatives...Unmarried.part
ner,Number..RELATIONSHIP...Total.population...In.group.quarters,Percent..RELATIONSHIP...Total.population...In.group.quarters,Number..RELATIONSHIP...Total.population...In.group.quarters...Institutionalized.population,Percent..RELATIONSHIP...Total.population...In.group.quarters...Institutionalized.population,Number..RELATIONSHIP...Total.population...In.group.quarters...Noninstitutionalized.population,Percent..RELATIONSHIP...Total.population...In.group.quarters...Noninstitutionalized.population,Number..HOUSEHOLDS.BY.TYPE...Total.households,Percent..HOUSEHOLDS.BY.TYPE...Total.households,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families.,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families.,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....With.own.children.under.18.years,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....With.own.children.under.18.years,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Married.couple.family,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Married.couple.family,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Married.couple.family...With.own.children.under.18.years,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Married.couple.family...With.own.children.under.18.years,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Female.householder..no.husband.present,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Female.householder..no.husband.present,Number..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Female.householder..no.husband.present...With.own.children.under.18.years,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Family.households..families....Female.householder..no.husband.present...With.own.children.under.18.years,Number..HO
USEHOLDS.BY.TYPE...Total.households...Nonfamily.households,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Nonfamily.households,Number..HOUSEHOLDS.BY.TYPE...Total.households...Nonfamily.households...Householder.living.alone,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Nonfamily.households...Householder.living.alone,Number..HOUSEHOLDS.BY.TYPE...Total.households...Nonfamily.households...Householder.living.alone...Householder.65.years.and.over,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Nonfamily.households...Householder.living.alone...Householder.65.years.and.over,Number..HOUSEHOLDS.BY.TYPE...Total.households...Households.with.individuals.under.18.years,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Households.with.individuals.under.18.years,Number..HOUSEHOLDS.BY.TYPE...Total.households...Households.with.individuals.65.years.and.over,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Households.with.individuals.65.years.and.over,Number..HOUSEHOLDS.BY.TYPE...Total.households...Average.household.size,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Average.household.size,Number..HOUSEHOLDS.BY.TYPE...Total.households...Average.family.size,Percent..HOUSEHOLDS.BY.TYPE...Total.households...Average.family.size,Number..HOUSING.OCCUPANCY...Total.housing.units,Percent..HOUSING.OCCUPANCY...Total.housing.units,Number..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units,Percent..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units,Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units,Percent..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units,Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units...For.seasonal..recreational..or.occasional.use,Percent..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units...For.seasonal..recreational..or.occasional.use,Number..HOUSING.OCCUPANCY...Total.housing.units...Homeowner.vacancy.rate..percent.,Percent..HOUSING.OCCUPANCY...Total.housing.units...Homeowner
.vacancy.rate..percent.,Number..HOUSING.OCCUPANCY...Total.housing.units...Rental.vacancy.rate..percent.,Percent..HOUSING.OCCUPANCY...Total.housing.units...Rental.vacancy.rate..percent.,Number..HOUSING.TENURE...Occupied.housing.units,Percent..HOUSING.TENURE...Occupied.housing.units,Number..HOUSING.TENURE...Occupied.housing.units...Owner.occupied.housing.units,Percent..HOUSING.TENURE...Occupied.housing.units...Owner.occupied.housing.units,Number..HOUSING.TENURE...Occupied.housing.units...Renter.occupied.housing.units,Percent..HOUSING.TENURE...Occupied.housing.units...Renter.occupied.housing.units,Number..HOUSING.TENURE...Occupied.housing.units...Average.household.size.of.owner.occupied.unit,Percent..HOUSING.TENURE...Occupied.housing.units...Average.household.size.of.owner.occupied.unit,Number..HOUSING.TENURE...Occupied.housing.units...Average.household.size.of.renter.occupied.unit,Percent..HOUSING.TENURE...Occupied.housing.units...Average.household.size.of.renter.occupied.unit
1,1400000US22001960100,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,100,2920,47,3268,53,462,8,502,8,541,9,572,9,375,6,728,12,913,15,699,11,301,5,252,4,433,7,287,5,123,2,34,(X),4304,70,1957,32,2347,38,4031,65,996,16,843,14,295,5,548,9,6174,100,4455,72,1675,27,12,0,7,0,0,0,0,0,2,0,0,0,1,0,0,0,4,0,0,0,0,0,0,0,0,0,0,0,25,0,14,0,4468,72,1677,27,19,0,8,0,0,0,30,1,6188,100,87,1,51,1,1,0,0,0,35,1,6101,99,4398,71,6188,100,6030,97,2236,36,1119,18,2199,36,1700,28,274,4,143,2,202,3,109,2,158,3,151,2,7,0,2236,100,1595,71,868,39,1119,50,573,26,363,16,237,11,641,29,585,26,303,14,962,43,569,25,3,(X),3,(X),2410,100,2236,93,174,7,15,1,1,(X),8,(X),2236,100,1526,68,710,32,3,(X),3,(X)
2,1400000US22001960200,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,100,2562,51,2494,49,346,7,416,8,476,9,463,9,298,6,579,12,861,17,709,14,250,5,203,4,263,5,150,3,42,1,34,(X),3527,70,1758,35,1769,35,3289,65,570,11,455,9,217,4,238,5,5035,100,4799,95,216,4,6,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,13,0,21,0,4816,95,226,5,13,0,6,0,4,0,16,0,5056,100,35,1,18,0,0,0,0,0,17,0,5021,99,4775,94,5056,100,5056,100,1764,35,1216,24,1791,35,1413,28,173,3,86,2,112,2,61,1,0,0,0,0,0,0,1764,100,1408,80,722,41,1216,69,617,35,134,8,81,5,356,20,310,18,128,7,781,44,339,19,3,(X),3,(X),1909,100,1764,92,145,8,31,2,1,(X),7,(X),1764,100,1461,83,303,17,3,(X),3,(X)
3,1400000US22001960300,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,100,1593,51,1556,49,209,7,251,8,305,10,260,8,204,7,368,12,520,17,409,13,148,5,130,4,209,7,104,3,32,1,35,(X),2233,71,1103,35,1130,36,2081,66,435,14,345,11,150,5,195,6,3140,100,3058,97,67,2,8,0,2,0,0,0,1,0,0,0,0,0,0,0,0,0,1,0,1,0,1,0,0,0,0,0,0,0,4,0,9,0,3066,97,69,2,13,0,2,0,1,0,7,0,3149,100,15,1,4,0,0,0,0,0,11,0,3134,100,3049,97,3149,100,3148,100,1145,36,750,24,1091,35,854,27,73,2,38,1,89,3,48,2,1,0,0,0,1,0,1145,100,883,77,445,39,750,66,369,32,93,8,52,5,262,23,228,20,91,8,475,42,247,22,3,(X),3,(X),1246,100,1145,92,101,8,19,2,1,(X),7,(X),1145,100,1041,91,104,9,3,(X),3,(X)
4,1400000US22001960400,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,100,2754,49,2863,51,429,8,406,7,520,9,476,9,353,6,691,12,914,16,684,12,254,5,222,4,410,7,193,3,65,1,34,(X),3944,70,1911,34,2033,36,3716,66,800,14,668,12,302,5,366,7,5583,99,5347,95,207,4,18,0,6,0,1,0,1,0,4,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5,0,34,1,5381,96,211,4,43,1,8,0,0,0,9,0,5617,100,43,1,24,0,0,0,0,0,19,0,5574,99,5307,95,5617,100,5592,100,1991,35,1291,23,1994,36,1554,28,196,4,101,2,120,2,70,1,25,0,10,0,15,0,1991,100,1555,78,804,40,1291,65,641,32,168,8,99,5,436,22,388,20,189,10,861,43,482,24,3,(X),3,(X),2176,100,1991,92,185,9,23,1,1,(X),6,(X),1991,100,1630,82,361,18,3,(X),3,(X)
5,1400000US22001960500,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,100,2461,50,2466,50,400,8,438,9,439,9,418,9,319,7,704,14,777,16,644,13,227,5,154,3,234,5,134,3,39,1,32,(X),3405,69,1675,34,1730,35,3162,64,499,10,407,8,167,3,240,5,4901,100,4498,91,378,8,15,0,4,0,0,0,0,0,1,0,0,0,0,0,3,0,0,0,1,0,0,0,0,0,1,0,0,0,5,0,26,1,4524,92,385,8,35,1,4,0,1,0,9,0,4927,100,61,1,37,1,1,0,0,0,23,1,4866,99,4448,90,4927,100,4921,100,1692,34,1068,22,1809,37,1418,29,174,4,71,1,178,4,100,2,6,0,0,0,6,0,1692,100,1326,78,762,45,1068,63,611,36,182,11,103,6,366,22,300,18,136,8,808,48,316,19,3,(X),3,(X),1796,100,1692,94,104,6,22,1,1,(X),5,(X),1692,100,1419,84,273,16,3,(X),3,(X)
6,1400000US22001960600,22001960600,"Census Tract 9606, Acadia Parish, Louisiana",5654,100,2647,47,3007,53,464,8,471,8,442,8,460,8,358,6,760,13,871,15,615,11,243,4,209,4,415,7,241,4,105,2,33,(X),3999,71,1791,32,2208,39,3736,66,869,15,761,14,271,5,490,9,5620,99,4809,85,782,14,7,0,12,0,0,0,3,0,2,0,0,0,1,0,6,0,0,0,0,0,0,0,0,0,0,0,0,0,10,0,34,1,4842,86,792,14,18,0,15,0,0,0,21,0,5654,100,49,1,28,1,0,0,0,0,21,0,5605,99,4774,84,5654,100,5526,98,2073,37,1076,19,1891,33,1478,26,289,5,160,3,197,4,110,2,128,2,128,2,0,0,2073,100,1477,71,796,38,1076,52,541,26,310,15,189,9,596,29,521,25,243,12,882,43,510,25,3,(X),3,(X),2292,100,2073,90,219,10,11,1,1,(X),14,(X),2073,100,1474,71,599,29,3,(X),3,(X)


We don't need all of the variables, since we only care about a handful. Let's take a look at what's in there.

In [5]:
names(census2000)

Let's keep a handful of these: 

* `Id2`: this is what the Census Bureau calls a FIPS code, a unique numerical identifier for each census tract. It will be important when we join our two datasets together.  
 
 
* `Geography`: this is a text description of the tract, with the parish name.  
 
 
* `Number..Total.population`: the total population of the tract.  
 
 
* `Number..HOUSING.OCCUPANCY...Total.housing.units`, `Number..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units`, and `Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units`: the total, occupied, and vacant housing units.

In [6]:
library('dplyr')

census2000.trim <- select(
    census2000, # name of the data frame
    # list of all the column names we want to keep
    Id2, Geography, Number..Total.population, 
    Number..HOUSING.OCCUPANCY...Total.housing.units, 
    Number..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units, 
    Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units
)

head(census2000.trim)

Warning message: package ‘dplyr’ was built under R version 3.2.4
Attaching package: ‘dplyr’

The following objects are masked from ‘package:stats’:

    filter, lag

The following objects are masked from ‘package:base’:

    intersect, setdiff, setequal, union



Unnamed: 0,Id2,Geography,Number..Total.population,Number..HOUSING.OCCUPANCY...Total.housing.units,Number..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units,Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units
1,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,2410,2236,174
2,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,1909,1764,145
3,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,1246,1145,101
4,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,2176,1991,185
5,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,1796,1692,104
6,22001960600,"Census Tract 9606, Acadia Parish, Louisiana",5654,2292,2073,219


These column names are long and unwieldy. Since we are going to be referring to them often, let's rename them to more convenient, shorter versions.

In [7]:
colnames(census2000.trim) <- c('fips.code', 'census.tract', 'population', 
                               'total.housing.units', 'occupied.housing.units', 'vacant.housing.units')
head(census2000.trim)

Unnamed: 0,fips.code,census.tract,population,total.housing.units,occupied.housing.units,vacant.housing.units
1,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,2410,2236,174
2,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,1909,1764,145
3,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,1246,1145,101
4,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,2176,1991,185
5,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,1796,1692,104
6,22001960600,"Census Tract 9606, Acadia Parish, Louisiana",5654,2292,2073,219


In [8]:
# check how R stored each column: the counts came in as factors (text with commas), not numbers
str(census2000.trim)

'data.frame':	1106 obs. of  6 variables:
 $ fips.code             : num  2.2e+10 2.2e+10 2.2e+10 2.2e+10 2.2e+10 ...
 $ census.tract          : Factor w/ 1106 levels "Census Tract 10.01, Lafayette Parish, Louisiana",..: 970 978 985 993 1000 1006 1011 1015 1018 1021 ...
 $ population            : Factor w/ 1019 levels "0","1","10,248",..: 859 710 350 801 692 806 647 804 711 842 ...
 $ total.housing.units   : Factor w/ 905 levels "1","10","1,002",..: 565 404 104 499 357 531 397 522 426 594 ...
 $ occupied.housing.units: Factor w/ 876 levels "0","1","1,001",..: 512 363 74 446 338 468 347 470 374 514 ...
 $ vacant.housing.units  : Factor w/ 374 levels "0","1","10","100",..: 86 54 5 98 9 134 83 86 110 196 ...
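The `str()` output above shows the population and housing counts stored as factors, because values such as "10,248" contain commas. Here is a minimal sketch, using made-up values rather than the census file, of why the commas have to be cleaned out of the text before converting: calling `as.numeric()` directly on a factor returns the internal level codes, not the numbers you see printed.

```r
# Toy values (not from the census file), stored as a factor
pop <- factor(c("6,188", "10,248", "987"))

# Pitfall: as.numeric() on a factor returns the internal level codes
codes <- as.numeric(pop)
print(codes)

# Safer: convert to character, drop the commas, then convert to numbers
clean <- as.numeric(gsub(",", "", as.character(pop), fixed = TRUE))
print(clean)
```

The cleaning cells in this notebook follow the second route: the text is cleaned first and only then passed to `as.numeric()`.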


In [9]:
library('stringr')

Warning message: package ‘stringr’ was built under R version 3.2.5

In [10]:
# remove the thousands separator so that a value like "10,248" becomes "10248"
census2000.trim$population <- str_replace(census2000.trim$population, pattern = ',', replacement = '')

In [11]:
head(census2000.trim)

Unnamed: 0,fips.code,census.tract,population,total.housing.units,occupied.housing.units,vacant.housing.units
1,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,2410,2236,174
2,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,1909,1764,145
3,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,1246,1145,101
4,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,2176,1991,185
5,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,1796,1692,104
6,22001960600,"Census Tract 9606, Acadia Parish, Louisiana",5654,2292,2073,219


In [12]:
# do the same for the three housing columns
census2000.trim$total.housing.units <- str_replace(census2000.trim$total.housing.units, pattern = ',', replacement = '')
census2000.trim$occupied.housing.units <- str_replace(census2000.trim$occupied.housing.units, pattern = ',', replacement = '')
census2000.trim$vacant.housing.units <- str_replace(census2000.trim$vacant.housing.units, pattern = ',', replacement = '')
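One caveat worth knowing: `str_replace()` changes only the first match in each string, while stringr's `str_replace_all()` changes every match. The first is enough when each value has at most one comma, which is what the tract-level values shown here look like. The sketch below uses base R's `sub()` and `gsub()`, which behave analogously, on made-up values to show the difference.

```r
# Made-up values: one with a single comma, one with two
big <- c("6,188", "1,234,567")

# sub() replaces only the first comma in each string
first_only <- sub(",", "", big, fixed = TRUE)
print(first_only)

# gsub() replaces every comma
all_commas <- gsub(",", "", big, fixed = TRUE)
print(all_commas)

# only the fully cleaned text converts without producing NA
print(as.numeric(all_commas))
```

If you reuse this cleaning step on data with larger numbers, such as state or national totals, the replace-all variant is the safer choice.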

In [13]:
# convert the cleaned text to numbers, then check the structure again
census2000.trim$population <- as.numeric(census2000.trim$population)
str(census2000.trim)

'data.frame':	1106 obs. of  6 variables:
 $ fips.code             : num  2.2e+10 2.2e+10 2.2e+10 2.2e+10 2.2e+10 ...
 $ census.tract          : Factor w/ 1106 levels "Census Tract 10.01, Lafayette Parish, Louisiana",..: 970 978 985 993 1000 1006 1011 1015 1018 1021 ...
 $ population            : num  6188 5056 3149 5617 4927 ...
 $ total.housing.units   : chr  "2410" "1909" "1246" "2176" ...
 $ occupied.housing.units: chr  "2236" "1764" "1145" "1991" ...
 $ vacant.housing.units  : chr  "174" "145" "101" "185" ...


In [14]:
# convert the three housing columns to numbers as well
census2000.trim$total.housing.units <- as.numeric(census2000.trim$total.housing.units)
census2000.trim$occupied.housing.units <- as.numeric(census2000.trim$occupied.housing.units)
census2000.trim$vacant.housing.units <- as.numeric(census2000.trim$vacant.housing.units)
str(census2000.trim)

'data.frame':	1106 obs. of  6 variables:
 $ fips.code             : num  2.2e+10 2.2e+10 2.2e+10 2.2e+10 2.2e+10 ...
 $ census.tract          : Factor w/ 1106 levels "Census Tract 10.01, Lafayette Parish, Louisiana",..: 970 978 985 993 1000 1006 1011 1015 1018 1021 ...
 $ population            : num  6188 5056 3149 5617 4927 ...
 $ total.housing.units   : num  2410 1909 1246 2176 1796 ...
 $ occupied.housing.units: num  2236 1764 1145 1991 1692 ...
 $ vacant.housing.units  : num  174 145 101 185 104 219 171 174 196 284 ...


In [15]:
head(census2000.trim)

Unnamed: 0,fips.code,census.tract,population,total.housing.units,occupied.housing.units,vacant.housing.units
1,22001960100,"Census Tract 9601, Acadia Parish, Louisiana",6188,2410,2236,174
2,22001960200,"Census Tract 9602, Acadia Parish, Louisiana",5056,1909,1764,145
3,22001960300,"Census Tract 9603, Acadia Parish, Louisiana",3149,1246,1145,101
4,22001960400,"Census Tract 9604, Acadia Parish, Louisiana",5617,2176,1991,185
5,22001960500,"Census Tract 9605, Acadia Parish, Louisiana",4927,1796,1692,104
6,22001960600,"Census Tract 9606, Acadia Parish, Louisiana",5654,2292,2073,219


In [16]:
library('tidyr')

# split census.tract on ', ' into three new columns: tract, parish and state
census2000.trim <- separate(census2000.trim, census.tract, c('tract', 'parish', 'state'), ', ')

In [17]:
head(census2000.trim)

Unnamed: 0,fips.code,tract,parish,state,population,total.housing.units,occupied.housing.units,vacant.housing.units
1,22001960100,Census Tract 9601,Acadia Parish,Louisiana,6188,2410,2236,174
2,22001960200,Census Tract 9602,Acadia Parish,Louisiana,5056,1909,1764,145
3,22001960300,Census Tract 9603,Acadia Parish,Louisiana,3149,1246,1145,101
4,22001960400,Census Tract 9604,Acadia Parish,Louisiana,5617,2176,1991,185
5,22001960500,Census Tract 9605,Acadia Parish,Louisiana,4927,1796,1692,104
6,22001960600,Census Tract 9606,Acadia Parish,Louisiana,5654,2292,2073,219
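To see what `separate()` is doing, here is a rough base R sketch of the same idea, using two labels of the kind that appear in this dataset. The names `labels`, `parts` and `pieces` are only for illustration. Each label is split on the comma-plus-space separator, and the pieces become three columns.

```r
# Two labels in the same "tract, parish, state" format as the census file
labels <- c("Census Tract 9601, Acadia Parish, Louisiana",
            "Census Tract 1, Orleans Parish, Louisiana")

# Split each label on ", " and stack the pieces into a matrix, one row per label
parts <- do.call(rbind, strsplit(labels, ", ", fixed = TRUE))

# Name the three pieces, as separate() does with its list of new column names
pieces <- data.frame(tract = parts[, 1],
                     parish = parts[, 2],
                     state = parts[, 3],
                     stringsAsFactors = FALSE)
print(pieces)
```

This only works cleanly because every label has exactly two separators. A label with an extra comma would need special handling.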


In [18]:
# count the number of census tracts in each parish
table(census2000.trim$parish)


              Acadia Parish                Allen Parish 
                         12                           5 
           Ascension Parish           Assumption Parish 
                         14                           6 
           Avoyelles Parish           Beauregard Parish 
                          9                           7 
           Bienville Parish              Bossier Parish 
                          5                          19 
               Caddo Parish            Calcasieu Parish 
                         64                          41 
            Caldwell Parish              Cameron Parish 
                          3                           2 
           Catahoula Parish            Claiborne Parish 
                          3                           5 
           Concordia Parish              De Soto Parish 
                          5                           7 
    East Baton Rouge Parish         East Carroll Parish 
                         89   

In [19]:
# keep only the tracts in Orleans Parish, which represents New Orleans
orleans2000 <- filter(census2000.trim, parish == 'Orleans Parish')
head(orleans2000)

Unnamed: 0,fips.code,tract,parish,state,population,total.housing.units,occupied.housing.units,vacant.housing.units
1,22071000000.0,Census Tract 1,Orleans Parish,Louisiana,2381,1408,1145,263
2,22071000000.0,Census Tract 2,Orleans Parish,Louisiana,1347,691,496,195
3,22071000000.0,Census Tract 3,Orleans Parish,Louisiana,1468,719,559,160
4,22071000000.0,Census Tract 4,Orleans Parish,Louisiana,2564,1034,873,161
5,22071000000.0,Census Tract 6.01,Orleans Parish,Louisiana,2034,704,506,198
6,22071000000.0,Census Tract 6.02,Orleans Parish,Louisiana,2957,1106,1011,95


In [20]:
# total population of Orleans Parish in 2000
sum(orleans2000$population)
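A quick check that can save confusion at this step: if any value failed to convert to a number earlier, `as.numeric()` will have turned it into `NA`, and `sum()` then returns `NA` for the whole column. The sketch below uses made-up values, with "(X)" standing in for a non-numeric marker, to show how to count missing values and sum around them.

```r
# Made-up values; "(X)" stands in for a marker that is not a number
vals <- c("2381", "1347", "(X)")

# The non-numeric entry becomes NA (the conversion warning is silenced here)
nums <- suppressWarnings(as.numeric(vals))

# A single NA makes the plain sum NA
print(sum(nums))

# Count the missing values, then sum the rest
n_missing <- sum(is.na(nums))
total <- sum(nums, na.rm = TRUE)
print(n_missing)
print(total)
```

Whether dropping missing values is appropriate depends on why they are missing, so it is worth looking at those rows before relying on the total.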

In [21]:
# skip = 1 skips the first line of the file, so the descriptive second header row supplies the column names
census2010 <- read.csv('2010_census_demographic_profile.csv', skip = 1)

In [22]:
census2010.trim <- select(
  census2010, # name of the data frame
  # list of all the column names we want to keep
  Id2, Geography, Number..SEX.AND.AGE...Total.population, 
  Number..HOUSING.OCCUPANCY...Total.housing.units, 
  Number..HOUSING.OCCUPANCY...Total.housing.units...Occupied.housing.units, 
  Number..HOUSING.OCCUPANCY...Total.housing.units...Vacant.housing.units
)

colnames(census2010.trim) <- c('fips.code', 'census.tract', 'population', 
                               'total.housing.units', 'occupied.housing.units', 'vacant.housing.units')

# remove the thousands separators
census2010.trim$population <- str_replace(census2010.trim$population, pattern = ',', replacement = '')
census2010.trim$total.housing.units <- str_replace(census2010.trim$total.housing.units, pattern = ',', replacement = '')
census2010.trim$occupied.housing.units <- str_replace(census2010.trim$occupied.housing.units, pattern = ',', replacement = '')
census2010.trim$vacant.housing.units <- str_replace(census2010.trim$vacant.housing.units, pattern = ',', replacement = '')

# convert the cleaned text to numbers
census2010.trim$population <- as.numeric(census2010.trim$population)
census2010.trim$total.housing.units <- as.numeric(census2010.trim$total.housing.units)
census2010.trim$occupied.housing.units <- as.numeric(census2010.trim$occupied.housing.units)
census2010.trim$vacant.housing.units <- as.numeric(census2010.trim$vacant.housing.units)

# split the tract description into tract, parish and state
census2010.trim <- separate(census2010.trim, census.tract, c('tract', 'parish', 'state'), ', ')

# keep only the tracts in Orleans Parish
orleans2010 <- filter(census2010.trim, parish == 'Orleans Parish')

In [23]:
# total population of Orleans Parish in 2010
sum(orleans2010$population)

In [29]:
# store the two parish totals
nola2000pop <- sum(orleans2000$population)
nola2010pop <- sum(orleans2010$population)

# percent change = (new - old) / old * 100
perc.change.nola <- (nola2010pop - nola2000pop) / nola2000pop * 100

In [37]:
print(paste('The change in New Orleans population since 2000 is', round(perc.change.nola), '%'))

[1] "The change in New Orleans population since 2000 is -29 %"
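The calculation above is the standard percent-change formula, (new - old) / old * 100. As a small sketch, it can be wrapped in a helper function so it is easy to reuse for other columns. The name `pct_change` and the numbers below are made up for illustration.

```r
# Hypothetical helper: percent change from an old value to a new one
pct_change <- function(old, new) {
  (new - old) / old * 100
}

# A drop from 200 to 150 is a 25 percent decrease
single <- pct_change(200, 150)
print(single)

# The function is vectorized, so it also works on whole columns at once
several <- pct_change(c(200, 80), c(150, 100))
print(several)
```

A negative result means a decrease relative to the starting value, which is how the -29 printed above should be read.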


In [47]:
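# join the two years on the tract identifiers; all = TRUE keeps tracts that appear in only one year
census.comparison <- merge(census2000.trim, census2010.trim, 
                           by = c('fips.code', 'tract', 'parish', 'state'), 
                           suffixes = c('.00', '.10'), 
                           all = TRUE)
head(census.comparison)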

Unnamed: 0,fips.code,tract,parish,state,population.00,total.housing.units.00,occupied.housing.units.00,vacant.housing.units.00,population.10,total.housing.units.10,occupied.housing.units.10,vacant.housing.units.10
1,22001960100,Census Tract 9601,Acadia Parish,Louisiana,6188,2410,2236,174,6213,2574,2345,229
2,22001960200,Census Tract 9602,Acadia Parish,Louisiana,5056,1909,1764,145,5988,2362,2144,218
3,22001960300,Census Tract 9603,Acadia Parish,Louisiana,3149,1246,1145,101,3582,1427,1286,141
4,22001960400,Census Tract 9604,Acadia Parish,Louisiana,5617,2176,1991,185,6584,2604,2362,242
5,22001960500,Census Tract 9605,Acadia Parish,Louisiana,4927,1796,1692,104,6093,2349,2178,171
6,22001960600,Census Tract 9606,Acadia Parish,Louisiana,5654,2292,2073,219,5972,2504,2306,198
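The `all = TRUE` argument makes this a full outer join: rows found in only one of the two data frames are kept, with `NA` filled in for the columns from the other. Here is a minimal sketch with two tiny made-up data frames (the identifiers and populations are invented) showing what that looks like and how to list the unmatched rows.

```r
# Made-up data: identifier 1 exists only in 2000, identifier 3 only in 2010
y2000 <- data.frame(fips.code = c(1, 2), population = c(100, 200))
y2010 <- data.frame(fips.code = c(2, 3), population = c(150, 300))

# Full outer join; the suffixes tell the two population columns apart
both <- merge(y2000, y2010, by = "fips.code",
              suffixes = c(".00", ".10"), all = TRUE)
print(both)

# Rows that did not match in both years have an NA on one side
unmatched <- both[is.na(both$population.00) | is.na(both$population.10), ]
print(unmatched)
```

Running the same `is.na()` check on `census.comparison` would show which tracts, if any, could not be matched across the two censuses.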


In [48]:
# save the merged data; row.names = FALSE leaves out the column of row numbers
write.csv(census.comparison, 'census_comparison.csv', row.names = FALSE)