## Milwaukee Housing Data - Larissa Kidd 11/28/2023
### From https://data.milwaukee.gov/dataset/property-sales-data
Every year, the city of Milwaukee publishes a data file listing the properties sold during the previous year. I want to compare the 2002 sales against the 2022 sales to see what changed over those 20 years.

Background information about the variables (dataset: Milwaukee historical housing sales). The field names below follow the 2022 file; the 2002 and 2012 files use Taxkey, Nbhd, Nr_of_rms, and Fin_sqft for the corresponding columns and have no PropertyID.
Field Name = Description
PropertyID = A unique identifier for each property.
PropType = The type of property (e.g., Commercial or Residential).
taxkey = The tax key associated with the property.
Address = The address of the property.
CondoProject = Information about whether the property is part of a condominium project (NaN indicates missing data).
District = The district number for the property.
nbhd = The neighborhood number for the property.
Style = The architectural style of the property.
Extwall = The type of exterior wall material used.
Stories = The number of stories in the building.
Year_Built = The year the property was built.
Rooms = The number of rooms in the property.
FinishedSqft = The total square footage of finished space in the property.
Units = The number of units in the property (e.g., apartments in a multifamily building).
Bdrms = The number of bedrooms in the property.
Fbath = The number of full bathrooms in the property.
Hbath = The number of half bathrooms in the property.
Lotsize = The size of the lot associated with the property.
Sale_date = The date when the property was sold.
Sale_price = The sale price of the property.

In [55]:
from matplotlib import pyplot as plt
import pandas as pd
import numpy as np

In [56]:
data02 = pd.read_csv('2002-property-sales-data.csv')
data12 = pd.read_csv('2012-property-sales-data.csv')
data22 = pd.read_csv('2022-property-sales-data.csv')
data02.head()

Unnamed: 0,PropType,Taxkey,Address,CondoProject,District,Nbhd,Style,Extwall,Stories,Year_Built,Nr_of_rms,Fin_sqft,Units,Bdrms,Fbath,Hbath,Lotsize,Sale_date,Sale_price
0,Commercial,3230461110,2628 N 6TH ST,,6,6258,Commercial Exempt,,2.0,1880,0,1840,1,0,0,0,12750,2002-01,15900
1,Commercial,3590192000,1363 N PROSPECT AV,,3,6262,Mansions With Commercial Usage,,2.0,1876,0,6377,1,0,0,0,11840,2002-01,850000
2,Commercial,4161194000,617 S 94TH ST,,10,6272,Service Building,,1.0,1954,0,5022,1,0,0,0,9700,2002-01,119000
3,Commercial,1719836000,3624 W SILVER SPRING DR,,1,6218,"Store Bldg - Multi Story (Store & Apt, Store & O",,2.0,1955,0,6420,1,0,0,0,8792,2002-01,210000
4,Commercial,3480290000,3830 W LISBON AV,,15,6254,"Store Bldg - Multi Story (Store & Apt, Store & O",,2.0,1909,0,5956,1,0,0,0,4840,2002-01,48500


In [57]:
data12.head()

Unnamed: 0,PropType,Taxkey,Address,CondoProject,District,Nbhd,Style,Extwall,Stories,Year_Built,Nr_of_rms,Fin_sqft,Units,Bdrms,Fbath,Hbath,Lotsize,Sale_date,Sale_price
0,Residential,1930715000,5271 N TEUTONIA AV,,1,1160,Cape Cod,Aluminum / Vinyl,1.0,1940,0,1206,1,3,1,0,5880,2012-01,35000
1,Residential,2081312000,5138 N 40TH ST,,1,1440,Ranch,Aluminum / Vinyl,1.0,1952,0,908,1,2,1,0,3270,2012-01,34507
2,Residential,2260831000,4463 N 66TH ST,,2,1490,Cape Cod,Brick,1.0,1953,0,1512,1,3,2,0,5250,2012-01,112900
3,Residential,2120137000,7009 W VILLARD AV,,2,1380,Cape Cod,Aluminum / Vinyl,1.0,1956,0,1326,1,4,1,0,4800,2012-01,67000
4,Residential,1440202000,9131 W EDGEWATER DR,,2,480,Colonial,Aluminum / Vinyl,2.0,2003,0,1991,1,4,2,1,21571,2012-01,194800


In [58]:
data22.head()

Unnamed: 0,PropertyID,PropType,taxkey,Address,CondoProject,District,nbhd,Style,Extwall,Stories,Year_Built,Rooms,FinishedSqft,Units,Bdrms,Fbath,Hbath,Lotsize,Sale_date,Sale_price
0,98461,Manufacturing,30131000,9434-9446 N 107TH ST,,9,6300,Service Building,Concrete Block,1.0,1978.0,,20600.0,6,,0,0,0,2022-04-01,950000.0
1,98464,Commercial,30152000,9306-9316 N 107TH ST,,9,6202,Office Building - 1 Story,Brick,1.0,1982.0,,9688.0,23,,0,0,35719,2022-10-07,385000.0
2,98508,Residential,49980110,9327 N SWAN RD,,9,40,,,,,,,0,,0,0,1341648,2022-01-07,800000.0
3,98519,Residential,49993200,9411 W COUNTY LINE RD,,9,40,Ranch,Aluminum/Vinyl,1.0,1959.0,6.0,1334.0,1,3.0,1,1,83200,2022-08-09,280000.0
4,98561,Residential,50042000,9322 N JOYCE AV,,9,40,Ranch,Aluminum/Vinyl,1.0,1980.0,10.0,1006.0,1,6.0,1,0,8303,2022-05-23,233100.0


In [68]:
data22 = data22.rename(columns = {'taxkey':'Taxkey','nbhd':'Nbhd','Rooms':'Nr_of_rms'})
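
Before concatenating, it can help to check that the renamed 2022 frame really has the same column names as the older files. The sketch below uses empty stand-in frames carrying only a few of the column names seen in the outputs above, not the real CSV files, and the helper name `column_mismatches` is hypothetical. With the rename as written, it reports that `FinishedSqft` and `PropertyID` still have no counterpart in the older layout.

```python
import pandas as pd

def column_mismatches(reference, other):
    """Return (columns only in reference, columns only in other), each sorted."""
    ref_cols, other_cols = set(reference.columns), set(other.columns)
    return sorted(ref_cols - other_cols), sorted(other_cols - ref_cols)

# Stand-in frames: a subset of the column names from the head() outputs above.
old_style = pd.DataFrame(columns=['Taxkey', 'Nbhd', 'Nr_of_rms', 'Fin_sqft'])
new_style = pd.DataFrame(columns=['PropertyID', 'taxkey', 'nbhd', 'Rooms', 'FinishedSqft'])

# Same mapping as the rename in the cell above.
renamed = new_style.rename(columns={'taxkey': 'Taxkey', 'nbhd': 'Nbhd', 'Rooms': 'Nr_of_rms'})

missing, extra = column_mismatches(old_style, renamed)
print(missing)  # columns the older layout has that the renamed frame lacks
print(extra)    # columns only the renamed frame has
```

Any name that appears in either list will end up as a separate, partly empty column after `pd.concat`.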

In [60]:
data02.shape

(170, 19)

Keep the shape of data02 in mind: with only 170 rows, any 2002 figures are based on far less data than the figures for the later years.

In [61]:
data12.shape

(1996, 19)

In [62]:
data22.shape

(7146, 20)
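
To see the size difference at a glance, the three shapes can be collected into one small table. This is only a sketch: the counts are typed in from the `.shape` outputs above rather than read from the frames, and the `share_of_rows` column name is my own.

```python
import pandas as pd

# Row and column counts copied from the .shape outputs above.
shapes = {'2002': (170, 19), '2012': (1996, 19), '2022': (7146, 20)}

summary = pd.DataFrame(
    list(shapes.values()),
    index=list(shapes.keys()),
    columns=['rows', 'columns'],
)
# Fraction of all loaded sales that each year contributes.
summary['share_of_rows'] = summary['rows'] / summary['rows'].sum()
print(summary)
```

The 2002 file contributes under 2% of the rows, which is why its averages are much noisier than those for 2022.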

In [69]:
data22 = data22.drop(columns = 'PropertyID')

In [70]:
full_df = pd.concat([data02, data12, data22], keys=['2002 Housing Data','2012 Housing Data','2022 Housing Data'])
full_df

Unnamed: 0,Unnamed: 1,PropType,Taxkey,Address,CondoProject,District,Nbhd,Style,Extwall,Stories,Year_Built,Nr_of_rms,Fin_sqft,Units,Bdrms,Fbath,Hbath,Lotsize,Sale_date,Sale_price,FinishedSqft
2002 Housing Data,0,Commercial,3230461110,2628 N 6TH ST,,6,6258,Commercial Exempt,,2.0,1880.0,0.0,1840.0,1,0.0,0,0,12750,2002-01,15900.0,
2002 Housing Data,1,Commercial,3590192000,1363 N PROSPECT AV,,3,6262,Mansions With Commercial Usage,,2.0,1876.0,0.0,6377.0,1,0.0,0,0,11840,2002-01,850000.0,
2002 Housing Data,2,Commercial,4161194000,617 S 94TH ST,,10,6272,Service Building,,1.0,1954.0,0.0,5022.0,1,0.0,0,0,9700,2002-01,119000.0,
2002 Housing Data,3,Commercial,1719836000,3624 W SILVER SPRING DR,,1,6218,"Store Bldg - Multi Story (Store & Apt, Store & O",,2.0,1955.0,0.0,6420.0,1,0.0,0,0,8792,2002-01,210000.0,
2002 Housing Data,4,Commercial,3480290000,3830 W LISBON AV,,15,6254,"Store Bldg - Multi Story (Store & Apt, Store & O",,2.0,1909.0,0.0,5956.0,1,0.0,0,0,4840,2002-01,48500.0,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
2022 Housing Data,7141,Condominium,7160339000,1912 W SALEM ST,COLLEGE HEIGHTS,13,5360,Low Rise 1-3 Stories,,2.0,1974.0,10.0,,1,4.0,1,1,1,2022-03-11,125900.0,1100.0
2022 Housing Data,7142,Condominium,7160351000,6316 S 20TH ST,COLLEGE HEIGHTS,13,5360,Low Rise 1-3 Stories,,1.0,1974.0,5.0,,1,2.0,1,1,1,2022-10-28,150000.0,1379.0
2022 Housing Data,7143,Condominium,7160365000,6376 S 20TH ST,COLLEGE HEIGHTS,13,5360,Low Rise 1-3 Stories,,2.0,1974.0,10.0,,1,4.0,1,1,1,2022-03-15,130000.0,1100.0
2022 Housing Data,7144,Condominium,7160366000,6378 S 20TH ST,COLLEGE HEIGHTS,13,5360,Low Rise 1-3 Stories,,2.0,1974.0,5.0,,1,2.0,1,1,1,2022-12-30,123000.0,1100.0
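
The concatenated frame contains both `Fin_sqft` and `FinishedSqft`: the 2022 column was not renamed to match the older files, so each row has a value in only one of the two. One possible way to merge them afterwards is sketched below on a small stand-in frame (square footage values taken from rows shown above); adding `'FinishedSqft': 'Fin_sqft'` to the rename before concatenating would be an alternative.

```python
import numpy as np
import pandas as pd

# Stand-in for full_df: two 2002-style rows and two 2022-style rows.
demo = pd.DataFrame({
    'Fin_sqft': [1840.0, 6377.0, np.nan, np.nan],
    'FinishedSqft': [np.nan, np.nan, 1100.0, 1379.0],
})

# Fill the gaps in Fin_sqft from FinishedSqft, then drop the duplicate column.
demo['Fin_sqft'] = demo['Fin_sqft'].fillna(demo['FinishedSqft'])
demo = demo.drop(columns='FinishedSqft')
print(demo)
```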


Some columns contain many NaN values, so I drop CondoProject and Extwall.

In [71]:
full_df = full_df.drop(columns = 'CondoProject')

In [72]:
full_df = full_df.drop(columns = 'Extwall')
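
One way to decide which columns to drop is to measure the share of missing values in each. The sketch below runs on a small stand-in frame, not the real `full_df`, and the 0.5 cutoff is an arbitrary assumption rather than a value taken from this notebook.

```python
import numpy as np
import pandas as pd

# Stand-in frame with a few values of the kind shown in the outputs above.
demo = pd.DataFrame({
    'CondoProject': [np.nan, np.nan, np.nan, 'COLLEGE HEIGHTS'],
    'Extwall': [np.nan, np.nan, 'Brick', np.nan],
    'Sale_price': [15900.0, 850000.0, 385000.0, 125900.0],
})

# isna() gives a boolean frame; the column mean is the fraction of NaN values.
nan_share = demo.isna().mean().sort_values(ascending=False)
print(nan_share)

# Drop every column whose NaN share exceeds the (assumed) cutoff.
mostly_missing = nan_share[nan_share > 0.5].index.tolist()
cleaned = demo.drop(columns=mostly_missing)
```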

In [73]:
full_df.head()

Unnamed: 0,Unnamed: 1,PropType,Taxkey,Address,District,Nbhd,Style,Stories,Year_Built,Nr_of_rms,Fin_sqft,Units,Bdrms,Fbath,Hbath,Lotsize,Sale_date,Sale_price,FinishedSqft
2002 Housing Data,0,Commercial,3230461110,2628 N 6TH ST,6,6258,Commercial Exempt,2.0,1880.0,0.0,1840.0,1,0.0,0,0,12750,2002-01,15900.0,
2002 Housing Data,1,Commercial,3590192000,1363 N PROSPECT AV,3,6262,Mansions With Commercial Usage,2.0,1876.0,0.0,6377.0,1,0.0,0,0,11840,2002-01,850000.0,
2002 Housing Data,2,Commercial,4161194000,617 S 94TH ST,10,6272,Service Building,1.0,1954.0,0.0,5022.0,1,0.0,0,0,9700,2002-01,119000.0,
2002 Housing Data,3,Commercial,1719836000,3624 W SILVER SPRING DR,1,6218,"Store Bldg - Multi Story (Store & Apt, Store & O",2.0,1955.0,0.0,6420.0,1,0.0,0,0,8792,2002-01,210000.0,
2002 Housing Data,4,Commercial,3480290000,3830 W LISBON AV,15,6254,"Store Bldg - Multi Story (Store & Apt, Store & O",2.0,1909.0,0.0,5956.0,1,0.0,0,0,4840,2002-01,48500.0,
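
Because `pd.concat` was called with `keys`, the year label is level 0 of the index, so per-year summaries can be computed with `groupby(level=0)`. The sketch below uses three sale prices per year copied from the `head()` outputs above, so the numbers it prints are illustrative only and say nothing about the full dataset.

```python
import pandas as pd

# A few Sale_price values copied from the head() outputs above (not the full data).
y2002 = pd.DataFrame({'Sale_price': [15900.0, 850000.0, 119000.0]})
y2022 = pd.DataFrame({'Sale_price': [800000.0, 280000.0, 233100.0]})
demo = pd.concat([y2002, y2022], keys=['2002 Housing Data', '2022 Housing Data'])

# Level 0 of the index is the key passed to pd.concat.
summary = demo.groupby(level=0)['Sale_price'].agg(['count', 'median'])
print(summary)
```

On the real data it would probably make sense to filter on `PropType` first, since the first 2002 rows are all commercial while many later rows are residential.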
