San Francisco housing construction history and associated data
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
R-scripts
census-rents
cities
cpi
employment
income
1920
1948
1949
1950
1951
1952
1953
1954
1955
1956
1957
1958
1959
1960
1961
1962
1963
1964
1965
1966
1967
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1984
2001
2002
2003
2br-vs-overall
README.md
calc-medians
combine
combined
combined.csv
craigslist-2003
craigslist-2004
craigslist-2005
craigslist-2006
craigslist-2007
craigslist-2008
craigslist-2009
craigslist-2010
craigslist-2011
craigslist-2012
craigslist-2013
craigslist-2014
craigslist-2015
craigslist-2016
craigslist-2016-07
craigslist-2016-10
craigslist-2017
craigslist-2017-02
craigslist-2017-09
craigslist-2018-06
evictions
extract-craiglist
housing-inventory
interest
irs-migration
medians
model
model-cpi-adjusted
model-distribution
model-evictions
permits
units-by-year
units-by-year-sum

README.md

Housing Inventory

This repository contains historical SF housing data and R scripts to graph that data. The data here was used to generate the graphs and analysis in the blog post "Employment, construction, and the cost of San Francisco apartments", and was recently used in a paper by Stanford researchers, "The Effects of Rent Control Expansion on Tenants, Landlords, and Inequality: Evidence from San Francisco.".

Data

Data for each year lives in the file named after the year. Later years may be listed as "craigslist-X".

You can get the rent out by running ./extract-craigslist craigslist-2016 for example. Note the data is not perfect. Here are some samples in the 2016 Craigslist data, for example.

799000 Apr 29 Exceptional Pacific Heights TIC $799000 / 2br - (Pacific Heights) pic
800 Apr 29 Awesome 5 Bedroom Available $800 / 5br - 3895ft2 - (2483 N Smiderle, San Bernardino, CA) pic
99 Apr 29 Jr. 1 BD. Washer & Dryer in unit! $99 deposit $3425 / 1br - 550ft2 - (nob hill) pic map

(It's not clear if these prices have been stripped before generating the averages in the housing-inventory file).

You can combine a bunch of data sources by running the "combine" script, ./combine. This generates the combined file in this repository.

The charts in the blog post are generated by running the model script in this repository, on the combined data.

calc-medians computes the medians for each year in the file. It prints the median, 95th, and 5th percentile for each year in the dataset. These values are present in the medians file in this repository.