Skip to content

jamesohawkins/US-Home-Ownership

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

First Figure

About the project

This repository includes the necessary Stata code and instructions for replicating the results from The Berkeley Institute for Young American's analysis of homeownership rates over age and time.

Methodology

I use data from IPUMS-USA to visualize homeownership rates across time and the age distribution. I use the Census-defined homeownership variable (OWNERSHP), which categorizes respondents as 1 if they reside in an owner-occupied (either owned outright or mortgaged) household or 0 if they're renting. I restrict the sample to households (non-group qarters), respondents age 21 or older, and the household's "householder" (otherwise known as "head of household" or "reference person"). I weight the analysis based on the Census-provided household weight variable (HHWT) to maintain the representativeness of the sample as it relates to the population of households in the U.S. In instances where I calculate standard errors, I use the IPUMS-recommended survey specification in Stata based on the CLUSTER, STRATA, and HHWT variables. In summary, this analysis measures the weighted percentage of householders in each age group who own a home, and changes in this percentage across time.

A note on householders: The most consequential methodological choice is to restrict the sample to householders. IPUMS notes that:

Beginning in 1980 the census questionnaire no longer referred to a "head of household", specifying instead the designation of "person one"--the first person listed on the census form. This reference person could be any household member in whose name the property was owned or rented. If no such person was present, any adult could be selected. Prior to 1970, enumerators were instructed to record the male as the head of household for all married couples, regardless of the couple's designation of a head.

Thus, my comparison between 1980 and 2021 is using a consistent definition of householders across time. Still, it is important to note that if there are unknown inconsistencies in who is being designated as a householder across time, especially inconsistencies by age, this would bias our results (for instance, if the true homeowner(s) are more/less likely to be absent from the household in more recent years). Still, I use Census' preferred methodology for measuring homeownership rates using the American Community Survey (for instance, see Figure 2 at this link). As a quality control test, I benchmark my results against the Census Bureau's publicly-reported results for 2005-2019 and replicate these results with little-to-no error.

Benchmarks

Year Estimates based on Census Estimates based on IPUMS-USA
2005 66.9% 66.9%
2006 67.3% 67.3%
2007 67.2% 67.2%
2008 66.6% 66.6%
2009 65.9% 65.9%
2010 65.4% 65.4%
2011 64.7% 64.6%
2012 64.0% 63.9%
2013 63.6% 63.5%
2014 63.2% 63.1%
2015 63.1% 63.0%
2016 63.2% 63.1%
2017 63.9% 63.9%
2018 64.0% 63.9%
2019 64.2% 64.1%

Note: Census estimates are from page 11 of the following link.

Replicating the results

Software

I use Stata Version 17. However, previous versions should work with some limits in visualization capability.

Directory

The main directory of the repository should include folders for "Scripts", "Raw-data", "Derived-data", and "Output".

Data

Source: Steven Ruggles, Sarah Flood, Matthew Sobek, Danika Brockman, Grace Cooper, Stephanie Richards, and Megan Schouweiler. IPUMS USA: Version 13.0 [dataset]. Minneapolis, MN: IPUMS, 2023. https://doi.org/10.18128/D010.V13.0

The data for this project can be obtained from IPUMS-USA. Due to data use limitations on IPUMS data, I cannot provide the raw data used for this project directly. In the space below I include instructions for downloading your own copy of the data extracts from IPUMS.

To download data from IPUMS-USA, a user must register for an account at this link.

After your account has been approved, the user will need to download the requisite variables and samples for this analysis. The following variables (other than the preselected variables in IPUMS) are necessary to complete the analysis:

Variables:

Required list
RELATE
OWNERSHP
AGE

Technically, the main analysis only requires years 1980 and 2021; therefore, the extract from IPUMS can be significantly reduced by only including those two years. However, I have included additional years since I have used other years in extended analysis.

Samples:

Decennial Census
1970 1% state fm1
1980 5% state
1990 5% state
ACS
2000-2021

The extract should be downloaded as a .dta file, which is a setting in the IPUMS extract system. This extract should be downloaded and decompressed in the Raw-data folder of the repository.

For additional instructions on downloading data from IPUMS, please refer to this guide. For instructions on opening an IPUMS extract, please refer to this guide.

Code

The only user-required change to the scripts necessary to replicate the main analysis is the corresponding file name of the user's IPUMS extract. On line 44 of 0_script-control.do, within the quotes rename global usa_file1 = "[INSERT EXTRACT NAME HERE]" to your corresponding extract name. For instance, this could be global usa_file1 = "usa_00001". See below:

image

Runtime

On a PC with an Intel i7-11700, 64GB ram, and running 64-bit Windows 11 the code executed in approximately 46 minutes. The longest chunk of the code is to produce the results for the second figure, which is based on the svyset and svy commands in Stata to produce accurate standard errors.

Ackowledgments

I am grateful to Sarah Swanbeck for reviewing and providing feedback on all stages of the analysis, and Jesús Guzmán and Nicholas Adams-Cohen for helpful suggestions to improve the readability of the visualizations. All errors or omissions are my own.

About

Analysis of homeownership rates in the U.S.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages