About the project

This repository includes the necessary Stata code and instructions for replicating the results from The Berkeley Institute for Young American's analysis of homeownership rates over age and time.

Methodology
Replicating the Results

Methodology

I use data from IPUMS-USA to visualize homeownership rates across time and the age distribution. I use the Census-defined homeownership variable (OWNERSHP), which categorizes respondents as 1 if they reside in an owner-occupied (either owned outright or mortgaged) household or 0 if they're renting. I restrict the sample to households (non-group qarters), respondents age 21 or older, and the household's "householder" (otherwise known as "head of household" or "reference person"). I weight the analysis based on the Census-provided household weight variable (HHWT) to maintain the representativeness of the sample as it relates to the population of households in the U.S. In instances where I calculate standard errors, I use the IPUMS-recommended survey specification in Stata based on the CLUSTER, STRATA, and HHWT variables. In summary, this analysis measures the weighted percentage of householders in each age group who own a home, and changes in this percentage across time.

A note on householders: The most consequential methodological choice is to restrict the sample to householders. IPUMS notes that:

Beginning in 1980 the census questionnaire no longer referred to a "head of household", specifying instead the designation of "person one"--the first person listed on the census form. This reference person could be any household member in whose name the property was owned or rented. If no such person was present, any adult could be selected. Prior to 1970, enumerators were instructed to record the male as the head of household for all married couples, regardless of the couple's designation of a head.

Thus, my comparison between 1980 and 2021 is using a consistent definition of householders across time. Still, it is important to note that if there are unknown inconsistencies in who is being designated as a householder across time, especially inconsistencies by age, this would bias our results (for instance, if the true homeowner(s) are more/less likely to be absent from the household in more recent years). Still, I use Census' preferred methodology for measuring homeownership rates using the American Community Survey (for instance, see Figure 2 at this link). As a quality control test, I benchmark my results against the Census Bureau's publicly-reported results for 2005-2019 and replicate these results with little-to-no error.

Benchmarks

Year	Estimates based on Census	Estimates based on IPUMS-USA
2005	66.9%	66.9%
2006	67.3%	67.3%
2007	67.2%	67.2%
2008	66.6%	66.6%
2009	65.9%	65.9%
2010	65.4%	65.4%
2011	64.7%	64.6%
2012	64.0%	63.9%
2013	63.6%	63.5%
2014	63.2%	63.1%
2015	63.1%	63.0%
2016	63.2%	63.1%
2017	63.9%	63.9%
2018	64.0%	63.9%
2019	64.2%	64.1%

Note: Census estimates are from page 11 of the following link.

Replicating the results

Software

I use Stata Version 17. However, previous versions should work with some limits in visualization capability.

Data

Source: Steven Ruggles, Sarah Flood, Matthew Sobek, Danika Brockman, Grace Cooper, Stephanie Richards, and Megan Schouweiler. IPUMS USA: Version 13.0 [dataset]. Minneapolis, MN: IPUMS, 2023. https://doi.org/10.18128/D010.V13.0

The data for this project can be obtained from IPUMS-USA. Due to data use limitations on IPUMS data, I cannot provide the raw data used for this project directly. In the space below I include instructions for downloading your own copy of the data extracts from IPUMS.

To download data from IPUMS-USA, a user must register for an account at this link.

After your account has been approved, the user will need to download the requisite variables and samples for this analysis. The following variables (other than the preselected variables in IPUMS) are necessary to complete the analysis:

Variables:

Required list
RELATE
OWNERSHP
AGE

Technically, the main analysis only requires years 1980 and 2021; therefore, the extract from IPUMS can be significantly reduced by only including those two years. However, I have included additional years since I have used other years in extended analysis.

Samples:

Decennial Census
1970 1% state fm1
1980 5% state
1990 5% state

ACS
2000-2021

The extract should be downloaded as a .dta file, which is a setting in the IPUMS extract system. This extract should be downloaded and decompressed in the Raw-data folder of the repository.

For additional instructions on downloading data from IPUMS, please refer to this guide. For instructions on opening an IPUMS extract, please refer to this guide.

Code

The only user-required change to the scripts necessary to replicate the main analysis is the corresponding file name of the user's IPUMS extract. On line 44 of 0_script-control.do, within the quotes rename global usa_file1 = "[INSERT EXTRACT NAME HERE]" to your corresponding extract name. For instance, this could be global usa_file1 = "usa_00001". See below:

Runtime

On a PC with an Intel i7-11700, 64GB ram, and running 64-bit Windows 11 the code executed in approximately 46 minutes. The longest chunk of the code is to produce the results for the second figure, which is based on the svyset and svy commands in Stata to produce accurate standard errors.

Ackowledgments

I am grateful to Sarah Swanbeck for reviewing and providing feedback on all stages of the analysis, and Jesús Guzmán and Nicholas Adams-Cohen for helpful suggestions to improve the readability of the visualizations. All errors or omissions are my own.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Output		Output
Scripts		Scripts
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output

Output

Scripts

Scripts

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

Repository files navigation

About the project

Methodology

Benchmarks

Replicating the results

Software

Directory

Data

Variables:

Samples:

Code

Runtime

Ackowledgments

About

Releases

Packages

Languages

jamesohawkins/US-Home-Ownership

Folders and files

Latest commit

History

Repository files navigation

About the project

Methodology

Benchmarks

Replicating the results

Software

Directory

Data

Variables:

Samples:

Code

Runtime

Ackowledgments

About

Resources

Stars

Watchers

Forks

Languages