Skip to content

🟡 Data visualization & analysis of COVID 19 data in the NYC area

Notifications You must be signed in to change notification settings

yeeryan/nyt-mask-obs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nyt-mask-obs

➡️ View the Report Draft ⬅️

Overview

This report aims to compare the observed mask usage rates in select areas in NYC to the COVID 19 data taken around the same time. The data is shown with visuals such as maps and charts for comparison across areas in NYC. Each area in this analysis is divided by ZCTA which is used by NYC Health and can be inferred from the NY Times article. The dates chosen are due to the estimated 2 week time period for COVID 19-related symptoms show. Each data set is analysed and visualized, and the methodology and potential limitations are explored in the report.

Linear regression analysis is conducted later to determine if observed mask rates and population are signficiant predictors of case rates.

Data Visualizations

The New York Times article offers street intersections, neighbourhoods, and boros for geographic info. NYC Health provides modified ZCTAs and neighbourhood names for geographic info. In order to map out the NYT observations, I manually looked up each street intersections' ZCTA and added it to the NYT_obs data set. This map uses geographic data from the US Census to draw each ZCTA.

The layer control on the upper left switches between COVID 10 Rate per 100k from NYC Health on different weeks and mask observations. When looking at NYC Health data designated by a date, clicking on a ZCTA will provide the following:

  • neighbourhood information (as written by NYC Health)
  • boro and ZCTA
  • case rate

When looking at the mask observations, clicking on a ZCTA will show the following:

  • neighbourhood information (as written by the NYT)
  • street intersection of observation
  • observed mask usage

Additional data visualizations are included as interactive plots throughout the report.

Packages Used

  • tidyverse
  • ggfortify
  • dplyr
  • plotly (interactive plots)
  • leafet (interactive map)
  • tigiris (spatial data from US Census)

Usage

A. Access the report HERE

OR

B. Use R to recreate the interactive dashboard with the following instructions:

  1. Install the packages used in R with the following command:

install.packages(c("tidyverse","dplyr","tigiris","plotly","leaflet"))

  1. Clone the repository

  2. Open Report.Rmd in the project's report folder in RStudio

  3. Knit Report.Rmd to generate a dashboard

Data Sources

The New York Times - Are New Yorkers Wearing Masks?

NYC Health - COVID-19 Data

About

🟡 Data visualization & analysis of COVID 19 data in the NYC area

Topics

Resources

Stars

Watchers

Forks

Languages