Skip to content

Xiaozhu-Zhang1998/Data-Analysis-for-Beijing-PM2.5

Repository files navigation

Data Analysis for PM2.5 in Beijing

Was PM2.5 in Beijing decreasing?

This is a repository for the final project of Class Introduction to Data Science in UCLA Extension.


Project Description

Before the Beijing 2008 Summer Olympic Games, the severe air pollution in Beijing had already caught the whole world's attention. Among all those gauges of air pollution, PM2.5 concentration is no doubt the most famous and effective. According to U.S. Environmental Protection Agency (EPA), the air quality index (AQI) could be regarded as “Good” if AQI is less than 50, whereas the average AQI in Beijing from 2010 to 2014 was almost 100. In order to deal with this serious problem, we use data set from UCI Repository (with time series data from 2010 to 2014) to explore the factors influencing PM2.5 in Beijing, perform machine learning algorithm to impute missing values, and comment on the trend of PM2.5 during the 5 years studied. Suggestion on how to improve air quality in Beijing is given at the end.

Source of the Data Set

UCI Repository - Beijing pm2.5

Contents

File Description
PRSA_data_2010.1.1-2014.12.31.csv The dataset in csv format
Code.R The codes of data analysis in R file
Memo.pdf The memo for management-level audience to read
Report (Technical Details).pdf The formal report for data scientists to read

Authors

Xiaozhu Zhang - GitHub

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages