Skip to content

DAB-501-Fall2021/CourseContent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

Week 6 - Data Transformation and Visualization

We will be using the NMMAPS dataset used in Week 4. First, load tidyverse, and then import the dataset using the code below: (you may need to replace the "")

chicago <- read_csv("https://raw.githubusercontent.com/DAB-501-Fall2021/CourseContent/main/Data%20Visualization/chicago-nmmaps.csv")

Transform and visualize the dataset to answer the following questions.

  1. What is the highest temperature in Summer? In Winter?
  2. Which season had the most deaths?
  3. For the season with the most deaths, what is the relationship between pm10, temperature, and death?

Week 4 - Data Visualization - Mapping vs. Setting

We will be using the dataset National Morbidity and Mortality Air Pollution Study (NMMAPS) to learn about ggobject and the difference between mapping and setting, as well as global and local mapping.

First, load tidyverse package

Next, import the dataset from the Data Visualization folder using the readr package, available through tidyverse

chicago <- read_csv(“https://raw.githubusercontent.com/DAB-501-Fall2021/CourseContent/main/Data%20Visualization/chicago-nmmaps.csv”)

Store current ggobject in variable g, including mappings

g <- ggplot(chicago, aes(x=date, y=temp))

Extend ggobject g by adding other layers, including settings

g + geom_point(colour = "firebrick", shape = "diamond", size = 2)

Now, extend the ggobject to other plot types.

Store new ggbject in variable p1, including labels

p1 <- ggplot(chicago, aes(x = date, y = temp, color = season)) + 
           geom_point() + 
           labs(x = "Year", y = "Temperature (°F)")

Extend ggobject p1 by adding other geom_xxxx layers

p1 + geom_rug()

Week 3 - Data Visualization

We will be using the Pokemon dataset found from Kaggle at the link below to create visualizations. The dataset contains both numerical and categorical data and it is available as a .csv file.

https://www.kaggle.com/abcsds/pokemon?select=Pokemon.csv

Please go to the 'Data Visualization' folder to download the dataset and follow the instructions to contruct the visualizaitons.

Week 2 - Try Out Swirl

Head over to the Swirl folder listed above and try out the Swirl tutorials.

Additional Swirl courses can be found at: http://swirlstats.com/scn/title.html

Week 1 - Install R and RStudio

Welcome to DAB 501 Basic Statistics & Exploratory Data Analytics

We will be using GitHub to share data and R files to be used throughout the course.

Each main topic will have a folder where new content will be added weekly.

Please feel free to refer back to the content to help with assignments and projects.


First, install R: https://cran.r-project.org/bin/windows/base/

Click on Download R 4.1.1 for Windows and wait for it to finish downloading. Run the executable file R-4.1.1-win.exe you just downloaded. Follow the installer instructions.

Next, install RStudio: https://www.rstudio.com/products/rstudio/download/#download

Under the Installers heading, click on RStudio 1.4.1717-Windows 10 and wait for it to finish downloading. Run the executable file RStudio-1.4.1717.exe you just downloaded. Follow the installer instructions.

About

DAB 501 Basic Statistics & Exploratory Data Analysis

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published