Skip to content

TZstatsADS/Spring2019-Proj2-grp11

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Background & Insights:

• Though ride-hailing industry is rapidly growing these year, New York City is actually limiting those apps such as Uber and Lyft by putting a freeze in licensing new cars in the city. This plan was made to prevent urban congestion. However, those apps are still very convenient options for people who need short-distance driving services. Under this circumstances, we want to compare the taxi and Uber pickup number. By doing this, we can see which one is more popular at NYC and also the pattern (shrinkage or growth) of number change for both options during 2015 and 2016.

• When weather condition is bad, like rainy, heavily snowy, hail, taking a taxi might be a easier choice for you, but will it be costly? Just to make life easier, we want to compare the taxi price in different weather condition.

• When going to the airports, you definitely want to go there on time. So we made a map by time specifically at New York Airports (i.e. JFK, EWR and LGA). This could help people to make a decision when they need to go to the airport.

Dataset (OPEN NYC):

• Yellow taxi data

• Uber pickup data

• NYC Airport data

Plan:

• Obtain the dataset we need and update the dataset by adding data of 2016

• Select features we interest that can clearly show the comparison under different conditions (e.g. business day or not)

• Process the data and explore it more by visualizing it

• Update the code to add panels we need

Project 2: Shiny App Development Version 2.0

screenshot

In this second project of GR5243 Applied Data Science, we develop a version 2.0 of an Exploratory Data Analysis and Visualization shiny app on a topic of your choice using NYC Open Data or U.S. government open data released on the data.gov website. See Project 2 Description for more details.

The learning goals for this project is:

  • business intelligence for data science
  • study legacy codes and further development
  • data cleaning
  • data visualization
  • systems development/design life cycle
  • shiny app/shiny server

*The above general statement about project 2 can be removed once you are finished with your project. It is optional.

Project Title: NYC Taxi Traffic

Link to Shiny App: https://yeyejiang.shinyapps.io/NYC_Taxi_Project/ Term: Spring 2019

  • Team: Group 11

  • NYC Taxi Traffic: + Team members

    • Chen, Xinyi
    • Hu, Xinyi
    • Jiang, Hongye
    • Lin, Nelson
    • Zhang, Liwei
  • Project summary: This app three topics, which are traffic map, time flow map and airport traffic map. For the dataset, we covered NYC weather, taxi and uber pickup number, airport traffic between 2015 and 2016.

The first map shows NYC traffic in normal weather(i.e. sunny) and bad weather(i.e. rainy, snowy, hail and foggy). Moreover, it also shows general NYC taxi information in 2016, including total pick up number and drop off number, weather condition, fare per distance and area category. After choosing a specific location, it will jump to another panel and shows the detailed information about traffic flow, fare and basic information of the selected neighborhood.

The second map focuses on comparing pickup numbers and locations of NYC taxi and uber by times. From the perspective of location, it's not hard to tell that NYC taxi is much more popular than uber in Manhattan Island, while only a small amount of taxi pickups appeared in Brooklyn and Queens. Besides, larger circle radiuses show larger pickup number. From 2015 to 2016, more customers chose to take Taxi rather than Uber. However, the Uber dataset only covers the first half year of 2015. If there is newer dataset for Uber, we might see a clearer trend of Taxi and Uber growth.

The third map shows airport traffic depending on where and when someone is picked up. This is useful for showing the amount of traffic one might encounter on their trip.

  • Contribution statement: (default) All team members contributed equally in all stages of this project. All team members approve our work presented in this GitHub repository including this contributions statement.

Panel 1(traffic map): Hongye Jiang: Combined and aggregated all the data; built the functions for drawing the map pattern; did neighborhood classification and basic statistical analysis. Xinyi Chen: Cleaned weather data, recreated “Taxi Traffic Overview” panel; made all functions work.

Panel 2(More Neighborhood Details): Xinyi Chen create a new “Neighborhood Details” panel; made all functions work; plot graphs with Hongye Jiang.

Panel 3(time flow map): Liwei Zhang queried, cleaned Uber data. Xinyi Hu queried, cleaned yellow taxi data of year 2016 and updated yellow taxi data of year 2015. We built the panel 2 together. Xinyi Hu integrated all three panels.

Panel 4(airport traffic map): Nelson Lin queried, cleaned the data and built the tab for the shiny app. Xinyi Hu integrated it into the rest of the app.

Presentation: Liwei Zhang is the presenter.

Following suggestions by RICH FITZJOHN (@richfitz). This folder is orgarnized as follows.

proj/
├── app/
├── lib/
├── data/
├── doc/
└── output/

Please see each subfolder for a README file.

About

Spring2019-Proj2-grp11 created by GitHub Classroom

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •