The data refer to all flights in the USA from 1987 to 2008. For the project you will have to work with the data from 2004 to 2007 only. The data and their description can be found here:
http://stat-computing.org/dataexpo/2009/the-data.html
The project is the following:
Congratulations. The committee plans to hire you as the director of New Orleans Airport (MSY,"New Orleans International "). We are in 2008, so they need to show that the current director is not a good one and the airport at 2006-2007 period performed worst than the past (i.e. 2004-2005), so they need to fire him. Help them to take the position.
Prepare 5 slides that the committee can use to support firing the current director from the position. Actually you need to show that during this period the airport performed worst.
GUIDELINES
-
5 slides + opening slide
-
Attach the R code as well
-
To be more convincing you need to compare for example with other airports etc.
-
The committee has sufficient knowledge to understand more sophisticated plots, do not use only barplots
-
All plots must be created using R in this project, exclusively
-
Every day of delay has a penalty -2 points
-
Common plots between the different submissions will be downgraded, try to be innovative and creative
The R code for creating the above plots is available in the /eva-giannatou-BAII-assignment1/ directory
Instructions for running the code:
Step 1: go to http://stat-computing.org/dataexpo/2009/the-data.html
and download the flight data from 2004 to 2007
we need the following files
1. 2004.csv
2. 2005.csv
3. 2006.cav
4. 2007.csv
Step 2: place the dataset (files from step 1) into the eva-giannatou-BAII-assignment1//dataset folder
Step 3: run the code found in the eva-giannatou-BAII-assignment1/data-tranformation folder
Step 4: run the plot codes