Before starting our analysis, we carefully studied the legend and the taxi_information PDF to better understand our dataset. For every analysis, we explicitly mention the information from both files that is relevant to our results. Because we had to deal with a very large amount of data, we decided to split the tasks across different Jupyter notebook files to make our evaluation simpler and clearer. But let's dive deep into it. Here is how our homework is divided:
- Code_before_starting: This is all the code that covers the first section of the homework. We explain the choices we have made and the reason why we have chosen some methods instead of others.
- RQ1-2-3-4-5: These .ipynb files contain all the code related to the research questions, with comments.
- Final_core.ipynb: All the code related to the first core question. The Student's t-test and p-values are discussed here.
- Finalmap: All the code related to the second core question. Choropleth map and comments about it.
And different images (we think no description is needed for those :P)
Participants' names: Nagham Almagout, Edoardo Cantagallo, Giulia Maslov
