Folder that contains work submitted (and progress) of assignment 2 coursework for module GEOG5995
- Assignment2- Elsa - first.py --> An earlier attempt of the program.
- Assignment2 - Elsa.py --> The finished program to be assessed.
- Assignment2_supportingDocumetation.docx --> supporting documentation to provide detailed guidance about using the program.
- LICENSE --> license file for use of program.
- README --> this document
- cleaned_data_pthon.xlsx --> data output from run program
- elsa_sampler_UKDA_Data_Dictionary.rtf --> data dictionary for dataset used by program.
- elsawave1.csv --> raw data used to run the program.
- output.txt --> textual output from program once run.
- plots.pdf --> pdf images of plots created by program when run.
- python_assignemnt2_UML.png --> UML diagram describing the process of the program.
The 'Assignment2-Elsa.py' file is my attempt to complete assignment 2 for course code GEOG5995. The assignment brief detailed: reading in a dataset, processing the data set in some way, displaying the results, and printing the results.
My attempt at this assignment takes the Elsa wave 1 data set (a national dataset looking at ageing), creates a new variable for age, and selects a number of other variables to create a visualisation of the data set. This includes looking at the spread of relationship status within the population, the age distribution, and the age that people consider old, and also a breakdown of how the population consider their health to be.
The outputs of this program are then printed either to the spreadsheet file, the pdf file or a txt file depending on the type of output generated.
This program can be run as a whole as long as the elwawave1.csv dataset is in the same folder as the program. The program has been developed using Spyder IDE so it is recommended that users also use this software.
This program during testing was taking approximately 4 seconds to run. In the command line the program will produce the time taken to run the program.
When the program is run 4 plots will be procuded in pop out windows. 1 pie chart. 1 bar chart and 3 histograms. These charts will also be printed to a pdy file called plots.pdf The program will also write the final dataframe to an excel file called cleaned_Data_pthon.xlsx. This can be viewed in excel. The program will also generate a new txt file that will contain the textual outputs of the program. This includes a contingency table and some summary statistics for the new age variable created.
For more details about how to run the program and what to expect please refer to the specific documentation created Click Here
see license here