Analysing Food Health Rating scores across the UK from http://ratings.food.gov.uk/open-data/en-GB
- The aim of this project is to initially extract and clean all data related to Food Health Ratings to understand the distribution of Ratings across the UK
- Once the data has been extracted and clean I plan to incorporate external data sources such as Housing Prices, Future development in high populated areas, Crime data to potentially understand what factors contribute to the determination of House Prices
I initially looked through the above website to understand the html of the page and to identify the different tags I am interested in. This allowed me to find the relevant data and perform my initial extraction.
Next step is to clean the data and perform EDA in Python and PySpark