This repository contains three statistical analysis projects. Each project demonstrates proficiency in statistical modelling, data visualisation, and Python programming.
File: Analysis of Regional Property Price Trends.ipynb
An analysis of regional property price changes in England between July 2017 and July 2018.
Key Features:
- Statistical analysis of flat price changes across 9 English regions
- Chi-square test and permutation test for independence testing
- Custom visualisations including dot plots and regional bar charts
- G-statistic calculation for hypothesis testing
Technologies: pandas, numpy, matplotlib, scipy
File: Demographic Analysis of UK Population Using 2021 Census Data.ipynb
A demographic analysis leveraging the 2021 UK Census data to explore population characteristics and trends.
Key Features:
- Analysis of census data at LSOA (Lower Layer Super Output Area) level
- Demographic trend identification and visualisation
- Spatial data analysis using geographic datasets
Technologies: pandas, numpy, matplotlib, geopandas
File: Statistical Analysis of Texas Bridges Using Regression Modelling.ipynb
A statistical investigation of Texas bridge infrastructure using regression modelling techniques to understand factors affecting bridge conditions.
Key Features:
- Regression analysis of bridge infrastructure data
- Model building and validation
- Statistical inference and interpretation
Technologies: pandas, numpy, matplotlib, scipy, statsmodels
Python Stats Projects/
βββ README.md
βββ data/
β βββ average_flat_prices.csv
β βββ LSOA_data.csv
β βββ LSOA_England_geom.gpkg
β βββ tx19_bridges_sample.csv
βββ notebooks/
β βββ Analysis of Regional Property Price Trends.ipynb
β βββ Demographic Analysis of UK Population Using 2021 Census Data.ipynb
β βββ Statistical Analysis of Texas Bridges Using Regression Modelling.ipynb
βββ html_exports/
βββ Analysis_of_Regional_Property_Price_Trends.html
βββ Demographic_Analysis_of_UK_Population_Using_2021_Census_Data.html
βββ Statistical_Analysis_of_Texas_Bridges_Using_Regression_Modelling.html
- Python 3.8 or higher
- Jupyter Notebook or JupyterLab
Install the required packages using pip:
pip install pandas numpy matplotlib scipy geopandas statsmodels- Hypothesis Testing: Chi-square tests, permutation tests
- Regression Analysis: Linear and multiple regression modelling
- Descriptive Statistics: Central tendency, dispersion, distribution analysis
- Data Visualisation: Custom plots using matplotlib
- Spatial Analysis: Geographic data manipulation and visualisation
- UK Ministry of Housing, Communities & Local Government for property price data
- Office for National Statistics for 2021 Census data
- Texas Department of Transportation for bridge infrastructure data