
This repository contains three statistical analysis projects. Each project demonstrates proficiency in statistical modelling, data visualisation, and Python programming.


rashad-malik/Python-Stats-Projects

Python Statistics Projects

This repository contains three statistical analysis projects. Each project demonstrates proficiency in statistical modelling, data visualisation, and Python programming.

📊 Projects Overview

1. Analysis of Regional Property Price Trends

File: Analysis of Regional Property Price Trends.ipynb

An analysis of regional property price changes in England between July 2017 and July 2018.

Key Features:

  • Statistical analysis of flat price changes across 9 English regions
  • Chi-square and permutation tests of independence
  • Custom visualisations including dot plots and regional bar charts
  • G-statistic calculation for hypothesis testing

Technologies: pandas, numpy, matplotlib, scipy
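
As a rough illustration of the tests listed above, the sketch below runs a chi-square test, a G-test, and a permutation test of independence on a small contingency table. The table, the `permutation_p_value` helper, and the number of permutations are assumptions made for this example. They are not taken from the notebook or from average_flat_prices.csv.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Hypothetical 2 x 3 table of counts (rows: price rose / price fell,
# columns: three regions). The numbers are invented for illustration.
observed = np.array([[30, 22, 18],
                     [10, 18, 22]])

# Pearson chi-square test of independence.
chi2_stat, p_chi2, dof, expected = chi2_contingency(observed, correction=False)

# G-statistic (log-likelihood ratio) computed on the same table.
g_stat, p_g, _, _ = chi2_contingency(
    observed, correction=False, lambda_="log-likelihood"
)


def permutation_p_value(table, n_perm=1000, seed=0):
    """Permutation p-value for independence, using the chi-square statistic."""
    rng = np.random.default_rng(seed)
    rows, cols = np.indices(table.shape)
    # Expand the table into one (row label, column label) pair per observation.
    row_labels = np.repeat(rows.ravel(), table.ravel())
    col_labels = np.repeat(cols.ravel(), table.ravel())
    observed_stat, _, _, _ = chi2_contingency(table, correction=False)
    hits = 0
    for _ in range(n_perm):
        # Shuffling one set of labels breaks any association but keeps the margins.
        shuffled = rng.permutation(col_labels)
        perm_table = np.zeros(table.shape, dtype=int)
        np.add.at(perm_table, (row_labels, shuffled), 1)
        perm_stat, _, _, _ = chi2_contingency(perm_table, correction=False)
        if perm_stat >= observed_stat:
            hits += 1
    # Add one to numerator and denominator so the p-value is never exactly zero.
    return (hits + 1) / (n_perm + 1)


p_perm = permutation_p_value(observed)
print(f"chi2={chi2_stat:.3f} (p={p_chi2:.4f}), G={g_stat:.3f} (p={p_g:.4f}), "
      f"permutation p={p_perm:.4f}")
```

The permutation test is useful as a cross-check when expected counts are small and the chi-square approximation is less reliable.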


2. Demographic Analysis of UK Population Using 2021 Census Data

File: Demographic Analysis of UK Population Using 2021 Census Data.ipynb

A demographic analysis that uses 2021 UK Census data to explore population characteristics and trends.

Key Features:

  • Analysis of census data at LSOA (Lower Layer Super Output Area) level
  • Demographic trend identification and visualisation
  • Spatial data analysis using geographic datasets

Technologies: pandas, numpy, matplotlib, geopandas
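
A minimal sketch of what LSOA-level analysis can look like in pandas is shown below. The LSOA codes, column names, and counts are hypothetical and do not reflect the actual layout of LSOA_data.csv.

```python
import pandas as pd

# Hypothetical LSOA-level rows; codes, columns and counts are invented.
lsoa = pd.DataFrame({
    "lsoa_code": ["E01000001", "E01000002", "E01000003"],
    "region": ["London", "London", "North East"],
    "total_population": [1500, 1800, 1600],
    "aged_65_plus": [300, 270, 480],
})

# Share of residents aged 65+ within each LSOA.
lsoa["share_65_plus"] = lsoa["aged_65_plus"] / lsoa["total_population"]

# Regional share: sum the counts first, then divide, so that larger
# LSOAs carry proportionally more weight than smaller ones.
by_region = lsoa.groupby("region")[["aged_65_plus", "total_population"]].sum()
by_region["share_65_plus"] = (
    by_region["aged_65_plus"] / by_region["total_population"]
)

print(by_region)
```

A table like this could then be merged onto LSOA boundaries loaded with `geopandas.read_file` to draw a map, assuming the geometry file carries a matching code column.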


3. Statistical Analysis of Texas Bridges Using Regression Modelling

File: Statistical Analysis of Texas Bridges Using Regression Modelling.ipynb

A statistical investigation of Texas bridge infrastructure using regression modelling techniques to understand factors affecting bridge conditions.

Key Features:

  • Regression analysis of bridge infrastructure data
  • Model building and validation
  • Statistical inference and interpretation

Technologies: pandas, numpy, matplotlib, scipy, statsmodels
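
The sketch below shows one way a multiple regression could be fitted with statsmodels. The data are synthetic and the variable names (`age_years`, `log_traffic`, `condition_score`) are assumptions for illustration only. They are not the columns of tx19_bridges_sample.csv or the model used in the notebook.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic data standing in for a bridge sample (all values are simulated).
rng = np.random.default_rng(42)
n = 200
bridges = pd.DataFrame({
    "age_years": rng.uniform(0, 80, n),
    "log_traffic": rng.normal(8.0, 1.0, n),
})

# Simulated response: condition declines with age and with traffic, plus noise.
bridges["condition_score"] = (
    8.0
    - 0.04 * bridges["age_years"]
    - 0.2 * bridges["log_traffic"]
    + rng.normal(0.0, 0.5, n)
)

# Ordinary least squares with two predictors and an intercept.
model = smf.ols("condition_score ~ age_years + log_traffic", data=bridges).fit()

print(model.params)                    # estimated coefficients
print(f"R-squared: {model.rsquared:.3f}")
```

Calling `model.summary()` prints standard errors, confidence intervals, and p-values, which is where the inference and interpretation steps would start.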


πŸ“ Repository Structure

Python Stats Projects/
├── README.md
├── data/
│   ├── average_flat_prices.csv
│   ├── LSOA_data.csv
│   ├── LSOA_England_geom.gpkg
│   └── tx19_bridges_sample.csv
├── notebooks/
│   ├── Analysis of Regional Property Price Trends.ipynb
│   ├── Demographic Analysis of UK Population Using 2021 Census Data.ipynb
│   └── Statistical Analysis of Texas Bridges Using Regression Modelling.ipynb
└── html_exports/
    ├── Analysis_of_Regional_Property_Price_Trends.html
    ├── Demographic_Analysis_of_UK_Population_Using_2021_Census_Data.html
    └── Statistical_Analysis_of_Texas_Bridges_Using_Regression_Modelling.html

🚀 Getting Started

Prerequisites

  • Python 3.8 or higher
  • Jupyter Notebook or JupyterLab

Required Packages

Install the required packages using pip:

pip install pandas numpy matplotlib scipy geopandas statsmodels
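
To confirm the installation before opening the notebooks, a short check like the following can help. It is only a sketch: the `missing_packages` helper is a hypothetical name introduced here, and it only tests whether each package can be found, not whether the version is suitable.

```python
import importlib.util

# Packages named in the pip command above.
REQUIRED = ["pandas", "numpy", "matplotlib", "scipy", "geopandas", "statsmodels"]


def missing_packages(names):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are installed.")
```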

📈 Key Statistical Methods Used

  • Hypothesis Testing: Chi-square tests, permutation tests
  • Regression Analysis: Linear and multiple regression modelling
  • Descriptive Statistics: Central tendency, dispersion, distribution analysis
  • Data Visualisation: Custom plots using matplotlib
  • Spatial Analysis: Geographic data manipulation and visualisation

πŸ™ Acknowledgments

  • UK Ministry of Housing, Communities & Local Government for property price data
  • Office for National Statistics for 2021 Census data
  • Texas Department of Transportation for bridge infrastructure data
