A professional data analysis project exploring over 150 years of international football results. This project utilizes Python and Jupyter Notebooks to derive insights from the Kaggle "International Football Results from 1872 to 2024" dataset.
This repository contains a modular and professional approach to data exploration and visualization. The logic is separated into a core analytics module, ensuring maintainability and scalability.
- Data Preprocessing: Handling datetime conversions and result categorization.
- Statistical Analysis: Deep dive into goal trends and historical team performance.
- Visualization: High-resolution, professional charts using Seaborn and Matplotlib.
- Home Advantage Study: Statistical evaluation of match outcomes based on location.
├── data/
│ └── results.csv # Raw dataset
├── football_analytics.py # Core analytics engine (Modular Logic)
├── football_analysis.ipynb # Interactive analysis report
├── requirements.txt # Project dependencies
└── README.md # Project documentation
Ensure you have Python 3.8+ installed.
- Clone the repository or download the files.
- Install the required dependencies:
pip install -r requirements.txt
Open the Jupyter Notebook to view the full report:
jupyter notebook football_analysis.ipynb- Analysis of goal distributions over a century of play.
- Identification of the top 10 most successful international teams.
- Verification of "Home Advantage" through match outcome percentages.