π A comprehensive data analysis project on Nassau shipping data β uncovering trade patterns, cargo trends, port performance, and key logistics insights through interactive visualizations and a live Streamlit dashboard.
- π― Project Overview
- π Repository Structure
- π Dataset Overview
- π Key Metrics & KPIs
- π Analysis & Charts
- π οΈ Tech Stack
- β‘ Quick Start
- π₯οΈ Dashboard Features
- π‘ Key Insights
- π€ Connect With Me
The Nassau Shipping Analysis project is an end-to-end data analysis solution that dives deep into shipping data from the Nassau region. Using Python's powerful data science ecosystem, this project transforms raw shipping records into actionable business intelligence through:
- π Exploratory Data Analysis (EDA) β Uncovering hidden patterns in shipping data
- π Statistical Summaries β Descriptive statistics, correlations, and distributions
- πΊοΈ Visual Storytelling β Rich charts and graphs for every key metric
- π Interactive Dashboard β A deployed Streamlit app for live data exploration
- π‘ Business Insights β Actionable recommendations from data findings
π¦ data_analysis_project/
βββ π Nassau_Shipping_Analysis/
β βββ π Nassau_Shipping_Analysis.ipynb # Main Jupyter Notebook (EDA + Analysis)
β βββ π app.py # Streamlit Dashboard Application
β βββ π data/ # Raw & Processed Datasets
β βββ πΈ assets/ # Charts & Visualizations
βββ π .devcontainer/ # Dev Container Configuration
βββ π README.md # Project Documentation
The dataset captures detailed shipping activity across Nassau's maritime trade routes.
| Feature | Description | Type |
|---|---|---|
Ship Name |
Name/ID of the vessel | string |
Route |
Origin β Destination | string |
Cargo Type |
Type of goods transported | categorical |
Cargo Weight (tons) |
Weight of the shipment | float |
Departure Date |
Date of departure | datetime |
Arrival Date |
Date of arrival | datetime |
Transit Time (days) |
Duration of voyage | int |
Port of Loading |
Source port | string |
Port of Discharge |
Destination port | string |
Freight Cost ($) |
Cost of shipping | float |
Ship Type |
Vessel classification | categorical |
Status |
Delivered / In Transit / Delayed | categorical |
| π¦ Metric | π Value | π Description |
|---|---|---|
| Total Shipments | Analyzed across full dataset | Volume of all recorded voyages |
| Avg. Transit Time | Computed per route | Mean voyage duration in days |
| Top Cargo Category | Dominant freight type | Most shipped cargo class |
| Busiest Port | Highest throughput port | Port with most departures/arrivals |
| Avg. Freight Cost | Calculated across routes | Mean cost per shipment in USD |
| On-Time Delivery Rate | % of non-delayed shipments | Operational efficiency KPI |
| Cargo Volume (tons) | Total aggregated weight | Sum of all cargo transported |
| Peak Shipping Month | Identified via time series | Month with highest activity |
The analysis covers the following visual explorations generated in the Jupyter Notebook and rendered live in the dashboard:
β
Cargo weight distribution (histogram + KDE)
β
Freight cost distribution by ship type
β
Transit time spread across routes
β
Shipment volume by month (time series)
β
Heatmap β Correlation matrix of numeric features
β
Scatter plot β Freight cost vs. cargo weight
β
Box plot β Transit time per cargo type
β
Pair plot β Multi-variable relationship matrix
β
Bar chart β Top 10 busiest ports (loading & discharge)
β
Sankey diagram β Route flow visualization
β
Grouped bar β Route-wise average freight cost
β
Donut chart β Cargo type distribution
β
Monthly shipment volume trend
β
Seasonal freight cost fluctuation
β
Year-over-year cargo growth
β
Rolling average transit time
β
Delivery status breakdown (pie chart)
β
Delay rate per route (horizontal bar)
β
Ship type utilization (stacked bar)
β
Cost efficiency ratio per vessel class
Follow these steps to run the project locally:
git clone https://github.com/Techwithabhi/data_analysis_project.git
cd data_analysis_projectpython -m venv venv
source venv/bin/activate # On macOS/Linux
venv\Scripts\activate # On Windowspip install -r requirements.txtcd Nassau_Shipping_Analysis
jupyter notebook Nassau_Shipping_Analysis.ipynbstreamlit run app.pyπ Or simply visit the Live Dashboard β no setup required!
The live Streamlit app offers:
- ποΈ Interactive Filters β Filter by date range, cargo type, ship type, and port
- π Dynamic Charts β All plots update in real-time based on filter selections
- π Raw Data View β Explore the underlying dataset with search & sort
- π₯ Download Options β Export filtered data as CSV
- π± Responsive Layout β Works seamlessly across desktop and mobile
- π Dark/Light Mode β Adapts to system theme preference
Derived from the Nassau Shipping Analysis:
π INSIGHT 1 β Bulk cargo constitutes the largest share of shipments
β Dominant freight type driving port throughput volumes
π INSIGHT 2 β Transit time shows a strong positive correlation with freight cost
β Longer routes yield proportionally higher operational expenses
π INSIGHT 3 β Q3 (JulβSep) records peak shipping activity
β Seasonal demand surge affects port congestion and pricing
π INSIGHT 4 β Top 3 routes account for 60%+ of total cargo volume
β High route concentration presents both efficiency and risk factors
π INSIGHT 5 β Delay rates vary significantly by cargo type
β Perishable goods experience 2x higher delay rates vs. dry bulk
Contributions, issues, and feature requests are welcome!
- Fork the project
- Create your feature branch:
git checkout -b feature/AmazingFeature - Commit your changes:
git commit -m 'Add some AmazingFeature' - Push to the branch:
git push origin feature/AmazingFeature - Open a Pull Request