Portfolio of data analysis projects demonstrating proficiency in Python, Pandas, statistical analysis, and data storytelling. Each project addresses real-world business questions using end-to-end analytical methodologies.
Portfolio of real-world data analysis projects demonstrating Python, Pandas, statistical analysis, and data visualization skills
Comprehensive survival analysis identifying passenger categories with highest survival rates
- Analyzed 891 passenger records to identify survival patterns across demographics
- Engineered 3 new features (AgeGroup, FamilySize, TravelingAlone) through data transformation
- Key Finding: Female 1st class passengers had 96.8% survival rate vs 13.5% for male 3rd class
- Skills: Pandas, Feature Engineering, Statistical Analysis, Data Cleaning
End-to-end data pipeline analyzing crime trends across Los Angeles
- Processed 100K+ crime records to identify patterns by location, time, and crime type
- Built automated data cleaning pipeline reducing preparation time by 60%
- Key Finding: Improved crime pattern interpretability by 30% through targeted visualizations
- Skills: Pandas, NumPy, Matplotlib, Data Pipeline Development, EDA
NLP project extracting actionable business insights from product reviews
- Analyzed customer feedback using text processing and frequency analysis
- Identified key product strengths and improvement areas
- Key Finding: Picture quality identified as top strength, sound as primary concern
- Skills: String Processing, Text Analysis, NLP Fundamentals, Business Intelligence
**[
- Python 3.x
- Pandas - Data manipulation and analysis
- NumPy - Numerical computations
- Matplotlib - Data visualization
- Jupyter Notebook - Interactive development
- Data Cleaning & Preprocessing
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Statistical Analysis
- Data Visualization
- Business Insight Generation
- End-to-End Project Execution
These projects showcase my ability to:
- Work with real-world datasets
- Apply statistical and analytical thinking
- Communicate findings clearly
- Translate data into actionable insights
- Write clean, well-documented code
Each project follows industry best practices including data validation, proper documentation, and reproducible analysis.
Bobbie Williams
Data Analyst | Data Science Certificate at University of Toronto (Expected Aug 2026)
π« Connect with me:
- GitHub: github.com/Bobbie101
These projects are available for educational and portfolio purposes.