This repository contains my pandas learning exercises, following the official pandas tutorial.
titanic_data.py
- Basic pandas DataFrame and Series operations- Practice with Titanic passenger data
- DataFrame creation, filtering, and statistical analysis
- Creating DataFrames from dictionaries
- Working with pandas Series
- Basic statistical operations (
.describe()
,.max()
,.mean()
) - Filtering data
- Understanding DataFrame vs Series
python titanic_data.py
The script demonstrates:
- DataFrame creation with passenger data
- Basic statistics on Age column
- Series creation and manipulation
- Comparison between DataFrame columns and standalone Series
- Load real CSV data
- Data cleaning and manipulation
- Advanced filtering and grouping
- Data visualization with matplotlib
- titanic_data.py - Basic pandas DataFrame and Series operations with sample data
- titanic.csv.py - Comprehensive real Titanic dataset analysis
This advanced analysis includes:
- Smart data loading (local → online fallback)
- Complete demographic survival analysis
- Age group categorization with pd.cut()
- Statistical summaries and missing data handling
- Real-world dataset exploration techniques
The script provides insights like:
- Overall survival rate: ~38.4%
- Female survival rate: ~74.2%
- Male survival rate: ~18.9%
- 1st class survival rate: ~62.9%
- 3rd class survival rate: ~24.2%