# Strategic Supply Chain Analytics Framework
## Freight Analysis Framework (FAF5.7) - Complete Sequential Analysis

### Analysis Objectives
This notebook analyzes supply chain resilience using FAF5.7 freight flows. It is designed to be **run sequentially from top to bottom** and covers five analyses:

1. **Risk Archetype Clustering**: Segment freight corridors by risk characteristics
2. **Nearshoring Analysis**: Evaluate regional vs. long-distance trade patterns
3. **Disruption Risk Assessment**: Identify vulnerable corridors and chokepoints
4. **Performance Forecasting**: Predict efficiency and volume patterns
5. **Strategic Momentum Tracking**: Monitor diversification and optimization progress
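
As a rough illustration of the first objective, the sketch below clusters corridors with `KMeans` and picks the cluster count by silhouette score. It runs on synthetic data, not FAF records, and the feature names `value_per_ton` and `avg_distance` are hypothetical placeholders rather than columns this notebook is known to build.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for corridor-level features (NOT FAF data).
# Column names are hypothetical and chosen only for illustration.
rng = np.random.default_rng(42)
corridors = pd.DataFrame({
    "value_per_ton": np.concatenate([rng.normal(500, 50, 60), rng.normal(5000, 400, 60)]),
    "avg_distance": np.concatenate([rng.normal(200, 30, 60), rng.normal(1500, 150, 60)]),
})

# Standardize so that no single feature dominates the Euclidean distance.
X = StandardScaler().fit_transform(corridors)

# Compare a few cluster counts by silhouette score (higher is better).
scores = {}
for k in range(2, 6):
    labels = KMeans(n_clusters=k, n_init=10, random_state=42).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

best_k = max(scores, key=scores.get)
final_model = KMeans(n_clusters=best_k, n_init=10, random_state=42)
corridors["risk_archetype"] = final_model.fit_predict(X)

print(scores)
print(f"Selected k = {best_k}")
```

Scaling before clustering matters here because the two synthetic features differ by an order of magnitude; without it, the larger-valued column would drive the cluster assignments almost entirely.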

### Key Findings Preview
- **Best Predictive Target**: Transportation efficiency (R² = 0.26)
- **Strongest Predictors**: Geographic location (40%) and commodity type (39%)
- **Data Coverage**: 1.2M freight flow records across 51 state-level regions (50 states plus the District of Columbia)
- **Recommended Focus**: Descriptive risk analysis and efficiency optimization
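
To put the 0.26 score in context: assuming it is a coefficient of determination (the notebook imports `r2_score`), it would mean the model explains roughly a quarter of the variance in the target, which is consistent with the recommendation to favor descriptive analysis. The helper `r_squared` below is an illustrative re-implementation of that definition on toy numbers, not code from this notebook.

```python
import numpy as np

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)         # unexplained variation
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)  # total variation around the mean
    return 1.0 - ss_res / ss_tot

# Toy values, chosen only to show the two reference points of the scale.
y_true = [2.0, 4.0, 6.0, 8.0]
perfect = r_squared(y_true, y_true)      # predictions match exactly
baseline = r_squared(y_true, [5.0] * 4)  # always predicting the mean
print(perfect, baseline)
```

A perfect model scores 1 and a model that always predicts the mean scores 0, so 0.26 sits much closer to the mean-only baseline than to a perfect fit.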

### Instructions
**Run all cells in order from top to bottom. Each cell depends only on the cells above it.**

In [None]:
# 📚 CELL 1: LIBRARY IMPORTS & SETUP
print("🚀 Setting up Strategic Supply Chain Analytics Framework...")

# Core data science libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import warnings
warnings.filterwarnings('ignore')

# Machine learning libraries
from sklearn.preprocessing import StandardScaler, LabelEncoder
from sklearn.ensemble import RandomForestRegressor, RandomForestClassifier
from sklearn.cluster import KMeans
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score, mean_absolute_error, silhouette_score

# Set display options
pd.set_option('display.max_columns', None)
pd.set_option('display.max_rows', 100)
plt.style.use('default')

print("✅ Libraries loaded successfully!")
print("📊 Ready for freight data analysis")


In [None]:
# 📊 CELL 2: DATA LOADING & INITIAL EXPLORATION
print("📊 Loading FAF5.7 freight data...")

# Load the dataset
df = pd.read_csv('FAF5.7_State.csv')

print(f"✅ Dataset loaded: {df.shape[0]:,} records, {df.shape[1]} columns")
print("📅 Data coverage: 2017-2030 (actual through 2023, forecasts 2024-2030)")
print(f"🗺️  Geographic scope: {df['dms_origst'].nunique()} origin states, {df['dms_destst'].nunique()} destination states")
print(f"🚛 Transport modes: {df['dms_mode'].nunique()} different modes")
print(f"📦 Commodities: {df['sctg2'].nunique()} commodity types")

# Display basic info
print("\n📋 Dataset Overview:")
print(df.head())

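print("\n📈 Key Value Metrics:")
# Assumption: per the FAF5 data dictionary, value_* columns are in millions of USD
# and tons_* columns are in thousands of tons; confirm against your file's metadata.
total_value_2023 = df['value_2023'].sum() / 1e3  # millions of USD -> billions
total_tons_2023 = df['tons_2023'].sum() / 1e3    # thousands of tons -> millions
print(f"Total 2023 Freight Value: ${total_value_2023:,.1f}B")
print(f"Total 2023 Freight Volume: {total_tons_2023:,.1f}M tons")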

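
Because the cell above indexes several columns by name, a quick schema check before any aggregation can surface a mismatched file early. The following is a minimal sketch: the helper `missing_columns` and the `REQUIRED_COLUMNS` list are illustrative additions, with the list assembled only from the column names used in Cell 2.

```python
import pandas as pd

# Columns that Cell 2 reads by name; extend this list as later cells require.
REQUIRED_COLUMNS = [
    "dms_origst", "dms_destst", "dms_mode", "sctg2", "value_2023", "tons_2023",
]

def missing_columns(frame, required=REQUIRED_COLUMNS):
    """Return the required column names that are absent from `frame`."""
    return [col for col in required if col not in frame.columns]

# Tiny hand-made frame standing in for the real file (illustrative values only).
demo = pd.DataFrame({
    "dms_origst": [1],
    "dms_destst": [2],
    "dms_mode": [1],
    "sctg2": [3],
    "value_2023": [10.0],
})

absent = missing_columns(demo)
if absent:
    print(f"Missing columns: {absent}")
```

On the real data, calling `missing_columns(df)` right after `pd.read_csv` and raising an error when the result is non-empty gives a clearer failure than a `KeyError` halfway through the cell.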